Gene Franean1_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1947 
Symbol 
ID5670348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2341986 
End bp2344784 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content75% 
IMG OID641240868 
ProductWD-40 repeat-containing serine/threonin protein kinase 
Protein accessionYP_001506290 
Protein GI158313782 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0876454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.992815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGAG CTACGCCACC ACCGGTGTCC GGGAACACCG CGCGCCCGTT GCGGTCCGAG 
GACCCGGTAC AGCTCGGCGC CTACCGGGTG GTGGGCCGGC TCGGCCAGGG CGGGATGGGC
GCCGTCTTCC TCGGGCAGGC GCCGGACGGC ACCGCCGTCG CCATCAAGGT GATCCGCCCC
GAGCTGGCCT CCCGGCCGGA GTTCCGCGCC CGCTTCGCCC GCGAGACCGA GAGCGCCCGC
CGGGTCCGCC GCTTCACCAC CGCGGCCGTG CTCGACGCCG ACCCGCACGG GCCGCAGCCG
TACCTGGTCA CGGAGTTCGT CGAGGGCCCG ACGCTCTCCC GCCACGTGGC CGCGCGCGGC
CCGATGCGGC CGGCCGATCT CGAACAGCTC GCGGTCAGCG TCGCCACCGC GCTGTCGGCC
ATCCACGCCG CCGGCATCGT GCACCGCGAC CTCACCCCGG CGAACGTGCT ACTCTCCCCG
GTCGGCCCGA AGGTGATCGA CTTCGGGCTG GCGCGTGAGT ACGACACGGT CAGCGACCTG
TCCCGCAACG TGAAGCAGGC GATCGGCACG CCCGGCTACA TGTCGCCGGA GCAGATCCTC
GACCTGCCGA TCACCGCCGC GGTCGACATC TTCGCCTGGG GATCAATCAT GATCTTCGCG
GCGACCGGGC ACCCGCCGTT CGGGCAAGGC CGGATGGAGG CCGTCCTCTA CCGCATCATC
AACGAGCAGC CGCAGCTCGA CGGCGTGACG GGCGAGCTGC GCGAGCTCGT CGAGCTCGCC
ATGCGTAAGG ACCCGACCAC CCGGCCGAGC GCCGAGGAGC TGCGCGCGTC GCTGATGGGC
GGGGTGGCTA TCCCCGACCG GAGCGCGGCG CCCGGGCCGC CGGGCGGCGC CGAAGGCACG
GCCGGGGCGC CCCGCGGCCG CCGCTGGTCC CGCGGCGGGC GCCGCGACCG CGCGCAGACA
GCCTCGGGAA CCGCCGCCGG CGCAGCCGCC GGAGCGGCCG CGAGCGGGCC GGTCGGACCA
GGCGACGTGT CCGGAGCAGG CGGAACGCCC GGAGCAGGTG CTGGCGGAGC AGGTGCTGGT
GGAGGCGGGA GCGCTGCCGG AAGCAGGAGC GGGGGCGCTG TCGGAGGCCG TGGCGGGAGC
GGAACCGCGG CGGCGGCGCT CGGGCCGCTG ACACACGCGC CGCGGGCCGG TGCCGGGCCC
GCGCTGGGCA CCCCACCGCC GGCGTCGTCG TTTCCGGGTG TCTCCTCACC GGGCTCCGCC
CAGCCCCGCA CACCGTCGGG ACCGATCTCC CAGCCGCCTC CATACCCGCT GTCACCCGCA
CCCCAAGGCC AGGGCGGCCA ACCGGCCGGC GGCCAGGGCA CCGGCGGGCC GGGGTGGTCG
GGGCCGGTGC CCGCGGGGCC GCTCGCGCGA CCAACCGAGT CGTCCGGGCC GGTGTCACCG
GCGCCGGGTG CCCCGGCCCC GGCCGGCTCC GGCGAAGGAG CGGGTGCCGA TCCGTCGCAG
GGCGCGGAGT CCGGCCACGG GTCCAGGCGC CGCACCATGG TCCTCGCCGG GTTCGCGGCG
CTACTCGTCG CCGCCGTCGT CGTACCGATC GTGACCCTCG GTGGCGGCGG TGACGACGGG
TCGAGCGCGG ACCGCGAGGC CATCGCGGCG CACCTCGCCG CGACCGCCGC GGCCAGCCGC
GCGCAGGACC CGGCGCTCGC CGCTCGGCTG AGCCTCGCCG CCTACCGGAT CGCGCCCGTC
CAGGCCGCCG AGGACGCCAT GGTCGCCTCG TTCGCAGGTG CGTCCGCGGT GCGCACCCCG
GCCTCGGACG TCCCCTACGG GGATATCGCC ATCAACCCCG CCGGCACCGT CCTCGCCGCC
ACCAGCGCGG ACGGCGTACT CCGGCTGTTT CGGCTGATCG ACGGCGGCGA ACCGGCGCTG
ATCAGCGAAC GCCGCTCGGA CGACCCGTCC GACGGCATCG CGTTCACCGG CGACGGCACG
CGGCTCGCGA CCGGCGGAAC GCAGAGCGCG GCCCGCCTGT GGCAGGTCAC CGATCCGGCG
AACCCCCAAC AGGTCGCCCA GCTGGACGGC CTGTCCCGTC CCGTTCACGT GGCGCTGTCC
GCGGACGGAT CGCTGCTGGC CGCGGCCGCC CAGGACGGCA CGTTCGGTCT GTGGAACGTC
TCGAACCCGG CCGCGCCCGC GATGCTGCGC CTCCAGCTCA CCACCGCCGT GATCACGGAC
ATGGCCCTGA CCCCGGACGG GAAGCTGCTG GCCACCGCGG GCATCGGTGG TGACGTCCAG
CTGTGGAACA TCACGGACCC TCGTAAACCG GTCCAGGCAG GGGTGGCGTC CGGCGCGGTC
GGCGCGGTGA ATGCCGTCAC CTTCAGCACA GATGGCCACC AGATGATCAC CGGTGGCGAC
GATCGCACCG TCCGTGTCTG GGATGTGCGC GACCCGATGG CCGCCCATAT CACCAGTGAG
CTGCACGGTC ACACGGCCCC GGTCAACGCC GTCGTGTTCG GTGCCGGCGG CCAGCCCGTC
AGCGGTGACC AGGCGGGCGT CGTCGCCTAC TGGGACACCT CGAGTGCGGC GCCGATGGTC
CAGGTGGGCA ATCTGAAGAG CTCAGTCCTC GCCCTGGCGA CGGACGCCGC GGATGACCGC
CTCGCACTGA GCACCGAGTC CGGGCAGGTC GCGGTGTGGT CGACGGACGC CGCGAAGCTC
ACGACGATCG CCTGCGCCGA CCCGGACGCC CGCATCAGCC GGGCCGAGTG GGAGCAGCGG
ATCAGCGAGC TCCCGTTCCG GGACCCGTGC ACCGTCTGA
 
Protein sequence
MTGATPPPVS GNTARPLRSE DPVQLGAYRV VGRLGQGGMG AVFLGQAPDG TAVAIKVIRP 
ELASRPEFRA RFARETESAR RVRRFTTAAV LDADPHGPQP YLVTEFVEGP TLSRHVAARG
PMRPADLEQL AVSVATALSA IHAAGIVHRD LTPANVLLSP VGPKVIDFGL AREYDTVSDL
SRNVKQAIGT PGYMSPEQIL DLPITAAVDI FAWGSIMIFA ATGHPPFGQG RMEAVLYRII
NEQPQLDGVT GELRELVELA MRKDPTTRPS AEELRASLMG GVAIPDRSAA PGPPGGAEGT
AGAPRGRRWS RGGRRDRAQT ASGTAAGAAA GAAASGPVGP GDVSGAGGTP GAGAGGAGAG
GGGSAAGSRS GGAVGGRGGS GTAAAALGPL THAPRAGAGP ALGTPPPASS FPGVSSPGSA
QPRTPSGPIS QPPPYPLSPA PQGQGGQPAG GQGTGGPGWS GPVPAGPLAR PTESSGPVSP
APGAPAPAGS GEGAGADPSQ GAESGHGSRR RTMVLAGFAA LLVAAVVVPI VTLGGGGDDG
SSADREAIAA HLAATAAASR AQDPALAARL SLAAYRIAPV QAAEDAMVAS FAGASAVRTP
ASDVPYGDIA INPAGTVLAA TSADGVLRLF RLIDGGEPAL ISERRSDDPS DGIAFTGDGT
RLATGGTQSA ARLWQVTDPA NPQQVAQLDG LSRPVHVALS ADGSLLAAAA QDGTFGLWNV
SNPAAPAMLR LQLTTAVITD MALTPDGKLL ATAGIGGDVQ LWNITDPRKP VQAGVASGAV
GAVNAVTFST DGHQMITGGD DRTVRVWDVR DPMAAHITSE LHGHTAPVNA VVFGAGGQPV
SGDQAGVVAY WDTSSAAPMV QVGNLKSSVL ALATDAADDR LALSTESGQV AVWSTDAAKL
TTIACADPDA RISRAEWEQR ISELPFRDPC TV