Gene Franean1_4688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4688 
Symbol 
ID5673030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5601800 
End bp5603362 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content71% 
IMG OID641243545 
Productserine/threonine protein kinase 
Protein accessionYP_001508961 
Protein GI158316453 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.414102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAAG GGCTGACGCT GCCTTCTGGC ATGGTGTCGT ACCCGGAGGG GCGCACGGTT 
CCCGCGGCCG AGACGTTCGT CCCCGGTGAA CCCCTGTTGC CGCACGAGCC GACCGCCATC
GGTCCGTACG TCCTGTTGAG CCGGCTGGGC GTGGGTGGGA TGGGCGCCGT CTACTACGCG
CGCGACCAGC GGGGGCGGCC GGTCGCGGTC AAGGTGATCA GGTCGGATCG GGCCAGGGAC
CAGGAGTTCC GCCGCCGTTT CCGTCGAGAG GTCGAGGCAG CTCGCAGCGT CGCCTCGTTC
TGCACCGCCG AGGTTCTCGA CGCCGACCCG GACGCGTTCG CGCCGTACCT GGTGACGGAG
TACATCGACG GCCTCCGGCT GGATCAGGCT GTCGCCGACA GGGGTCCGCT CGACTCGTCC
ACCCTTACCG GGCTGGCGGT CGGGGTCGCG ACGGCGTTGA CCGCGATCCA CCACGCCGGG
CTTGTGCATC GGGATCTCAA GCCCGGCAAC GTGATCCTGT CGCTGTCCGG TCCACGGGTC
ATCGACTTCG GTATCGCCCT GGCCTTGGAC AGCACCGGCG GTAGGCCGAC CGACTGGGGG
TTCGGCTCGG CCGGGTGGAT GGCACCCGAA CAGATCAACG GCCAGCCGAT CAGTGCGGCG
GCTGACGTGT TCGCCTGGGG TGTCCTCGTC GCGTACGCGG GCACCGGACG GCATCCCTTC
GGGGACGGCC ACGATGTCGG TCTCGCCCAC CGGATCACCA CGGCCGAACC GGACCTGACT
GGCCTTCCTC CGCAGGTGGA GGATCTCGTC CGCGATGCTC TGACGAAAGA GCCGGCCAGC
CGGCCTGATG CCCGGGGCCT GCTGCTGCGC CTGGTCGAGC GCCGGCCGGG GGAGCGGTCC
GCCGATCCGG CTGTCCGACT GCTCGGGCTC ACCGCGGAGC TCGCGCACTC CCCGGACGGG
TCCGCACCTG CGCGGCGGCG GGGCTGGGGT CGTGGCCGTG TTCTGCTGAC CGCCAGCCTG
ACGCTGGTAC TGGCGGCCCT CACCGTGATC GGGTCGGTGG CGGCGAACCG TGATGACGGG
ACGTCGACGC GACCGGCCGC GCCCACAGCC AGGTCGGGGG CATCCACGGC CAGGTCCGTG
GAATCCTCGT CCGCGCCGGT CTCCGGCCCC GGCACCCCAC CGAGTCCCGG CGGCTTCCGG
GACGGGCCGC TGCTGTTCGC CGTGGACGAT GTCGAATGTG GGGTCGAGCA GCTCGGCCTC
GGGTTCCTGG CCCGCCATCC GGACGGTCAG TTCTGCCTGG TCACCATGAC GGTTCGTAAC
ACCGGTGCAT CCTCGGGCGC CCTGGAGAAC GCCTACCAGT ACGCGTACGA CAGCACCGGC
GCCCGGCGCA CCGCCGACTA CCTGTCGCGT TTCTACCTGC CCGGCGAGAC GATCTGGAAT
CCCGCCGGAC CGGGTGCCTC CATCCACGGC ACGCTGGTTT TCGATATCCC GCGGGGCGCC
GCGTTGCAGC GGCTGGAGCT CCATGACAGC CCGACCGGCA GCGGCGTTCT CATTCCGTTG
TAG
 
Protein sequence
MDQGLTLPSG MVSYPEGRTV PAAETFVPGE PLLPHEPTAI GPYVLLSRLG VGGMGAVYYA 
RDQRGRPVAV KVIRSDRARD QEFRRRFRRE VEAARSVASF CTAEVLDADP DAFAPYLVTE
YIDGLRLDQA VADRGPLDSS TLTGLAVGVA TALTAIHHAG LVHRDLKPGN VILSLSGPRV
IDFGIALALD STGGRPTDWG FGSAGWMAPE QINGQPISAA ADVFAWGVLV AYAGTGRHPF
GDGHDVGLAH RITTAEPDLT GLPPQVEDLV RDALTKEPAS RPDARGLLLR LVERRPGERS
ADPAVRLLGL TAELAHSPDG SAPARRRGWG RGRVLLTASL TLVLAALTVI GSVAANRDDG
TSTRPAAPTA RSGASTARSV ESSSAPVSGP GTPPSPGGFR DGPLLFAVDD VECGVEQLGL
GFLARHPDGQ FCLVTMTVRN TGASSGALEN AYQYAYDSTG ARRTADYLSR FYLPGETIWN
PAGPGASIHG TLVFDIPRGA ALQRLELHDS PTGSGVLIPL