Gene Franean1_0938 details


Gene Information

Locus tag: Franean1_0938
Symbol:
ID: 5669352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1097189
End bp: 1098820
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 75%
IMG OID: 641239865
Product: hypothetical protein
Protein accession: YP_001505300
Protein GI: 158312792
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0169298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.687676
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGACA TCCCACGGCG AGCGGTCGTC CGCACCGCCA AGCTCGCGAC CCTCCCGATC 
GGCATCGCCG GCCGGGCGAC ACTCGGCGTC GGAAAGCGCC TCGGCGGCAG GCCCGCCGAG
GCCGTCGCGA CCGAGCTGCA GCAGCGCACC GCGGCGCAGA TCTTCCGGGT GCTCGGCGAG
CTCAAGGGCG GGGCCATGAA GCTCGGGCAG GCGCTGTCCG TCTTCGAGGC CGCGCTGCCC
GACGAGGTCG CCGGCCCCTA CCGCGCCGCG CTGACGAAAC TGCAGGAGGC GGCGCCACCG
CTGCCCGCCG CCACGGTGCA CAAGGTCCTC GCCGAGGAGC TCGGCCCCGA ATGGCGGGCC
CTGTTCGCCA GCTTCGACGA CACACCCGCC GCCGCCGCGA GCATCGGCCA GGTGCACCGC
GCCGTGTGGG CGGACGGCCG GCCCGTCGCG GTGAAGATCC AGTACCCGGG GGCGGGCTCG
GCCCTGCTGG CTGATCTGAA CCAGCTGGGC CGCGCCGCGC GGCTGTTCGG CGCGCTGACG
CCCGGTCTCG ACATCAAGCC GCTGGTCGCC GAGCTCAAGG CACGCATCAC CGAGGAGCTC
GACTACCGGC TCGAGGCCGC CTGGCAGCGG GCGTTCGCGC AGGCCTACGC CGACGACCCG
GACATCGTCG TCCCCCGGCC GATCGCCGGG GCGGACCGGG TCCTGGTCAG CGAGTGGATC
GACGGGGTGC CGCTGTCGAC CATCATCGAC CGGGGGACGC AGGAGGAGCG GGACCGCGCC
GGCCTGCTCT TGGTCCGCTT CCTCTACTCC TGCCCCGGCC GGGCCGGGCT GCTCCACGCC
GACCCGCACC CGGGCAACTT CCGGCTCCTC GCGGACGGCC GGCTCGGTGT TCTCGACTTC
GGGGCGGTGA ACCGTCTGCC CGGCGGGCTG CCCGAGCCGA TCGGCCGGCT CGCCCGGCTG
ACCCTCGCCG GTGACGCCGA GGCCGTCGCC GAAGGGCTGC GGGCGGAGGG GTTCATCCCG
GAGGGCGCCG CCATCCCCGC GGAGGACCTG CTCGACTACC TGGCACCGAT GCTCGCGCCG
ATCACCGACG AGGAGTTCAC CTTCTCCCGC GACTGGCTGC GCGGGGAGGC GCTCCGCCTC
GGTGACTGGC GCTCCGCCGC GGCGCAGCTC GGCCGCCAGC TCAACCTGCC ACCGTCCTAC
CTGCTGATCC ACCGGGTGAC GCTCGGCGCG ATCGGGATCC TCTGCCAGCT CGGCAGCTCC
GGCCCGTTCC GGGCCGAGAT GGAGCGCTGG CAGCCCGGGT TCGCCCCGCC GCGCAGCGCC
GCCGCACGCC ACGCGGCAGC CGCGAACCGG CCGAACCGCC GCCTTCCCAG ACTCGACATC
GAGGACGGCA CCGGCGTCAT CCGTCCGCTC CCCGGACCGG TGGTCCTCGC CACCGCCCCC
GCGCAGCGCT CGGGGCGCTC GGGCCGCGCA CGCAGCCGCG TCCGCCCGCC GGAGCAGGCC
AGCACGCCGG AGCAGGCTGG TCCACCGGAG ACCAGCCGAC CGTCGCGGCA GGCCCGGCCA
GCTCCAGGAA ACCGACGCAA GCTCGAGAAA GAGGCCCAGC CCGAACCAGG GACCCAGCCC
GAACCCCGCT GA
 
Protein sequence
MSDIPRRAVV RTAKLATLPI GIAGRATLGV GKRLGGRPAE AVATELQQRT AAQIFRVLGE 
LKGGAMKLGQ ALSVFEAALP DEVAGPYRAA LTKLQEAAPP LPAATVHKVL AEELGPEWRA
LFASFDDTPA AAASIGQVHR AVWADGRPVA VKIQYPGAGS ALLADLNQLG RAARLFGALT
PGLDIKPLVA ELKARITEEL DYRLEAAWQR AFAQAYADDP DIVVPRPIAG ADRVLVSEWI
DGVPLSTIID RGTQEERDRA GLLLVRFLYS CPGRAGLLHA DPHPGNFRLL ADGRLGVLDF
GAVNRLPGGL PEPIGRLARL TLAGDAEAVA EGLRAEGFIP EGAAIPAEDL LDYLAPMLAP
ITDEEFTFSR DWLRGEALRL GDWRSAAAQL GRQLNLPPSY LLIHRVTLGA IGILCQLGSS
GPFRAEMERW QPGFAPPRSA AARHAAAANR PNRRLPRLDI EDGTGVIRPL PGPVVLATAP
AQRSGRSGRA RSRVRPPEQA STPEQAGPPE TSRPSRQARP APGNRRKLEK EAQPEPGTQP
EPR
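
Consistency check

The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace (the 10-base grouping) is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Gene Information table above.
length = gene_length(1097189, 1098820)
print(length)                  # 1632, matches "Gene Length"
print(protein_length(length))  # 543, matches "Protein Length"
```

Passing the full Gene sequence block to gc_fraction gives a value that can be compared against the reported GC content; the result is not stated here because it was not computed for this record.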