Gene Franean1_5569 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5569 
Symbol 
ID5675767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6749091 
End bp6750251 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content71% 
IMG OID641244423 
Producthypothetical protein 
Protein accessionYP_001509827 
Protein GI158317319 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.261658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGACA GGCGCCTGTT TCTGCGGTCC GCATTTGTCG GTGTGGGTGT AGCGACTCTT 
TCCGGTGTCG TTCTGGAGGC CGCTTCGGCG GTTCCGGCGC AGCCCGGGGC CAGCCCGTAC
GGATCCCTTC TGGCGGCCGA TGCGAACGGA GTGGCGCTGC CGACCGGATT CACCAGCAGA
ATTGTCGCCC GTTCCGGGCA GACAGTGCCC GGAACGAGCT ATGTCTGGCA TGCCGCACCG
GACGGCGGCG CCTGCTACCC GAACGGCTCG GGCTGGATGT ATGTCTCGAA CTCGGAGGTG
AGCGGCAGCG GCGGGGCGTC CGTGCTGCGC TTCCACTCGG CGGGCACCGT CACCTCCGCG
CAGCGGCTGC TCTCCGGCAC GAGCTCCAAC TGCGCCGGCG GCGCCACCCC GTGGGGGAGC
TGGCTGTCCT GCGAGGAGAC GTCCACCGGC CGGGTCTGGG AGACCTACCC GGCCACCGGC
GCCGCGGCCG TCGCCCGGCC CGCGATGGGC CGGTTCACCC ACGAGGCCGC GGCCTGCGAC
CCGGTCCGCC AGGTGATCTA CCTGACCGAG GACCGGACGG ACGGCTGCTT CTACCGCTTC
CGGCCAACGA CCTGGGGCAA CCTGGCGACG GGAACCCTCG AGGTCCTGTG CGCGTCGGCG
TCGGCCACCT CGGGCACCGC CACCTGGCAG ACCGTCCCCG ACCCGGACGG CTCGCCGACC
TCGACCCGCG CCCAGGTCTC CGCGGCGAAG CACTTCAACG GCGGCGAGGG CGTCTACTAC
GCGAACAACA CGGTCTGGTG GACCACCAAG GGCGACAACC GGGTCTGGAA GCTGAACTGC
GCCACCAACG CCTTCGAGCT CGCCTACGAC GACTCCCTGG TGGGCGGGAC CGCGCCGCTG
ACCGGCGTCG ACAACATCAC CGGCTCCAGC TACGGCGACC TCTACGTCGC CGAGGACGGC
GGCAACCTCG AGATCTGCGT CATCACGCCG GCCGCGGTCG TGGCGCCGAT CCTGCGCCTG
ACCGGGCACA ACTCGTCGGA GATCACCGGG CCGGCATTCT CCCCGGACGG CTCCCGGCTG
TACTTCTCCT CCCAGCGGGG CACCACCGGA TCGTCCTCCG GCGGCATCAC CTTCGAGGTC
CGCGGCCCCT TCCGCACCTG A
 
Protein sequence
MVDRRLFLRS AFVGVGVATL SGVVLEAASA VPAQPGASPY GSLLAADANG VALPTGFTSR 
IVARSGQTVP GTSYVWHAAP DGGACYPNGS GWMYVSNSEV SGSGGASVLR FHSAGTVTSA
QRLLSGTSSN CAGGATPWGS WLSCEETSTG RVWETYPATG AAAVARPAMG RFTHEAAACD
PVRQVIYLTE DRTDGCFYRF RPTTWGNLAT GTLEVLCASA SATSGTATWQ TVPDPDGSPT
STRAQVSAAK HFNGGEGVYY ANNTVWWTTK GDNRVWKLNC ATNAFELAYD DSLVGGTAPL
TGVDNITGSS YGDLYVAEDG GNLEICVITP AAVVAPILRL TGHNSSEITG PAFSPDGSRL
YFSSQRGTTG SSSGGITFEV RGPFRT