Gene Franean1_6137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6137 
Symbol 
ID5674458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7466903 
End bp7467943 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content70% 
IMG OID641244989 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001510387 
Protein GI158317879 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.206898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.924748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCAC GTCGCGTGCT TGTGACAGGG GTGTCGCGTC CGCTGGGCGC CGAGGTCGCT 
GCCGCACTCG CCGCCGACCC CGAGATCGTC GATGTGGTCG GGGTCGACAC CATCGCCCCG
ACGGCGGATC TCGGCCGGAC CCAGTTCGTC CGGGTGGACA TCCGAAACCC GCTGATCGCG
AAGGTCATCT CCACCGCCGC CATCGACACC GTCCTGCATC TGAGTGTGCT CGCCACGCCG
CTCGGCGCCG GCGGGCGCAC GGCGATGAAG GAGATCAACG TCATCGGGAC GATGCAGCTC
CTCGCCGCCT GCCAGAAGAC CCCCGGGGTG AAGAAACTGG TGGTGAAGTC GACGACGTCG
ATCTACGGCT CGTCGCCGCG CGACCCCGCG CTGTTCACCG AGGAGATGGA ACCGCGCGGC
CTTCCCGGCG GCGGTTACGC CAAGGATGCC GTCGAGGTCG AGGGGTACGT CCGAGGCTTC
GGCCGGAGGC GCCCCGACAT CGCGGTGACG GTGCTGCGCC TGGCGAACGT GCTCGGCCCG
CGAGTGGACA GCCCGCTCGC GCGGTACCTC GACCTTCCGC TGGTCCCGAC GGTCCTCGGC
TTCGACCCAC GGATTCAGCT GCTGCACTCC GACGACGCGA TGGCGGTACT CCTGAAGGCC
ACCCGGGAGA CCCACGCCGG CACCTTCAAC GTGGCCGGGG ACGGCGTGCT CCTGCTCTCG
CAGGCGATCC GGCGGGCCGG GCGGCCGGCG CTGCCAGTGC CCTTCCCCGC GATCGGGTCG
TTGGGCAACA TCGCCCGCCG GCTGCGGCTC GTCGACTTCT CCTCCGAGCA GCTCGGGTTC
CTCGCGCACG GGCGCGCGGT GGACACCACC AAGCTCAAAG AGGTGTTCGG ATACGTTCCT
CAGTACACGA CCGTCGCGAC GTTCGACAGT TTCGTTCAGG ACCGCGGACT GCGGTTCACC
ATCGACCACG AGCTGGTCTC GCGGGTGGAG CACGGCCTTC AGGGCGCGCT CGCCCGACGC
CGGCTGCTCG GCACGTCCTG A
 
Protein sequence
MRPRRVLVTG VSRPLGAEVA AALAADPEIV DVVGVDTIAP TADLGRTQFV RVDIRNPLIA 
KVISTAAIDT VLHLSVLATP LGAGGRTAMK EINVIGTMQL LAACQKTPGV KKLVVKSTTS
IYGSSPRDPA LFTEEMEPRG LPGGGYAKDA VEVEGYVRGF GRRRPDIAVT VLRLANVLGP
RVDSPLARYL DLPLVPTVLG FDPRIQLLHS DDAMAVLLKA TRETHAGTFN VAGDGVLLLS
QAIRRAGRPA LPVPFPAIGS LGNIARRLRL VDFSSEQLGF LAHGRAVDTT KLKEVFGYVP
QYTTVATFDS FVQDRGLRFT IDHELVSRVE HGLQGALARR RLLGTS