Gene Franean1_3198 details

Gene Information

Locus tag: Franean1_3198
Symbol:
ID: 5671574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3777263
End bp: 3778729
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 74%
IMG OID: 641242092
Product: mannitol dehydrogenase domain-containing protein
Protein accession: YP_001507512
Protein GI: 158315004
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0246] Mannitol-1-phosphate/altronate dehydrogenases
TIGRFAM ID:
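
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are inclusive, 1-based coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(3777263, 3778729)
print(length)                     # 1467
print(protein_length_aa(length))  # 488
```

Both values match the Gene Length and Protein Length fields of the record.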


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.247953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCATT CCGCGGCCGT GCCTGTGAAC CGCCCAGTGG CGGCGCCCGC CAGCGGCCCC 
GTGGCCCCGT CGCCGAGTCG TCCTGCGGCT GTCGCTCGAC CGCTGTCGCG GCCCGGTGGC
GACGGGCGGC CGGCGGCTCG GGTTCGCATT CTGCATCTCG GGCTCGGTAG CTTCTTCCGG
GCCCATCAGG CCTGGTACAC CGACCGGGCG CCGGACGCCC GGCAGTGGGG GATCGCGGCG
TTCTCCCACC GCCGCCGGGG CCTCGCGCAC GCGCTGACCG CCCAGGACGG GCTCTACACC
CTGCTGACGC GCGGTGCCGA AGGCGACCGC TTCGACGTGA TCAGCTCGCT TGCCGCCGCG
CACGCGGGCA GCGACCACGA CGCCTGGCTG GCGTACTGGC GGCAGCCGGC CCTCGCCGTC
GTCACCCTGA CGGTCACGGA GGCCGGTTAC GCCTGTGACC CCGACGGAGG TCTCGACCTG
GCGCGCGGGG ACGTCGCCGC CGACATCGCG GCGCTGCGCA CCGACCCCGG CGCCCTGGTG
ATGACGGCTC CCGCCCGCCT GCTCGCCGGA CTGCACGCGC GGGCCCGGGC CGGCCGCAGT
CCGGTCGCGA CAGTGCCGTG CGACAACCTG GCGGGCAACG GCGCTGTGGC CGGGCGCGCG
GTACGCGAGC TGGCCGCGGC GACCGGACGA CCGGAGCTCG TCGCCGCCGC GGACGGCGCC
TCGTGGGTGA CCACGATGGT CGACCGCATC ACCCCGGGCA CCACCGACGC CGACGGCGCC
GCGGTGCGCG CCGCGACCGG CCGCGACGAC GCCGTGCCGG TGGTCACCGA GCCGTTCTCG
GAATGGGTGC TCAGCGGTGA CTTCCCGGGT GGGCGGCCCG AGTGGGAACA CGCGGGTGCC
CGGTTCGTCG CCGACCTGAC GCCGTTCGAG AACCGCAAGC TGTGGCTGCT CAACGGCGCG
CATTCACTGC TGGCCTACGC CGGTCCCCGC CGCGGGCACG TCACGGTCGC GCAGGCCGTC
GCCGACCCCC GCTGCCGCGG GTGGCTGATC GAGTGGTGGG CGGAAGCCTC CCGCCACCTG
AGCATGACCA GGACCGAGCT CGAGTCCTAT CAGCATGCGT TGCTGGAACG CTTCGAGAAC
CCGCGGATTC GGCATCTGCT CGCTCAGATC GCCATGGATG GCTCTCTGAA GCTTCCGGTG
CGTATCCTGC CGGTCCTGCG CGCTGAACGT GCCCGCGGCG TGATGCCCCG GGCCGGAATC
CGAGTGATCG CCGCCTGGAT GCTGCATCTG CGGGAGGGGA CGGCGTCGGT ACGCGATGCT
GAGGCGGGCC GGTCGGTCGC CGCGGCCCGG GCTCCGCTGC CCGAAGCCGC CAGCCTGGTC
CTCGATCTTC TCGGCCCCGG ACTGGGCCGG GACGGGGAGC TGGTGGCGGC GTTGGCCGCA
CAGGTGACCG AGCTGGGCGA TCCCTGA
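
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any property such as GC content is recomputed. The following is a minimal sketch under that assumption; the helper names are hypothetical, and the stop codon set is the one used by translation table 11. The usage example runs only on the last printed line of the gene sequence, not on the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Last printed line of the gene sequence above.
last_line = clean_sequence("CAGGTGACCG AGCTGGGCGA TCCCTGA")
print(len(last_line))             # 27
print(ends_with_stop(last_line))  # True
```

Passing the complete cleaned gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.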
 
Protein sequence
MNHSAAVPVN RPVAAPASGP VAPSPSRPAA VARPLSRPGG DGRPAARVRI LHLGLGSFFR 
AHQAWYTDRA PDARQWGIAA FSHRRRGLAH ALTAQDGLYT LLTRGAEGDR FDVISSLAAA
HAGSDHDAWL AYWRQPALAV VTLTVTEAGY ACDPDGGLDL ARGDVAADIA ALRTDPGALV
MTAPARLLAG LHARARAGRS PVATVPCDNL AGNGAVAGRA VRELAAATGR PELVAAADGA
SWVTTMVDRI TPGTTDADGA AVRAATGRDD AVPVVTEPFS EWVLSGDFPG GRPEWEHAGA
RFVADLTPFE NRKLWLLNGA HSLLAYAGPR RGHVTVAQAV ADPRCRGWLI EWWAEASRHL
SMTRTELESY QHALLERFEN PRIRHLLAQI AMDGSLKLPV RILPVLRAER ARGVMPRAGI
RVIAAWMLHL REGTASVRDA EAGRSVAAAR APLPEAASLV LDLLGPGLGR DGELVAALAA
QVTELGDP