Gene Franean1_2032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2032 
Symbol 
ID5670433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2441642 
End bp2442952 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content74% 
IMG OID641240953 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_001506375 
Protein GI158313867 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0304773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.779517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCCG GCGCCCGGGA CGCGGGCTGG AGCCTGGCCG ACGTGCCGGA CCAGCGGGGA 
CGGACCGCCG TCGTCACCGG TGCGAACAGC GGCATCGGCT TCGAGGCGGC GAAGGCGCTC
GCCGAGCGCG GCGCGACCGT CGTCCTGGCC TGCCGCGAGC CGGCCCGGGC CGAGCAGGCC
GCCGCACGCA TCCGGGCCGC CGCGCCGCGG GCGGACGTCC ACGTCGAGCC GCTCGACCTG
ATCTCGATGA CCTCCGTGCG CGCGGCCGCC GAACGGATCC GGGCCGCCCA CCCGCGGCTC
GACCTGCTCG TCAACAACGC CGGGGTGATG CTGCCGTCGC CCGGTCCCGG CGGTGCCGCC
CTCGGTGCCG GCTTTGGTGA CGACGGCCTC GGTGATCACG GCATCGAGCC GACGTTCGGG
GTCAACCACC TGGGGCACTT CGCGCTGACG GGCCTCCTGC TCGACCGCCT TCTGGCGACG
CCGCACTCGC GGATCGTGAC GGTCAGCAGC CTCGCCCACC TCGTCGGCCG CCTCGACCTC
GAGCAGCTGC GCCGCCTCCC GCCGCACGCC GACCGACCGG GGGGCAGCCC GCCCGTGGTC
GTGCCCGACC AGCGTGCCGG GTCGCCGTCC GAACCACCCC TGGCGTCGCC GGCCCGCCAG
CGTTCACAGC GGAGGTCAGA AACACAGCGG AGGTCAGAAA CACAGCGGAG GTCAGAAACA
CAGCGGGGGT CAGGAACACC GCGCGCGCAG GGCATGCAGC GCCTCCAGAG TCTTCAGGGC
ACGCGGGGCC CGCAGAGCCG GCGGCTGCCC CGGCAGCGCC TGCCGTATCC GACATCCAAG
CTGGCGAACC TGATGTTCAC CTTCGAGCTG CAGCGCCGGC TCGCGGCCAC CGGAGCGTCG
ACGATCGCGG TGGCCGCGCA CCCGGGCATC GCCCGCACGG GGCTCACCCG CAACCTCATG
CCCCCGGCGC GGGTGTTCAT GGGCCAGCGA CTGGCACCGC TGATGTCGTG GCTGGTCCAG
GAGCCGGAGA TCGGCGCGCT CGCCACCGTC CGGGCCGCCC TCGACCCGGC GGTGCACGCC
GGTGAGTACT ACGGACCGTC AGGGCTCTTC CGCAGCACCG GTCCGCCCGC TGTCGGGCGA
TCCAGCGCCC GCTCGCGCGA CGCCCGCCAG CAGCAGCGGC TGTGGGCGGA GTCCGAGCGG
CTCACCGGCG TCACTTACGA CTTCGATCCC TCCTCGGGGA CAGGGCCCGG CGGTGTCGGA
TCCTGGAGGA ACAACGGGAG AAGCGCGTCG CGGAGCGCGG AACTGACCTG A
 
Protein sequence
MRPGARDAGW SLADVPDQRG RTAVVTGANS GIGFEAAKAL AERGATVVLA CREPARAEQA 
AARIRAAAPR ADVHVEPLDL ISMTSVRAAA ERIRAAHPRL DLLVNNAGVM LPSPGPGGAA
LGAGFGDDGL GDHGIEPTFG VNHLGHFALT GLLLDRLLAT PHSRIVTVSS LAHLVGRLDL
EQLRRLPPHA DRPGGSPPVV VPDQRAGSPS EPPLASPARQ RSQRRSETQR RSETQRRSET
QRGSGTPRAQ GMQRLQSLQG TRGPQSRRLP RQRLPYPTSK LANLMFTFEL QRRLAATGAS
TIAVAAHPGI ARTGLTRNLM PPARVFMGQR LAPLMSWLVQ EPEIGALATV RAALDPAVHA
GEYYGPSGLF RSTGPPAVGR SSARSRDARQ QQRLWAESER LTGVTYDFDP SSGTGPGGVG
SWRNNGRSAS RSAELT