Gene Francci3_2240 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2240 
Symbol 
ID3905008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2612561 
End bp2613628 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content65% 
IMG OID637879571 
Productzinc-binding alcohol dehydrogenase 
Protein accessionYP_481337 
Protein GI86740937 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0360217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCCA CCATGCAGGC ACTGGCGTTT CTCGGCATCG GCAAGGCCGG CGTCATCGAG 
AAGCCCATAC CGAAGCCAGG GCCAACTGAC GCGATCGTAC GGACAACATC GGCGCTTATC
TGCACCTCCG ATGTGCACAC CGTCCGGGGC GCCATTCCCG TTCCCGAAGG CCGCGCTCTC
GGGCACGAGG CGGTCGGTGT CGTCCACGAC CTGGGCGCCG CGGTCACTGG ATTCGAGGCC
GGTGAGCGGG TAGCGGTCGG GGCGCTTACG CCGTGCTTTC ACTGCGGCCC CTGCCAGCGG
GGTTTCAGTA CCCAGTGCCA GGGAATGCTC GGTGGGTACA AATTCACCAC GCAGCGCGAT
GGCAACATGG CGGAGTACTT TCTCGTCAAC AATGCGGCCG CCAACCTCGC TCGCATTCCG
GCCGACCTGC CCGACGAGAA AGCCGTCTAC GCGACCGACA TGCTCTCCAC CGGGTTCGGT
GGCGCGGAGA ACGCGCAGCT GCGGCTCGGT GAGTCCGTCG CGATCTTCGC TCAGGGGCCG
GTAGGGTTAT CCGCCACCAT CGGCTGCCGG CTGCTCGGCG CCGGACTGAT CATCGCCGTG
GAAGGACGGC CCGAACGGCA GGAGCTGGCA CGTCGATTCG GCGCGGACGT GGTCGTCGAC
CCCGCCGCTG GTGATGTGGT GAACCAGATC CTCGATCTCA CCGGCGGCGT CGGCGTGGAC
GGCGCGATCG AGGCGCTCGG TCATCCGCAG ACCTTCGAGG ACTGCATCCG GGTGACCAAA
CCCGGTGGCC GGATATCGAA TATCGGGTAT CACGGTGAGA ACCCGGCACC GCTGCAGATC
CCGTTGGAAC CGTTCGGCCT GGGTATGTCG GACAAGAAGA TCCTGACGTC GCTCTGCCCA
GGCGGAAGCG ATCGGCTCGA GCGAATCTTC ACCCTCATGC GTTCCGGCCG GGTGGATCCT
ACGCCGATGA CGACCCATGA GTTCGGGTTC GACGAGATCG AACGTGCCTT CAGCATGATG
GAAACCAAGG AGGACGGCGT CATCAAACCC CTCATCCGTT TCGCATAA
 
Protein sequence
MPATMQALAF LGIGKAGVIE KPIPKPGPTD AIVRTTSALI CTSDVHTVRG AIPVPEGRAL 
GHEAVGVVHD LGAAVTGFEA GERVAVGALT PCFHCGPCQR GFSTQCQGML GGYKFTTQRD
GNMAEYFLVN NAAANLARIP ADLPDEKAVY ATDMLSTGFG GAENAQLRLG ESVAIFAQGP
VGLSATIGCR LLGAGLIIAV EGRPERQELA RRFGADVVVD PAAGDVVNQI LDLTGGVGVD
GAIEALGHPQ TFEDCIRVTK PGGRISNIGY HGENPAPLQI PLEPFGLGMS DKKILTSLCP
GGSDRLERIF TLMRSGRVDP TPMTTHEFGF DEIERAFSMM ETKEDGVIKP LIRFA