Gene Francci3_2277 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2277 
Symbol 
ID3904811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2656954 
End bp2658102 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content77% 
IMG OID637879608 
Productzinc-binding alcohol dehydrogenase 
Protein accessionYP_481374 
Protein GI86740974 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000419237 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000159352 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCCGCG GGTTGGAGCT GTATGCGTCG GTGTCGCGGC AGGCGGCGTC GCGGCTGGTC 
GGGGCGGGGC CGTCGGGGTG GTCGGGGCTG GCCGGGGTGG CGGCGCCGCT GCGCTATGTC
GAGCATGGTG ATCCGGTGGT GCCGGGTCCG GGGTGGGTGA CGGTGCGGCC GCGGCTGGCG
GGGATCAGCG GGTCGGATCT GGCGTTGGTG ACGGGTCGGG TGTCGGCCTA TCTGACGGCG
ATGGTGGGTC TGCCGTTCGT GCCGGGTCAG GAGGTCGTCG CCGAGGTGCA GGAGTCGGTG
ACGTTGGACG ACGGGCGGGT GCTGGCGGCC GGGGACCGGG TCGTTGTCGA TCCGGCGCCG
GGCAGCGGTC CCGGGACCGG TGACGGCAGA GTCGGTGACG GCGGGTGGGC GCGGGGGGTG
GGCGGCGGTG GGTGGAGCCG GGTGATGCTC GCCCACCGTG GGCAGCTGTG CCCGGTGCCC
GCGACGTTGC CGGATGCCCG GGCGGTGCTC GTCGACCCGC TGGCCGCCGC GGTGCATGCC
GTGGACCGGG CGCGGGTCGG CGCCGGGCAG CGGGTCCTCG TCGTCGGGGC GGGGGCGGCC
GGGTTGTGCA CGGTGCTTGC GTTGCGGGCC CATACCGAGG CGGGGCAGGT GGCGGTGGTG
GCGAAGTATC CCCGCCAGGC GGAGCTGGCG CGCCGGTTCG GGGCGGACGT GGTGTTCGAC
CCGGACGGGG CGGTCGCCGG GGTGCGTCGG GCCACGCATG CGATGCGGGT GTCGCCGCGG
GCGGGTGGGG CGTTCCTGCT CGGCGGGGTG GATGTCGCGT TCGACGCGGC GGGGCGGGCG
TCGTCGTTGT CCACGGCGCT GCGCACGACC CGGGCGGGGG GGCGGGTGGT GCTTTCGGGT
GTGCCCACGG GCCGGGTGGA TCTGACGCCG CTGTGGGCGC GGGGGTTGGA GTTGGTGGGC
GCGGCGCGGC GCGGCGCGGC GCCTTCGGTG CTGGCCCGGG CGTTCGCGTT GGCCGCCGGC
GCCCCGTTGG ACGGGGTGGT CGCGGCGACG TATCCGCTGA CCCGCTGGCG GGAGGCGTTG
GAGCACGCGT TGTGCGCGGG TCGGCTCGGT GCGGTGCGGA TTGCGTTCGA TCCGACGGTG
GCGCTGTGA
 
Protein sequence
MTRGLELYAS VSRQAASRLV GAGPSGWSGL AGVAAPLRYV EHGDPVVPGP GWVTVRPRLA 
GISGSDLALV TGRVSAYLTA MVGLPFVPGQ EVVAEVQESV TLDDGRVLAA GDRVVVDPAP
GSGPGTGDGR VGDGGWARGV GGGGWSRVML AHRGQLCPVP ATLPDARAVL VDPLAAAVHA
VDRARVGAGQ RVLVVGAGAA GLCTVLALRA HTEAGQVAVV AKYPRQAELA RRFGADVVFD
PDGAVAGVRR ATHAMRVSPR AGGAFLLGGV DVAFDAAGRA SSLSTALRTT RAGGRVVLSG
VPTGRVDLTP LWARGLELVG AARRGAAPSV LARAFALAAG APLDGVVAAT YPLTRWREAL
EHALCAGRLG AVRIAFDPTV AL