Gene RoseRS_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_4454 
Symbol 
ID5211439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp5584654 
End bp5585691 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content65% 
IMG OID640598033 
Productalcohol dehydrogenase 
Protein accessionYP_001278736 
Protein GI148658531 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.321717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAT CCACGATGGA CGCGCTGGTC TGGCTCGGAC CGCGCAGGAT GGAACTGCGT 
CGCGAGCCTG CGCCAACGCC GGAACCGGGT GAGGTGCTCG TGGCAGTGGA AGCGGTTGGC
ATTTGCGGGT CGGAACTGAG CGGGTACCTT GGCCAGAATA GTTTGCGAAA ACCACCGCTG
ATCATGGGGC ACGAAGCGGC GGGGCGAATC GCCTTCGACA GTGATGCCGC GCTGAGCGAC
GGGTCGCCAG CGCGCGCTGG CGTGCGCGTA ACCTTCAACC CGTTGCTGAC GTGCGGCGCA
TGTGATCGTT GCCGGGCGGG AAAGAGCAAC CTGTGCCGCA ACCGACAACT GATCAGCGCC
CATCGCCCGG GCGCATTCGC CACCTACGTG GCAGTGCCAG CAGATCTCTG CATCCCTCTG
CCCGATCACG TGTCGCTGAC GCTGGGATCG CTCACCGAAC CGCTGGCGTG CAGTGTCCGC
GCTGTAGCGC ACACCGGAAC GCCGGAGCGC CTGGCTATTC TTGGCGCCGG TCCGATCGGG
CTACTTTGCC TGGTTGCTGC GCGTGCCGCG GGGATCGAAC ACATCCTGAT GAGCGACGTC
TCCGATCGGC GACTGGCAGT GGCGCGCGCC TGGGGTGCAA CTGTAACCAT CAATGCACGT
CATAACGTCC TCAATGCAGT GCAGGCATTC GCTCCCGGCG GCGTCGATGC CGTCATCGAC
GCAGTGGGTC TCACCGTCAC CCGCGATCAG GCAGTGCGCG CCGTCACCCC TGGCGGACGT
GTTGTTTTCA TCGGGCTCCA CGAAGAAGAG TCGATGCTTC CTGCCAACTA CATTGTGCGC
CAGGAAATCA CCGTGACCGG CAGTTTCACC TACAGCGACG CCGATTTTGC GCGCGCGCTC
GCGCTGCTGG CAGAAGGGCG CGTTTCGCTC GACGGCGACT GGCTCGAAGA ACGACCACTG
GCGGCAGGAC CGGCAGCGTT CGAGGAATTG CTGGCAGGCG CAACACGCGC AGCGAAGATC
GTGCTGCGCG TCGCGTGA
 
Protein sequence
MMTSTMDALV WLGPRRMELR REPAPTPEPG EVLVAVEAVG ICGSELSGYL GQNSLRKPPL 
IMGHEAAGRI AFDSDAALSD GSPARAGVRV TFNPLLTCGA CDRCRAGKSN LCRNRQLISA
HRPGAFATYV AVPADLCIPL PDHVSLTLGS LTEPLACSVR AVAHTGTPER LAILGAGPIG
LLCLVAARAA GIEHILMSDV SDRRLAVARA WGATVTINAR HNVLNAVQAF APGGVDAVID
AVGLTVTRDQ AVRAVTPGGR VVFIGLHEEE SMLPANYIVR QEITVTGSFT YSDADFARAL
ALLAEGRVSL DGDWLEERPL AAGPAAFEEL LAGATRAAKI VLRVA