Gene Saro_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3643 
Symbol 
ID5077791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp271827 
End bp272870 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content70% 
IMG OID640481366 
Productalcohol dehydrogenase 
Protein accessionYP_001166028 
Protein GI146275868 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.826426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGGAA CAGCCAAGAT GCGCCTCGCG CGCGTCCACG GGCCGGGCGA CGTGCGACTG 
GACGAAGTGC CGGTGCCGCA TTGCGGGCCC GGCGAGGCAC TGGTGCAAGT TGCCGCCTGC
GGGGTCTGCG GAAGCGACCT AGGCTATATC GCGCAGGGCG GACTGGGCGG CGTCGAGCCG
CTGTCCGCGC CGCTGCCGAT CGGGCACGAA TTTGCCGGAA CCGTCGTTGC GGTGGGCCAA
TGCGTGACGA GCGTGGCGCC GGGAATGCGC GTCGCGGTGA ACCCGGACCG CGCCTATATC
GGCGGTGGCG GGCCGGACGG TGCGATGGCG GCCTTCATTC GCGTGGCCGG AGCCGAAATC
GGCGAAACGC TGTTCCCCCT GCCCGACCAC CTGCCCTTTG CCGAAGCATC GCTGGCCGAA
CCGCTTTCGG TGGGCCTGCA CGGCCTGCGG GTGGCCGGAG CGAAAACGGA AGACCGGATC
GCGATCCTGG GCGCGGGCCC CATCGGGCTG TGCGCGCTGG TCATGGCACG ACACCTTGGG
GCGCGCGACG TGGCCATCTT CGACCGCGTG CCGGAACGGC TGGAGCGTGC CAGGGCGCTC
GGCGCGGGGC TGGCGGTGGA TGTAACGCAG GAGTCGCTGA CAGAAGCGTT GGCGCGGTTC
CACGGATCGG GCGAGCGGTT CGGGGCGAAG TTCGTGGGCA CGGACGTGTT CGTCGATTGC
GCCGGGTCGG CGGCGGCGCT GGAAGAAGTT GTCGCGGTGG CGAAGTACCG CGCGCGCATC
GCGGTCGTCG CGCTGCACCA CAAGCCACTC GCGCTCGACC TGTGGCGGAT GATGGCTAAC
GAGATCAGCC TTGCAGGCTC CATCGCCGAT GCGCGCTCCG AAGAGTTCGG CGAGGCAGTG
GCGATGCTGG CGGCGGGGGG GCAGGCGCTG TCCCCGCTCA TCAGCCACCG CTTCGACTTC
TCCCGCTTTC ACGAAGCCCT TGCCGTGGCC GCCGACGGCG CACGCGCGGC AAAGGTGATC
CTGACCTTTC CGGAGGCCGC ATGA
 
Protein sequence
MAGTAKMRLA RVHGPGDVRL DEVPVPHCGP GEALVQVAAC GVCGSDLGYI AQGGLGGVEP 
LSAPLPIGHE FAGTVVAVGQ CVTSVAPGMR VAVNPDRAYI GGGGPDGAMA AFIRVAGAEI
GETLFPLPDH LPFAEASLAE PLSVGLHGLR VAGAKTEDRI AILGAGPIGL CALVMARHLG
ARDVAIFDRV PERLERARAL GAGLAVDVTQ ESLTEALARF HGSGERFGAK FVGTDVFVDC
AGSAAALEEV VAVAKYRARI AVVALHHKPL ALDLWRMMAN EISLAGSIAD ARSEEFGEAV
AMLAAGGQAL SPLISHRFDF SRFHEALAVA ADGARAAKVI LTFPEAA