Gene Saro_2768 details

Gene Information

Locus tag: Saro_2768
Symbol:
ID: 3916928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2987418
End bp: 2988449
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 64%
IMG OID: 640445547
Product: alcohol dehydrogenase
Protein accession: YP_498038
Protein GI: 87200781
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:
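
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2987418
end_bp = 2988449
gene_length_bp = 1032
protein_length_aa = 343

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(span_bp % 3 == 0)                              # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1032 bp is 344 codons, one of which is the stop codon.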


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACGC CAACCATGAT CGCAGCCGTT GTTGAGGAAG CCAACGGGCC CTTCGTTCTT 
CGCAAGCTTG CGCGTCCGCA GCCGGCCCCT GGCCAGGTAC TTGTACAGAT TGAGGCAAGC
GGCACCAATC CGCTTGATGC CAAGATCCGG GCTGGCGAGG CACCGCATGC CCAGCAGCCT
CTACCCGCAA TCCTCGGAAT GGACCTTGCA GGAACCGTCG TCGCGGTCGG GCCGGAGGTG
GATAGTTTCC GCGTCGGCGA CGCTGTCTTC GGACTGACGG GTGGGGTCGG CGGACTGCAA
GGCACACACG CGCAGTTCGC GGCGGTGGAT GCGCGCTTGC TGGCATCCAA ACCGGCTGCC
CTGACCATGC GACAGGCGTC TGTTCTGCCG TTGGTCTTCA TCACCGCGTG GGAAGGTCTG
GTGGATCGCG CGCAGGTGCA GGATGGACAG ACGGTTCTGA TCCAGGGCGG CGGCGGCGGT
GTCGGCCATG TTGCCATACA GATCGCGCTT GCGCGGGGAG CCCGGGTGTT CGCAACCGCG
CGGGGCAGCG ATCTCGAGTA TGTCCGAGAC CTTGGCGCCA CCCCGATCGA CGCCTCGAGA
GAGCCCGAGG ATTACGCCGC CGAGCACACC GCAGGGCAGG GTTTCGACCT TGTCTACGAT
ACGCTCGGTG GCCCGGTACT CGACGCCTCG TTCAGTGCCG TGAAGCGGTT TGGGCACGTG
GTAAGCTGTC TCGGCTGGGG CACGCACAAG CTCGCCCCGC TCTCCTTCAA GCAGGCGACG
TATTCGGGCG TGTTCACGCT GCACACCCTG TTGGCAAACG AGGGTCTGGC CCACTTCGGC
GAGATGCTGA GAGAGGCTGA CGCGCTCGTT CAGACGGGCA AACTCGCCCC TCGTCTCGAT
CCACGGACCT TCTCCATCGC GGAAATCGGT TCTGCCTATG ACGCGGTCCT CGGTCGCAAC
GACGTGCCAC GGCAGCGAGG AAAGATCGCG ATCACGGTCG AACCGCAATT CAACCTTCAC
GAGCAGCGCT GA
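
The gene sequence above is displayed in blocks of 10 bases, so the spaces and line breaks have to be stripped before the sequence is measured. As a minimal sketch, the snippet below cleans the sequence and computes its length and GC fraction with a plain G+C count; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself rounds the GC content field is not stated in the record.

```python
# Gene sequence copied from the record. The blanks every 10 bases are display
# formatting only and are removed before counting.
GENE_SEQUENCE_GROUPED = """
ATGACCACGC CAACCATGAT CGCAGCCGTT GTTGAGGAAG CCAACGGGCC CTTCGTTCTT
CGCAAGCTTG CGCGTCCGCA GCCGGCCCCT GGCCAGGTAC TTGTACAGAT TGAGGCAAGC
GGCACCAATC CGCTTGATGC CAAGATCCGG GCTGGCGAGG CACCGCATGC CCAGCAGCCT
CTACCCGCAA TCCTCGGAAT GGACCTTGCA GGAACCGTCG TCGCGGTCGG GCCGGAGGTG
GATAGTTTCC GCGTCGGCGA CGCTGTCTTC GGACTGACGG GTGGGGTCGG CGGACTGCAA
GGCACACACG CGCAGTTCGC GGCGGTGGAT GCGCGCTTGC TGGCATCCAA ACCGGCTGCC
CTGACCATGC GACAGGCGTC TGTTCTGCCG TTGGTCTTCA TCACCGCGTG GGAAGGTCTG
GTGGATCGCG CGCAGGTGCA GGATGGACAG ACGGTTCTGA TCCAGGGCGG CGGCGGCGGT
GTCGGCCATG TTGCCATACA GATCGCGCTT GCGCGGGGAG CCCGGGTGTT CGCAACCGCG
CGGGGCAGCG ATCTCGAGTA TGTCCGAGAC CTTGGCGCCA CCCCGATCGA CGCCTCGAGA
GAGCCCGAGG ATTACGCCGC CGAGCACACC GCAGGGCAGG GTTTCGACCT TGTCTACGAT
ACGCTCGGTG GCCCGGTACT CGACGCCTCG TTCAGTGCCG TGAAGCGGTT TGGGCACGTG
GTAAGCTGTC TCGGCTGGGG CACGCACAAG CTCGCCCCGC TCTCCTTCAA GCAGGCGACG
TATTCGGGCG TGTTCACGCT GCACACCCTG TTGGCAAACG AGGGTCTGGC CCACTTCGGC
GAGATGCTGA GAGAGGCTGA CGCGCTCGTT CAGACGGGCA AACTCGCCCC TCGTCTCGAT
CCACGGACCT TCTCCATCGC GGAAATCGGT TCTGCCTATG ACGCGGTCCT CGGTCGCAAC
GACGTGCCAC GGCAGCGAGG AAAGATCGCG ATCACGGTCG AACCGCAATT CAACCTTCAC
GAGCAGCGCT GA
"""


def clean_sequence(grouped: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (plain count, no ambiguity codes)."""
    return (seq.count("G") + seq.count("C")) / len(seq)


gene = clean_sequence(GENE_SEQUENCE_GROUPED)
print(len(gene))                   # length in bp, to compare with Gene Length
print(f"{gc_fraction(gene):.1%}")  # GC content, to compare with the GC content field
```

The printed values can be compared against the Gene Length and GC content fields in the Gene Information section.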
 
Protein sequence
MTTPTMIAAV VEEANGPFVL RKLARPQPAP GQVLVQIEAS GTNPLDAKIR AGEAPHAQQP 
LPAILGMDLA GTVVAVGPEV DSFRVGDAVF GLTGGVGGLQ GTHAQFAAVD ARLLASKPAA
LTMRQASVLP LVFITAWEGL VDRAQVQDGQ TVLIQGGGGG VGHVAIQIAL ARGARVFATA
RGSDLEYVRD LGATPIDASR EPEDYAAEHT AGQGFDLVYD TLGGPVLDAS FSAVKRFGHV
VSCLGWGTHK LAPLSFKQAT YSGVFTLHTL LANEGLAHFG EMLREADALV QTGKLAPRLD
PRTFSIAEIG SAYDAVLGRN DVPRQRGKIA ITVEPQFNLH EQR
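
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on two excerpts only, the first 60 bases and the last 12 bases of the gene. It builds the standard codon table in NCBI order, on the understanding that translation table 11 assigns the same amino acid to every codon as the standard table and differs only in the permitted start codons. The `translate` helper is hypothetical and written for this illustration.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the gene sequence in this record.
first_60 = "ATGACCACGC CAACCATGAT CGCAGCCGTT GTTGAGGAAG CCAACGGGCC CTTCGTTCTT"
last_12 = "GAGCAGCGCT GA"

print(translate(first_60))  # MTTPTMIAAVVEEANGPFVL
print(translate(last_12))   # EQR*
```

The first excerpt reproduces the first 20 residues of the protein sequence, and the last excerpt reproduces its final three residues followed by the stop codon.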