Gene TM1040_1271 details

Gene Information

Locus tag: TM1040_1271
Symbol:
ID: 4077431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1369443
End bp: 1370549
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 62%
IMG OID: 638006579
Product: glucose sorbosone dehydrogenase
Protein accession: YP_613266
Protein GI: 99081112
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
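
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers introduced for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1369443, 1370549)
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```

With the values from this record, the computed lengths agree with the listed Gene Length (1107 bp) and Protein Length (368 aa).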


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTACGCC CGCTCGCCGT GATCGCCAGT TTGATGTTTA GTGCCGCGAT GGCCGTGCCC 
TCGCTGGCGC AGCAAATGGA CTCAAGCCAA GGCACGCTCC GCGTTGAAAA AATGGCCGAT
GGGTTTGATG TTCCTTGGGG TTTTACGTTT TTGCCCGGAC GCGCACTGCT GGTCACGGAG
CGCTCCGGAC AGCTTTGGTA TCTGAACGGC GCGCGACGCC AACAGGTCGA TGGCGTCCCT
GAGATCGCTG CGGACGGGCA GGGTGGCCTT CTGGACGTTG TTGCCGCGCG GGATTTCGTT
CAGAGCCGCA CCGTATATCT CACGTTTGCC CGTCCGCAGG GGCGCGGCGC GGGGACCGCT
GTGGCACGGG CTGAACTCTC TGAAGACGGC AGCCGATTCG ACTCGCTTGA GGTGATTTTT
GAAGCCACGC CAGGCGCGCG GGGCGGGCGG CACTTTGGCT CGCGGCTGGT CGAAGCCCCC
GACGGCAGTC TTTATGTCAG CCTCGGAGAA CGTGGTGACC GTCCCAGCGC ACAGGATCTG
TCACGCGAGC AGGGTTCGAT TATTCGCATC CTTCCGGATG GCAGCATTCC CTCGGACAAT
CCTTTTGTAA ATTCTGAGGA CGCGCGTCCG GCGATCTGGT CCTACGGCCA CCGCAACCCG
CAGGGCATGG CGCTTGATGC GGCCGGCGAC ATCTGGGCCG TTGAACATGG CGCGCGCGGC
GGCGATGAGA TCAACCGGAT CACACGGGGC GCCAACTATG GTTGGCCGGT CATTTCCTAC
GGGCGCCACT ATTCGGGGCT GAAGATCGGC GAGGGCACCG AAAAGCCGGG GCTGCAACAG
CCGGAGTGGT ATTGGGATCC CTCCATCGCG CCCTCGGGTA TGATGATCTA CTCGGGCAAG
CTCTGGCCCA ACTGGCGCGG AGACATCTTT GTGGGATCCC TGAAATTTGA TTACATCTCA
AGGCTCTCGG GGGCACCCCT GCAGGAGGTC GAGCAGATGA AATCGCCCGA AACCGCAAGG
GTGCGCGATA TCCGCGAAGC CCCCGATGGC AGCATCTGGT TTGCCTCGGA ATACGAGGGC
GCCCTCTTTC GGATCACCCC GAACTGA
 
Protein sequence
MLRPLAVIAS LMFSAAMAVP SLAQQMDSSQ GTLRVEKMAD GFDVPWGFTF LPGRALLVTE 
RSGQLWYLNG ARRQQVDGVP EIAADGQGGL LDVVAARDFV QSRTVYLTFA RPQGRGAGTA
VARAELSEDG SRFDSLEVIF EATPGARGGR HFGSRLVEAP DGSLYVSLGE RGDRPSAQDL
SREQGSIIRI LPDGSIPSDN PFVNSEDARP AIWSYGHRNP QGMALDAAGD IWAVEHGARG
GDEINRITRG ANYGWPVISY GRHYSGLKIG EGTEKPGLQQ PEWYWDPSIA PSGMMIYSGK
LWPNWRGDIF VGSLKFDYIS RLSGAPLQEV EQMKSPETAR VRDIREAPDG SIWFASEYEG
ALFRITPN
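
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; because this gene begins with ATG, alternative start codons are not handled here. `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Amino-acid assignments are those shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 27 bases of the gene sequence above.
    print(translate("ATGCTACGCC CGCTCGCCGT GATCGCCAGT"))
    print(translate("GCCCTCTTTC GGATCACCCC GAACTGA"))
```

The two fragments translate to the first ten and last eight residues of the protein sequence listed above (MLRPLAVIAS and ALFRITPN); the final TGA codon is the stop codon and contributes no residue.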