Gene TM1040_2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2820 
Symbol 
ID4076639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2984942 
End bp2985931 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content57% 
IMG OID638008148 
Product3-beta hydroxysteroid dehydrogenase/isomerase 
Protein accessionYP_614814 
Protein GI99082660 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0203208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.613644 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAC TGGTCACCAT TTATGGCGGT TCAGGGTTTG TCGGTCGCTA TATCGCGCGC 
CGCATGGCAA AAGAAGGCTG GCGCGTGCGC GTGGCCGTTC GTCGCCCGAA TGAGGCAATG
CACGTCAAAC CCTATGGTGT GCCGGGGCAG GTTGAGCCGG TTTTCTGCAA CATCCGCGAT
GACGCTTCTG TCGCTGCTGT TATGGCAGGC GCAGATGCGG TGGTAAATTG CGTGGGTGTT
CTGAACGAAG TCGGCAAAAA CACGTTCTCT GCGGTGCAGT CCGAAGGGGC TGGTCGCATC
GCGCGGATCG CGGCTGATAC AGGTGTTGAA CGTCTTGTTC ATGTTTCTGC GATTGGTGCA
GATGCTGATG GCGACAGCGC GTATGCCCGC ACCAAGGCCG AAGGGGAAGC TGCGGTGCTT
GAAGCTTTTC CCTCTGCAAT GATCCTGCGT CCCTCGATCA TCTTTGGCCC CGAAGACCAG
TTCTTTAATC GCTTTGCGAG CATGACGCGC TTTGGCCCCG TTCTGCCCAT CGCAGGAGGG
ACGACACGGT TTCAGCCGGT CTATGTCGAT GACGTCGCGA AAGCTGCTGT TGCGGGTCTG
ACTGGGCAGG CTGCTGCAGG AACCTATGAG CTTGGTGGCC CCGAGGTCAA AAGCTTTACA
GAGTTGATGT CGCAAATGCT TGATGTGATC CATCGCCGCC GTCTCGTTGT GTCGCTACCG
AATTTTGTCG CCCGCCTCAT GGCTTTTGGG TTCGATATGG CGCAGGCGGT GACCTTTGGC
CTGTTTACAA ACGGCCTGCT GACGCGCGAC CAACTAAAGA ACCTGCAAAA CGACAATGTG
GTCAGTGAAG GCGCCAAAGG TCTGGCAGAC CTCGGGATCG AACCGGTTAC CATGGGGTCC
GTTCTACCCG ACTATCTGTG GAAGTTCCGC CCATCCGGTC AGTACGACGA ATTGATGAAA
TCGGCCGGTA ACCTGCGCGG AGACATCTGA
 
Protein sequence
MSKLVTIYGG SGFVGRYIAR RMAKEGWRVR VAVRRPNEAM HVKPYGVPGQ VEPVFCNIRD 
DASVAAVMAG ADAVVNCVGV LNEVGKNTFS AVQSEGAGRI ARIAADTGVE RLVHVSAIGA
DADGDSAYAR TKAEGEAAVL EAFPSAMILR PSIIFGPEDQ FFNRFASMTR FGPVLPIAGG
TTRFQPVYVD DVAKAAVAGL TGQAAAGTYE LGGPEVKSFT ELMSQMLDVI HRRRLVVSLP
NFVARLMAFG FDMAQAVTFG LFTNGLLTRD QLKNLQNDNV VSEGAKGLAD LGIEPVTMGS
VLPDYLWKFR PSGQYDELMK SAGNLRGDI