Gene TM1040_3776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3776 
Symbol 
ID4074871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp22480 
End bp23481 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content54% 
IMG OID638004435 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_611171 
Protein GI99077912 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACCG CTCTCATCAC CGGCACGGCT GGCTTCATTG GCTACCATCT TGCGACTTAT 
CTTCTAGCCT CAGGCTGGCA GGTTGTAGGG CTCGATTGTC TTTCACCTTA TTATGACATC
GCCTTAAAAA GGCGCCGCCA CGCTATGCTG GAGGTCAACG ATAATTTCAT CCCTGTGATC
GGCAAGCTCG AAGATCCAGG GCGCTTAATG GGCCTACTCG CTGACCACAA ACCCAATGCG
GTGATCCATC TGGCCGCTCA AGCCGGAGTG CGCCATTCAA TTGACGCGCC GCGCGACTAT
CTCGAGGCCA ACCTGATAGG TACTTTTGAA GTGCTGGAAG CTGCCCGCGC GCATCCGCCC
GAGCATATAA TGATTGCCTC CACGTCTTCG GCTTATGGTG CCAATACCAA CATCCCTTTC
GATGAGCACC AGAAAGCAGA TCATCAAATG TCATTTTATG CCGCCACCAA AAAGGCAGGC
GAGACGATGG CTCATTCCTA TGCACACCTC TATGGTCTAC CAACCACGAT GTTCCGGTTC
TTCACGGTGT ACGGCCCCTG GGGTCGACCG GATATGGCGT TGTTCAAGTT CACCAAAGCG
ATAGAGGCCG GTGAGGCGAT CGATGTCTAT AACCATGGAC GCATGAGCCG AGACTTTACT
TATATCGATG ATTTGGTGGC GGGTATCACC GGACTGATTG AGGCAGTGCC CGGTGATACG
CCTGTCTCTA CGCAAGACAC CCTGAGCCCA GTTGCCCCTT TCAGGATCGT CAATATCGGG
GCCTCAAAAC CCACGCCGCT GATGGATTAT ATTGCTGCGC TAGAAACCGC GCTAGAGACC
ACCGCCCGAA AGAACTTGAT GGAGATGCAG CCAGGAGACG TGCCGGCAAC CTGGGCAGAC
ACCACTTTGT TGAGCCAGCT TACCGGCTAT GAGCCTCAGG TTAGTGTCGA AGAGGGTGTC
GCCCGTTTTG TCGCTTGGTA CCGAGGTTAT TATGCCAGCT GA
 
Protein sequence
MRTALITGTA GFIGYHLATY LLASGWQVVG LDCLSPYYDI ALKRRRHAML EVNDNFIPVI 
GKLEDPGRLM GLLADHKPNA VIHLAAQAGV RHSIDAPRDY LEANLIGTFE VLEAARAHPP
EHIMIASTSS AYGANTNIPF DEHQKADHQM SFYAATKKAG ETMAHSYAHL YGLPTTMFRF
FTVYGPWGRP DMALFKFTKA IEAGEAIDVY NHGRMSRDFT YIDDLVAGIT GLIEAVPGDT
PVSTQDTLSP VAPFRIVNIG ASKPTPLMDY IAALETALET TARKNLMEMQ PGDVPATWAD
TTLLSQLTGY EPQVSVEEGV ARFVAWYRGY YAS