Gene TM1040_3871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3871 
Symbol 
ID4074934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp125204 
End bp127015 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content55% 
IMG OID638004528 
ProductBeta-glucuronidase 
Protein accessionYP_611263 
Protein GI99078004 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.311387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAAC GTTCCGAACT CGCAGATAAC GCGCTGCAAG CTTTGCATGA CGAGGATTAC 
GACCGTCCTT TCAATACGCA AAACCTCAAC GCCGATACGC TGATATTCAC CCGAGGCCGC
CAAGGGCAGT CTTTGGATGG GGATTGGAAC TTCTGCGTCG ACCTTCTGGA CACCGGACTG
CGGCAAAAAT GGTTCGCCAT GACGCCAGAA ACTCCAGCGG ATCGAGTCGA ACCTTGGGAT
TATGACCCCT ACATGGGCGA CACGGTGCCT GTGCCCTCCT GCTGGCAGAT GCAAAAAGAG
AAGTGGTACT TTTTTGAGGG CTCTGCCTGG TATACGCGTC CGTTGACGCA CACCCCGACA
CCGGAGCGGC GGCAGGTGCT ACGCATCGGT GCTGCGCAAT ATGATTGTAA AGTATTTTTA
AATGGCGCGT TTCTCGGTAA CCACTACGGG GGCTCAACAC CATTCTGCGT GGAATTGACC
GAGGCCCTGA AGCCGGGACA AAATTGGCTG ATGCTCTGCG TCAACAACAC CCGCACGCTG
GACCGTGTTC CGATGCGGAA CACCGATTGG TTCAACTATG GCGGCGTCTA CCGAGAGGTA
ACGCTGTATG ATCTCCCGTC CGTTGTGATC CGCGATCTGT TTGTTCGCCT TGAGGGGAAT
GCAATCCGCG TATCGGCTGA GGTCGATGGC GAGTGCGCGT CAGCTCAGCT TTTGATTCCC
GAGTTGGGTA TCGATGTGGA GTTGCCCTTG ATCGCAGGAA AAGGTGATTT GACTCTTCCC
GCCTCACCCG AATTGTGGTC GCCGGACAAC CCTAAGCTCT ATGACGTTTC TCTGACTGCA
GGTGAGGATC AGGTACGCGA CCGGGTCGGG TTTCGCAGCA TCTCTCGCGA GGGAACCGAA
ATCCTGTTGA ACGGAACACC CCTCTTCCTG AGAGGCATTT CCGTGCATGA AGACGATGCA
ACGCAAGGAA AGGTCACCAG CGAGACCGAC ATCCGCCGCC GGTTTGCTCA CGCCAAAGAA
CTGGGATGTA ACTTCCTGCG CCTTGCCCAT TACCCACATC ACGAAAGAGC ATCCGAAATT
GCTGATGAAA TGGGTTTGAT GCTCTGGGAG GAGGTTCCGG TCTATTGGGC GATCGACTTT
GCCAACCCCG CAACTCGGCG GGATGCTGAG AACCAGTTGC GCGAACTCAT TCGCCGCGAT
CGAAACCGCG CTAGCGTGAT CATCTGGTCG GTGGGCAACG AAAATCCTGA CACCGATGCA
CGGCTTGATT TCATGCGCGG ACTCGCAGAT CTGGCCAAGA CCGAGGACCC AACCCGCCTG
ACATCTGCCG CCTGTCTGGT GAACCATACA AAGCTAAAGA TTGAAGACCG GCTTGCCGAA
TATATCGATA TCATTGGCTT GAACGAATAT TATGGCTGGT ACGAAGAGAA CTTTGACGAA
CTAAATGAGA TTGGACGCAA TTCTGCGCCT GACCGCCCGG TGGTCATCTC TGAAACCGGC
GCGGACGGCG ACAGCACTGA ACAGGGGCCA GAACGCGGAC TGTTCAGTCT CGCCTATCAG
GATGAGGTCT ATGCCAAACA GATCGCTACA CTTCGCACGC TGGACTATGT GAAAGGCATG
TCGCCTTGGA TCCTCTATGA TTTTCGAGTG GAACGACGAC AGGGAATCTT TCAGCGCGGC
TGGAACCGCA AGGGACTGAT CGCAGGTGAT AAGGCGACCA AAAAACCTGC ATTCCATCGG
TTGGCCGCCT ATTATGCCGA ACGCGCGGCA GTTGAGCACC AACCACCATT CGCTCCCGAG
GAGACAGCAT GA
 
Protein sequence
MLERSELADN ALQALHDEDY DRPFNTQNLN ADTLIFTRGR QGQSLDGDWN FCVDLLDTGL 
RQKWFAMTPE TPADRVEPWD YDPYMGDTVP VPSCWQMQKE KWYFFEGSAW YTRPLTHTPT
PERRQVLRIG AAQYDCKVFL NGAFLGNHYG GSTPFCVELT EALKPGQNWL MLCVNNTRTL
DRVPMRNTDW FNYGGVYREV TLYDLPSVVI RDLFVRLEGN AIRVSAEVDG ECASAQLLIP
ELGIDVELPL IAGKGDLTLP ASPELWSPDN PKLYDVSLTA GEDQVRDRVG FRSISREGTE
ILLNGTPLFL RGISVHEDDA TQGKVTSETD IRRRFAHAKE LGCNFLRLAH YPHHERASEI
ADEMGLMLWE EVPVYWAIDF ANPATRRDAE NQLRELIRRD RNRASVIIWS VGNENPDTDA
RLDFMRGLAD LAKTEDPTRL TSAACLVNHT KLKIEDRLAE YIDIIGLNEY YGWYEENFDE
LNEIGRNSAP DRPVVISETG ADGDSTEQGP ERGLFSLAYQ DEVYAKQIAT LRTLDYVKGM
SPWILYDFRV ERRQGIFQRG WNRKGLIAGD KATKKPAFHR LAAYYAERAA VEHQPPFAPE
ETA