Gene TM1040_3486 details

Gene Information

Locus tag: TM1040_3486
Symbol:
ID: 4075126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 517580
End bp: 519646
Gene Length: 2067 bp
Protein Length: 688 aa
Translation table: 11
GC content: 63%
IMG OID: 638005001
Product: glycoside hydrolase, clan GH-D
Protein accession: YP_611720
Protein GI: 99078462
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
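
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions, though they are consistent with the values in this record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 517580
end_bp = 519646

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed gene length agrees with the listed 2067 bp and the protein length with the listed 688 aa.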


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG GGCTGCAACT CTGGCGGATC GATGCGCCGG GTCAAACGCT GGTGCTGTCG 
TCCGATGGCG GTCTGCCCGG CGCGCTCTAC TGGGGCCCTG CGCTGCAGCC GGAGACGGAT
CTGATCGCAC TGGCGCGCGC CGTTGAACAA GAGGTGACAG GCGGGATGAT CGACCGGCTG
CCGCCCTTGT CGCTCTGCCC GGAGGCCGGG CTCTCCTTTG AGGGTCAGCC GGGGCTGGTT
GCCTATCGTG ACGGAGCGCC GCTTTATCCA CGCTTTCGCC TCGAAGACAC CAACGGCGCG
CAGTTTACCT GCCGTGATCG CGCGCTAGGA CTGACGCTGT TCTTTGACTT CGAGGTGCGG
GGCGGGACGA TTGCGGCCTC GACCACGCTG ACCTCGGAGC AGGACATCAT CCTGCACCAT
CTCGCTGCGC CCGTTCTGCC GGGTCCTCAG ATGGGGCAGG AGATTGTCGA TGTCTGTGGC
CGCTGGATCG GTGAATTCCA ACTGCAACGC ACCCCATGGC GCGCGGGCAT TCACAAGCGC
GAGGCCCGTA CCGGGCGTTC GGGGCACGAG CACCCGCCCT TGGCCTATTT TCCCGAGGCG
GGCGCGACCA ACACCCACGG CACCGTTTTT GCCATGCATT ACGGCTGGTC CGGCGGGCAT
GTGATGCTGG CCGAAGAGCT GCCCTCTGGG CGGCGCCAGA TCCAGTGGGG CCATGCCACG
GGCACGCAAG GCGCAGGGAC AAAGTTCCGC AGCGCGCCCC TCTACCTGGC GGTGTCGCAA
GCAGGGTTCA ACGGTTGTGC AGTGGCCTTT CAGCGTCTGC TCAGCGATCA TGTGGTTGTC
TGGCCCAAGC CCGAAACTCC GCGTCCGGTG CATTACAACT GCTGGGAAGC GGTCTATTTC
GATCATGACT TCACTGTCCT GAGCGACATT GCCGAGCGCG CCGCGGCCCT GGGTGCAGAA
CGGTTTGTGC TTGATGATGG CTGGTTTGGC AGACGCGATG ATGACACCAC CTCCCTTGGG
GATTGGCAGA TCGACCGGCG CAAATGGCCC GAGGGGCTAG GCCCGCTCAT TGATCACGTC
GAACGCCTAG GGATGACATT TGGTCTCTGG GTCGAGCCGG AAATGGTGAA CCTCGACAGT
GATCTGGCGC GGGCACATCC CGACTGGGTC TTGGGGCCTG TGGATCAGAT CGAAGGCCGC
CAGCAGCGGG TGTTGGACCT GTCGCGCGAA GACGTGCGAG AGCATCTTTT TCAAGTGCTG
TCCGCGCTCT TGAGCGAGAA CCGCATCGAC TACCTGAAAT GGGACCACAA CCGCCTGTTG
CCGATTGCGG ATGCGGCCCA GACGGAGGGT ATCTATGCCC TGTTGCGGGA CCTTCGCAGC
GCGTTCCCTA AGGTCGAAAT CGAGAGCTGC GCCTCCGGCG GCGGGCGGAT CGACGCGGGT
ATCCTGAGCC TCACGCAGCG GGTTTGGCTC TCGGACAGCA ATGACGCGCT GGAGCGGATG
CGCATTCAGC ATGATGCGGC GCTCCTGTTG CCACTATCCG TGACGGGCTC TCACGTAGGG
CCGCGCACCT GCCATACCTC CGGGCGAACC ATCGATATCC ACATGCGCGC CTGGGTCGCC
GCCCAGCGGC ACATGGGGTT TGAGATGGAT CCGCGCGAGC TGACCCCGAC AGAGATCGAG
GCGCTGACGC GAGTCACAAC CTGGTGGAAA AACAATCGCA ACTGGCGCAT GCAGGCGGAT
ATCTTGCGGT TGGATGCGCC TGATCAAAGC GTGATCGCAG AGCAGCAGAT GGCCGCGGAT
GCCAGCCGCT TTGTGGTGTT TGCGGGCAAG GCACGGAGCG TGCATCAGAT CCTGCCGCGC
CCCTTGCGCC TGACAGAACT TGACCCCAAA GCCCAATACC GGATCGATCT CGTCAACCGC
GAGGCGTTGC ACCACCTGTC GCGCGGGCGC ACCGCGCTCA AGGACGGGCC GCTTACGCTC
AGCGGTGAGA TTCTGATGCA GCAGGGGCTG ACTCTGCCCT GGCAATTTCC TGAAACGATC
TGGGTGATCG AAGGAGAAAA ACTATGA
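
A GC content figure like the one in the Gene Information fields can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above. The function name `gc_fraction` is hypothetical and not part of any database tool.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks of the 10-base layout) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Usage: the first line of the gene sequence, copied from this record.
first_line = "ATGAGCGACG GGCTGCAACT CTGGCGGATC GATGCGCCGG GTCAAACGCT GGTGCTGTCG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 2067 bp sequence instead of the first line gives the gene-wide value for comparison with the listed GC content.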
 
Protein sequence
MSDGLQLWRI DAPGQTLVLS SDGGLPGALY WGPALQPETD LIALARAVEQ EVTGGMIDRL 
PPLSLCPEAG LSFEGQPGLV AYRDGAPLYP RFRLEDTNGA QFTCRDRALG LTLFFDFEVR
GGTIAASTTL TSEQDIILHH LAAPVLPGPQ MGQEIVDVCG RWIGEFQLQR TPWRAGIHKR
EARTGRSGHE HPPLAYFPEA GATNTHGTVF AMHYGWSGGH VMLAEELPSG RRQIQWGHAT
GTQGAGTKFR SAPLYLAVSQ AGFNGCAVAF QRLLSDHVVV WPKPETPRPV HYNCWEAVYF
DHDFTVLSDI AERAAALGAE RFVLDDGWFG RRDDDTTSLG DWQIDRRKWP EGLGPLIDHV
ERLGMTFGLW VEPEMVNLDS DLARAHPDWV LGPVDQIEGR QQRVLDLSRE DVREHLFQVL
SALLSENRID YLKWDHNRLL PIADAAQTEG IYALLRDLRS AFPKVEIESC ASGGGRIDAG
ILSLTQRVWL SDSNDALERM RIQHDAALLL PLSVTGSHVG PRTCHTSGRT IDIHMRAWVA
AQRHMGFEMD PRELTPTEIE ALTRVTTWWK NNRNWRMQAD ILRLDAPDQS VIAEQQMAAD
ASRFVVFAGK ARSVHQILPR PLRLTELDPK AQYRIDLVNR EALHHLSRGR TALKDGPLTL
SGEILMQQGL TLPWQFPETI WVIEGEKL
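
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). It assumes the input starts in frame at the first base and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces of the 10-base layout
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Usage: first and last lines of the gene sequence, copied from this record.
print(translate("ATGAGCGACG GGCTGCAACT CTGGCGGATC GATGCGCCGG GTCAAACGCT GGTGCTGTCG"))
print(translate("TGGGTGATCG AAGGAGAAAA ACTATGA"))
```

The last line is in frame on its own only because the preceding 34 lines of 60 bases each total 2040 bases, a multiple of three.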