Gene TM1040_0548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0548 
Symbol 
ID4077195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp582746 
End bp585754 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content61% 
IMG OID638005845 
Productglycosyl transferase family protein 
Protein accessionYP_612543 
Protein GI99080389 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCCTGA ATGAGAGCTG CAATCAACTG GGGGTTCTGG TGCCGGACTG GCTCCGGCTC 
GAAACCACCC CCGAAGGGCC CGTGGTGAAG ATCGAATCCG AAGAAAGCCG CGCGCCTCTG
GTGGATTACC GCAGCACATC GGGCCATGCC CCAGATCTGA TGCCAGTGCT TGAAGTGGAC
ACTGGCAGCG ACAAGGCAGG TTTCTTGCGC AACCTCAGCA CGCCCGAGAC TGCTCAAAGC
GTAACGTCTC AGGTGCTTGC ACAGCTTCGA CCGCTCTATG CCCAAGGAAC CTGCATCAGC
ATTCCTGGGC TTGAGACCGC TGATCTCGAT GTGTTGCAGC CGTTTTTCAA AACCCTGACC
TCGAGCCTGA GAAGCGAAGG CCATAGCCCC TGCCTCATCC TGAGCGGCAC CTCGACGGCG
TGGCAATCGC GCGAAACCAC AGCGCTGTTT GATAAGGTCA TCCTCAAGCT CTTTCTCGAT
CCCTGGGTGG GCACCGCGCC TTCGCCGCTG GCCACGGACG CCTGGTTCGA GAAAACCGCC
AAGGCCGCAC TTCAGGAGAT TGGCAAAGAC AAACTGGTGA TCGCTCTTGG AACCTTCGCG
GTGGAGTGGG TGTCGGGTGA GCCGCTGCCC AAGGTTCTGC CCTATGCGGC GGCAATGGAG
AAAATTGCAG CCGCAGGCGC CGAGCTGCGC TTCAGCGAAA AGACCTCCGG ATCGCTTGCG
TCTTATCGCG ATCCCGAAGG GCGCCTCAAC AAGATCTGGA TGCAGGACGT CGCCAGCCTC
ATCAACCAGC TTGTCATCCT GCAGCAGCTT GAGATCCCCA ATAGCGCTGT CTGGTCGCTT
GGCCTCGAGG ATCCGGGTAT CTGGAGCGTG CTGCAAAACC GTGACCTGAG CCACGACGCG
CTGAGTGCAG ACTTGGCTCT GGTGAAGCTC GACTCATATG TGAGCTATCG CGGCGAAGGC
GCGTTGCTGC GCCTTCACCG CCGGCAGTCT CCCGGCATCC GCCAGATCGG GTTTGATACG
GAAACCGGGC GTGTAGTCTC GCAGAGCTAT GACCTGCTGC CGCGCCCCTA CGCGCTGGAG
CGCTATGGAA AACCGGCCGG CCGCAAGGTG GTTTTGACCT TCGACGATGG CCCACATCCG
GTGTTCTCCG AGCAGATCCT CGACATCCTG CAGGAAACCC AGACGCCCGC GACCTTCTTT
GTGACCGGCA AAAGCGTGAT GAACGCGCCC GAGGTTTTGA ACCGGATGAT CGACGAGGGA
CATGAAATCG GCGCACATAC GTTCTCTCAC CCCCGGATGG ATCAGGTCTC CAAGACCCGC
GCGACGCTTG AATACGCGAT GCTCGACAAG GTGGTGGCCG GGGCGGCAGG TCGGCAGTTG
ACCCTCTATC GTGAACCTTT CCAGCGCAGT GGCGGCCCGG TGACGGCCGA TCGCGTCGCC
GCGCTCGAGA TTGCTTGGGA TCGCGACATG CAAGTGGTCG GCATGGATGT GGTACCCCAC
GACTGGGCCG GATGGAGCGG CCGCGAGATT GCGGACTTTG CTATCGAGGA AGTCGAACGC
GGTGCAGGCA ACGTGATCTT GCTGCATGAC GGCGGCGAGG ATCGCACCGC TTCGGTCGAG
GCCACGCGGC TGATCATCAC CGAGCTCTCT GCCAAAGGCT ATGAGTTTAC CACCGTGGCC
GACTTGACCG GCAGCACCCG TGCAGCATTG ATGCCCGTGA CCGAGGGCGG TTATCAAACC
TTTGACCGTG TTTCCTTTTC TCTCGTCGCT TGGGGTCAGG ACGCCATCGT GATCCTGTTC
TGGCTGGCGC TCGGCATCGG GGTTGTGCGC TCTGTTGCGA TCTTGTTGCT CGCCGTCCTG
AATTGGCGCG GACATCGCAC CATCTCGCTG ACCACCCCAA AGGTGGCGGT GATCATCCCG
GCCCACAACG AGGAGAAGGT CATCCGCAGC TGCATCCAGA GCGTGCGGGC AAGCGACTAC
AAGAACCTCG AAATCATCGT GGTCGACGAT GGCTCCAGCG ACAATACGCT GAACGAGATC
TTTGCCTTTT CGCACATGCG CGAGGTCCGC CTGATCTCGC AGCCGAACCA GGGCAAATGG
AGTGCGCTGA ACCGGGCGCT GATGAACACA TCCGCCGAGA TTGTGGTCTG TATCGATGCA
GACACGCAGA TCGAGAAATC CGCCATTGGG CACATGGTCA GACATTTCGA CAACCCAAGG
ATCGGTGCGG TCGCGGGCAA GATCATCGCG GGCAACAAGG TGAACCTTCT GACCCGACTG
CAGGCGCTGG AATATACCAC CGCGCAGAAC GTTGAGCGCA AGGCCTTTGA TCTGATCAAC
GGCATGCTGG TGGTGCCCGG CGCCCTCGGT GCATGGCGCG TGGCTGCGCT GCGCAAGGCG
GGGCACTTCA GCGACGAGAC GATGACCGAA GATACCGACC TCACCATCGA GGTCAACCGT
GCAGGATACC GGATCGCATA TGAGCCGCTC GCCCGCGGCT ACACCGAGGT ACCCGAGCGC
ATTGGGCAGC TTTTGAAACA GCGCCTGCGC TGGTCGTTTG GCATGTTCCA AAGCGCATGG
AAGCACAAAA AAGCGATGTT CGAGGGGCGC TCTGTGGGGT TGATTTCGAT CCCTGACATG
TTCATCTTTG GCTATCTCTT CCCACTGCTG GCGCCGATTG CGGACCTCTT TGTCGCCATC
CTGCTTTACC AGATGGTCAG CGGCGGTTGG GACAGCGGGG CGGTTGGCGC GCAGAACATG
CAGTATCTCC TCGCCTACCT CACCCTACCC GCGCTCGAGT TCGTGATTGC CGCCTTTGCC
CTCGCACGGG ACAAGGATGA GAGCATGTGG TCGCTGTTGC TGTTCCCGGT CCAGCGGGTT
CTCTACCGGC CGATCCTCTA TTACTCCGTG ATCCGTGCGA TCCTGCGGGC CATCACGGGC
CGCCTGTTCA GCTGGGGTGC GCAGAAACGG CTGGGGCGTG ACTACAGCCT TGCGACGAGC
GGCACATGA
 
Protein sequence
MSLNESCNQL GVLVPDWLRL ETTPEGPVVK IESEESRAPL VDYRSTSGHA PDLMPVLEVD 
TGSDKAGFLR NLSTPETAQS VTSQVLAQLR PLYAQGTCIS IPGLETADLD VLQPFFKTLT
SSLRSEGHSP CLILSGTSTA WQSRETTALF DKVILKLFLD PWVGTAPSPL ATDAWFEKTA
KAALQEIGKD KLVIALGTFA VEWVSGEPLP KVLPYAAAME KIAAAGAELR FSEKTSGSLA
SYRDPEGRLN KIWMQDVASL INQLVILQQL EIPNSAVWSL GLEDPGIWSV LQNRDLSHDA
LSADLALVKL DSYVSYRGEG ALLRLHRRQS PGIRQIGFDT ETGRVVSQSY DLLPRPYALE
RYGKPAGRKV VLTFDDGPHP VFSEQILDIL QETQTPATFF VTGKSVMNAP EVLNRMIDEG
HEIGAHTFSH PRMDQVSKTR ATLEYAMLDK VVAGAAGRQL TLYREPFQRS GGPVTADRVA
ALEIAWDRDM QVVGMDVVPH DWAGWSGREI ADFAIEEVER GAGNVILLHD GGEDRTASVE
ATRLIITELS AKGYEFTTVA DLTGSTRAAL MPVTEGGYQT FDRVSFSLVA WGQDAIVILF
WLALGIGVVR SVAILLLAVL NWRGHRTISL TTPKVAVIIP AHNEEKVIRS CIQSVRASDY
KNLEIIVVDD GSSDNTLNEI FAFSHMREVR LISQPNQGKW SALNRALMNT SAEIVVCIDA
DTQIEKSAIG HMVRHFDNPR IGAVAGKIIA GNKVNLLTRL QALEYTTAQN VERKAFDLIN
GMLVVPGALG AWRVAALRKA GHFSDETMTE DTDLTIEVNR AGYRIAYEPL ARGYTEVPER
IGQLLKQRLR WSFGMFQSAW KHKKAMFEGR SVGLISIPDM FIFGYLFPLL APIADLFVAI
LLYQMVSGGW DSGAVGAQNM QYLLAYLTLP ALEFVIAAFA LARDKDESMW SLLLFPVQRV
LYRPILYYSV IRAILRAITG RLFSWGAQKR LGRDYSLATS GT