Gene TM1040_3690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3690 
Symbol 
ID4075659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp751404 
End bp752513 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content65% 
IMG OID638005210 
Productglycosyl transferase, group 1 
Protein accessionYP_611919 
Protein GI99078661 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.633764 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAT CCCGTATCGC CTTTTATGCG CCCATGAAGG CTCCGACTCA TCCTACGCCC 
TCTGGCGATC GCGCGATGGC TCAGAACCTG ATGGAGCTTT TGCAATTGGG CGGTGCGGAG
GTGATCCTCG CCTCAGAGCT GCGGCTTTAT GATAAACTGG GCGACCCCGC GCATCAGCAG
CTATTGCAGC GGCGCGCCGC TGATGAGGTC AGCCGTCTGG TCGAGGAGCT TCCGCCCGTG
GACGCTTGGG TGACCTATCA CAATTACTAC AAAGCCCCCG ATTTGCTCGG ACCCGCCGTG
GCCGAGGCCC GGGGCATTCC CTACGTGCAG ATCGAGAGCA CGCGCGCCAA GAAACGCCTG
AAGGGGCCTT GGGCCGCATT TGCGCAGGCC GCCCACGAGG CCGCTGATCA GGCCGCGGTG
ATCTTCTACC TCACGGACCA GGACCGACAG ACGTTGGAGC GCGATCGCGC AGGCGATCAA
CAGCTGGTGC ATCTGCGCCC GTTTCTGCCG CAGGATGTGC TGCCTCCGGC AAGGGCAGAG
TCAGACGACG CAGGCCGTAC ACTGCTGGCC GCTGGCATGA TGCGTCCGGG CGACAAACTG
GCGTCATATG CCCTTATTGC CGAGACGCTG CGCCACCTTG AGAAGACCGA GCGCGCGAGC
GACTGGCAGC TTTTGATTGC GGGCGACGGC CCCGCGCGCA CCGAGGTCGA CGCGCTCATG
GCGCCCTTTG GCGACCGGGT GCGTTTTCTC GGACAGCTCG GCCCCGAGGC CATGATAGAC
GCGTATCGTG CCGCAGATCT TTTCCTGTGG CCCGGCGTCA ACGAAGCCTT CGGGATGGTC
TATGTCGAGG CCCAATCTCA TGGTCTACCT GTGGTCGCTC AAGACAGGCC CGGCTTGCGG
GATGTGCTTT TGCCCGGGGA TTATCCCGCT CCAGATGCTG GCGCGCGCGC CCTTGCCGCC
CGTGTGGTGC ACCTGCTGGC GGATGCGTCT GAGCGCAAAG ACCTCGGGCG GCGGGCGCGG
GATCATATCG CCCGCCACCA CCTTCGCCCC GCCGCGTCCG CAACCCTCTG GGCGGCGCTC
AAGCCACTAT TTAGGGAACA CAGCGCATGA
 
Protein sequence
MSASRIAFYA PMKAPTHPTP SGDRAMAQNL MELLQLGGAE VILASELRLY DKLGDPAHQQ 
LLQRRAADEV SRLVEELPPV DAWVTYHNYY KAPDLLGPAV AEARGIPYVQ IESTRAKKRL
KGPWAAFAQA AHEAADQAAV IFYLTDQDRQ TLERDRAGDQ QLVHLRPFLP QDVLPPARAE
SDDAGRTLLA AGMMRPGDKL ASYALIAETL RHLEKTERAS DWQLLIAGDG PARTEVDALM
APFGDRVRFL GQLGPEAMID AYRAADLFLW PGVNEAFGMV YVEAQSHGLP VVAQDRPGLR
DVLLPGDYPA PDAGARALAA RVVHLLADAS ERKDLGRRAR DHIARHHLRP AASATLWAAL
KPLFREHSA