Gene TM1040_3858 details

Gene Information

Locus tag: TM1040_3858
Symbol:
ID: 4074921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008042
Strand:
Start bp: 109942
End bp: 111195
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 61%
IMG OID: 638004515
Product: glycosyl transferase, group 1
Protein accession: YP_611250
Protein GI: 99077991
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
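
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a little arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; the variable names are hypothetical.
start_bp = 109942
end_bp = 111195

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the reading frame covers the whole gene and ends in a stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1254 bp
print(protein_length, "aa")  # 417 aa
```

Both results match the Gene Length and Protein Length fields of the record.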


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC TTTTTGTGCA TCAAAACATG CCTGGCCAGT ATCGCGAGCT GGTGCGCTGG 
CTTGCGGCAC AGGGAGGGCA TGAGCTGGTG TTTCTGAGCC AGCGTGGGGG TGTAAAAATC
CCGGGTGTAC ATACGCTGGT CTATACGCCC TTTCATGTAC CCGCAAAGGA CGCTTTTGGT
CTGTCCAAGG ACTGGGAAGC CGCGGCAGGG GCGGGTTTTG GCGCCGTGCA AACGATGCAA
AAGTTTGAGC GCAAGCACGG GTTTCGCCCT GATATCATTC TTGGTCATAC CGGCTGGGGA
GAGCTCAGTT TTTTTAAGGA TCTGTGGCCG GATGTGCCGA TCCTGGGGTT CTTTGAATAT
TACTACAGCA TGCAAGGTGG CATGGTGGGC TTTGATCCCG AACAGCCTCC CGGACCCCAT
GCGCCCTACT TCAATCGCGC CCGCAATGTG GTGCCATGCC TCAATCGCGA TGTGGTGGAT
CTGGGGCATG TGCCGACGCT CTGGCAGCGC GATCGCTTTC CGGCGTCCTT TCACGACAAG
ATGTATGTCT GCCACGATGG CATCCGCGCG GATCGACTGC ATGCCAATCC CGAGGCGCAG
CTCTCGCTGG GCCGGTTGGA GCAGCCCATT TCGCGCGGCG ATGAGATCGT GACCTATGTG
GCACGCAATA TGGAGCGGGC GCGTGGCTTT CATATCATGA TGCGGGCCCT GCCGGCCATT
CAGGCCGCGC GGCCCAATGC GCGTATCCTG ATGATTGGTG GCAATGAGGC CTCTTACGGG
CGTGAAAGCG AACACCCCGG GGGGCTGCGC GGCGAGATGG AAGCGGAGGT CGGCCAGTAT
GTTGACTGGA GCCGGGTGCA TTTCCTGGGC CGGGTGCCCT ACGGGGATCT GTGCCAGATC
ATTCAGCTGT CACGCTGCCA CATCTACCTC ACCATGCCCT TTGTTCTGAG CTGGTCGCTC
TTGGAGGCAA TGGCGATGGA GGCGACGATC GTGGCCGCCG ATGTGGAGCC GGTGCGCGAG
GTGATCACCC ATGGCGACAC CGGGCTTTTG GTGGATTTCT TTGACCCCGA GGCGCTGGCG
GCCCAAGTGG CCGAGGTGCT GGCGCGCCCG CAGGATTTTG CCAGCCTTGG CGCGCGCGCC
CGGGCGCGGG TGATGCAGGA CTATGATTTC CTGACCCGCT GCCTGCCTGA GCACCTCAGC
CAGATCAACC GGCTGGTGCC CTCCGCGCGG CCGATCCCGC TCCCCGAAGG GTAG
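
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch, assuming plain Python with no external libraries; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAGATCC TTTTTGTGCA TCAAAACATG CCTGGCCAGT ATCGCGAGCT GGTGCGCTGG"

seq = clean_sequence(first_row)
print(len(seq))                       # 60
print(f"GC: {gc_fraction(seq):.1%}")  # GC of this row only, not of the whole gene
```

Passing the full sequence text to `clean_sequence` instead of `first_row` gives the value to compare with the GC content field of the record, which refers to the whole gene.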
 
Protein sequence
MKILFVHQNM PGQYRELVRW LAAQGGHELV FLSQRGGVKI PGVHTLVYTP FHVPAKDAFG 
LSKDWEAAAG AGFGAVQTMQ KFERKHGFRP DIILGHTGWG ELSFFKDLWP DVPILGFFEY
YYSMQGGMVG FDPEQPPGPH APYFNRARNV VPCLNRDVVD LGHVPTLWQR DRFPASFHDK
MYVCHDGIRA DRLHANPEAQ LSLGRLEQPI SRGDEIVTYV ARNMERARGF HIMMRALPAI
QAARPNARIL MIGGNEASYG RESEHPGGLR GEMEAEVGQY VDWSRVHFLG RVPYGDLCQI
IQLSRCHIYL TMPFVLSWSL LEAMAMEATI VAADVEPVRE VITHGDTGLL VDFFDPEALA
AQVAEVLARP QDFASLGARA RARVMQDYDF LTRCLPEHLS QINRLVPSAR PIPLPEG
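
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that translation, assuming plain Python; `translate` is a hypothetical helper, and the codon table is the standard set of amino acid assignments, which table 11 shares with the standard code (the two differ in which start codons are permitted). Only the first row of the gene sequence is translated here.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAGATCC TTTTTGTGCA TCAAAACATG CCTGGCCAGT ATCGCGAGCT GGTGCGCTGG"

peptide = translate(first_row)
print(peptide)  # MKILFVHQNMPGQYRELVRW
```

The output matches the first twenty residues of the protein sequence in the record (MKILFVHQNM PGQYRELVRW).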