Gene TM1040_3833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3833 
Symbol 
ID4074983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp78296 
End bp79729 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content49% 
IMG OID638004491 
Productglycosyl transferase, group 1 
Protein accessionYP_611226 
Protein GI99077967 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.00330118 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.252696 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCT ACTATTTTGA TATCACGGAT ATCGTTCTCT ACGTCAAAAA AGAAACGACA 
ATATCTGGGA TACAGCGCGT TGCTCTGGAA GTTATTAGGC GTGCAGTAGG CAAGCTCGGA
TCTGAGCGCG TCAAGGTAGG CATGTGGGAC AAAGGTTCTC AAAGCTATCT GGCGATGGAT
GCCGATTTCC TTCTTGAACG GGAAGAATTT GACCCAGATT TTCTAGCTGC TGCTTTGCTG
GGGAAGGCTG TGCGCAAGGT GCAAACAGTA CCCAGTATTC TTACCCGGTA TCGCAACCAG
CGCGCGAAAT ACTTGTATCA TTGGCTGCGA GCTAACCTGG AAGAATACAA GCGAAACCAT
CGTTATTTTG AGCGACGCGG CACTACTTTA GAAGGCTGGG TGCTCGAGAA AGAAAGAGCC
CATAGAAACT CAAAGGAGCT GCCGGTTCTC GCGCCTGCAG CATCGGTACA AAAGCGGGTT
CCGCTCGAAG ACGTGGCGCA ATCAGGCGAC CGTCTGATTA TTTTAGGTGC GACTTGGGGG
CTGCAAGACC TCAACGACCA TCTTATTGAA TTAAAAGAGA AACTTGGAGT TGAAGTTGAT
CTCCTTATTC ACGATCTAAT CCCGCTGGTC GCGTCAGAAC ATCTTGCGGA TGATTTTTCT
GAGACATTCT ACCGCTGGCT GGAGGGATCT ACTCTCTATT GTTCGCGTTA TTTTGCAAAT
TCTCAGAACA CAGGGAAGGA TTTACGATGC TTCCTAAGCG AAATTAACTC CGACCTTCCA
ATCGATGTTG TGCCGTTGGC ACAAGCACTG GGAGATGGCC CTGCGGCCCT AGATACTCAA
AGCTTTAGAT CCAAACTGAA CGCGACAAAG GGTGTGCGCA GATCGTTTCT GAACATGACA
AAGGTACCGT ATGTTCTCGT TGTTGGTACG CTTGAAACAA GAAAGAATAT CTGGCGTTTA
GCTCAGGCCT GGCAGCAACT CATACAAGAT CCTCAAGTAG AACCTCCGCG CCTAATTTTT
GCGGGGAAGA AGGGGTGGTA CATCGATGAA TTTCTGGATT GGATGAAGGC GAGTGGAAAT
CTTGATGGCT GGATTAGTAT CGCGGATAGG CCGACTGACA AGGAATTAGC CTTCCTCTTC
CACAATTGCG TATTTACAGC AAACGTCTCC ACATATGAGG GATGGGGTTT GCCAGTGGGA
GAAGGCCTAA GCTTCGGCAA AACAGGTGTT GTCGCCGAAA ATTCCTCACT GACAGAGGTC
GGTGGAGACA TGGTCGAGTA CTGTGATGCT CATTCGATCA GTAGTATTCG AGACGCGTGT
AAGCGCTTGA TCATGGAAGA TGGTAGGCGA ACGGAGCTAG AGCAACGGAT TAAGAACACG
CAGTTACGTA GTTGGGATGA TGTGACCAAC GATTTGCTGG AGTACCTTAA CTAG
 
Protein sequence
MQTYYFDITD IVLYVKKETT ISGIQRVALE VIRRAVGKLG SERVKVGMWD KGSQSYLAMD 
ADFLLEREEF DPDFLAAALL GKAVRKVQTV PSILTRYRNQ RAKYLYHWLR ANLEEYKRNH
RYFERRGTTL EGWVLEKERA HRNSKELPVL APAASVQKRV PLEDVAQSGD RLIILGATWG
LQDLNDHLIE LKEKLGVEVD LLIHDLIPLV ASEHLADDFS ETFYRWLEGS TLYCSRYFAN
SQNTGKDLRC FLSEINSDLP IDVVPLAQAL GDGPAALDTQ SFRSKLNATK GVRRSFLNMT
KVPYVLVVGT LETRKNIWRL AQAWQQLIQD PQVEPPRLIF AGKKGWYIDE FLDWMKASGN
LDGWISIADR PTDKELAFLF HNCVFTANVS TYEGWGLPVG EGLSFGKTGV VAENSSLTEV
GGDMVEYCDA HSISSIRDAC KRLIMEDGRR TELEQRIKNT QLRSWDDVTN DLLEYLN