Gene TM1040_3692 details


Gene Information

Locus tag: TM1040_3692
Symbol:
ID: 4075661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 753091
End bp: 754230
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 64%
IMG OID: 638005212
Product: hypothetical protein
Protein accession: YP_611921
Protein GI: 99078663
COG category: [R] General function prediction only
COG ID: [COG4671] Predicted glycosyl transferase
TIGRFAM ID:
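
The coordinates and lengths listed above are consistent with one another, which a little arithmetic confirms. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 753091
end_bp = 754230

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1140, matching "Gene Length"
print(protein_length)  # 379, matching "Protein Length"
```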


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.833754
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTTC TGATTGCTGT GACGCATCTC TTGGGCACCG GGCATCTGTC GCGCGCCCTC 
ACATTGGGGC GGGCGTTTTC GCGACTGGGC CATGCCGTTA CCGTGATCTC TGGCGGGTTT
CCAGCGCCGC AACTCAGCCT TGAGAGCGTC CAGATCGAGC AATTGCCACC GCTGCGCTCT
GATGGGGTGG CGTTCACGCG CCTGCTGGGC GAAGGGGGCG AGGTCGCGGA TGAGGCCTAT
CTTGCCCGCA GGGTGCATCA GCTGGAAACC GTGGTTCAGG CTCGGGAACC CGATGTTCTG
ATCACGGAGC TTTACCCCTT TGGTCGCCGC GCTCTTAGGG CAGAGTTCCG CGCCCTCCTT
GAGGCTGCCA AGGCCCTGCC CCGCCCGCCC CTGATCCTGT CCTCGATCCG TGATATTCTT
GCCCCGCCGT CAAAGCCGCA AAAGGCCGTG GACGCCGATG CAATGATTGA GCGCTATTAC
GATGGCGTGC TCGTTCACTC CGACCCCAAG GCGACCCGGC TCGAGGTCAG CTGGCCTGTC
TCGGACATGC TCGCCGCCAA GCTGCATTAC ACCGGCTATG TCGCCCCACC AGCCGCAGCG
CCGCATCCCG ATGGGGTTGG CAAAGGCGAA ATCCTCGTCA GCGCTGGCGG TGGCAGCGTC
GGAGATGCAC TATATGCCTG CGCCATTGAG GCCGCCAAGG AGATGCCAGA CTATAGCTGG
CGCATTCTTG TCGGCGGCGC GGATGCGGCG GCGCGGATCG CAGAGTTGCA CGACCCAAGT
TCGCCCGCGA GTCTTGAGCC TGCCCGCTCT GACTTTCGCG CGATGCTGCC CCATGCCGCC
GCCTCCGTGA GCATGTGTGG CTACAATACC GCACTGGATT TGCTGCAATC GGGTACCCCA
GCGGTGCTCG TGCCCTTTGA TGCGGGCAAG GAGGTGGAGC AGACCCTGCG CGCCAAGAGC
CTGTCTCCGT TACCAGGTTT TGAGGTCGAA GCGGCGGCGA CACTCACGCC AGCCCGTCTC
GCGACAGCGC TGCGCCGCGT TATGCAGGAT ACGCAACGCA GCCTTGACGG CTTTGAATTT
GACGGAGCGG GTCAGAGTGT GGAGATTGCC GCAACGCTGC TGAGGGGGCA GCGCGCTTGA
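
The GC content reported for this gene is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_content` is a hypothetical helper, not part of any database tool, and it assumes the sequence is pasted as text with spaces and line breaks between the 10-base blocks, as shown above.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the spaces and newlines between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input: four of the eight bases are G or C.
print(gc_content("GGCC AATT"))  # 0.5
```

Passing the full gene sequence to the function and rounding to a whole percent should give a figure comparable to the GC content field.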
 
Protein sequence
MKVLIAVTHL LGTGHLSRAL TLGRAFSRLG HAVTVISGGF PAPQLSLESV QIEQLPPLRS 
DGVAFTRLLG EGGEVADEAY LARRVHQLET VVQAREPDVL ITELYPFGRR ALRAEFRALL
EAAKALPRPP LILSSIRDIL APPSKPQKAV DADAMIERYY DGVLVHSDPK ATRLEVSWPV
SDMLAAKLHY TGYVAPPAAA PHPDGVGKGE ILVSAGGGSV GDALYACAIE AAKEMPDYSW
RILVGGADAA ARIAELHDPS SPASLEPARS DFRAMLPHAA ASVSMCGYNT ALDLLQSGTP
AVLVPFDAGK EVEQTLRAKS LSPLPGFEVE AAATLTPARL ATALRRVMQD TQRSLDGFEF
DGAGQSVEIA ATLLRGQRA
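
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do this under stated assumptions: it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits, since this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; first base varies slowest)
# paired with the amino acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above (60 bases, 20 codons).
first_row = "ATGAAGGTTC TGATTGCTGT GACGCATCTC TTGGGCACCG GGCATCTGTC GCGCGCCCTC"
print(translate(first_row))  # MKVLIAVTHLLGTGHLSRAL
```

The printed residues match the first two 10-residue blocks of the protein sequence.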