Gene TM1040_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0420 
Symbol 
ID4076180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp430753 
End bp431982 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content60% 
IMG OID638005715 
Productcell wall biogenesis glycosyltransferase 
Protein accessionYP_612415 
Protein GI99080261 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.414661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.295666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACT TCCACCAGAA TGGCAATATC GCCCAATTCC ACAACCTGCG CAGCCGCCCG 
GTCGAGGAGC TGGAGTATGA GCTCACTACC TTTGCCCAGA CCCGCAAGAT CTCCTTGATC
CTACCCTGCC TCTATTCCGA ACTCGAAGGC CCGGCCCTGC GCCCGATCCT CGACGAACTC
GCCCGTGTTC CCTACCTCCA TCACATCGTA ATCGGCTTGG ATCGCGCCAG CGAGGCCGAA
TACCGCCACG CCCGTACCTA TTTTGCCTCC CTGCCCCAAA GCCACCGCGT CCTGTGGAAT
GACAGCCCCC GCATGAAGCA GCTGGGCGCG CGTCTGGCTG CACAGGGGCT CGCGCCGCCC
GAGCCGGGCA AGGGCAAGAA TGTCTGGTCC TGCATCGGCT ATCTTCTGGC CCAGGCCGAA
AGCTCGGTGG TCGCGATCCA CGATTGCGAC ATCAGCACCT ACTCCAAGGA GATGCTGGCA
CGGCTGGTTT ATCCCGTGGC GCATCCCGCC TTCTCCTATC AGCTCTCAAA AGGGTTTTAT
GCGCGGGTGG GCGGCGGCAA ACTCAATGGT CGTGTCTCGC GCCTGCTGGT CTCGCCGCTG
TTGATCGCGC TGAAGCGCAC CATCGGAGAT CGCGACTATA TCGACTACCT GCGCAGCTTT
CGCTACCCGC TCTCGGGGGA GATGGCCCTG CGCGCACCAC TTTTGCCGGA CCTGCGCATC
CCGTCCGATT GGGGGCTGGA GATCGGCGTT CTGTCCGAAG CCTGGCGCAA TCTTGGCCGC
AAAGCCGTTT GTCAGGTGGA GATCGCCGAC AACTACGACC ACAAACACCA GTCCCTCAGC
CCCGAAGATG CCTCTGCGGG GCTCAACCGC ATGTCAACAG ACATCTGCAA GGCGATCTTT
CGCAAGCTCG CCGCCGACGG CACGGTCTTC ACCCCCAATG TCTTTCGCAC CCTCAAGGCC
ACCTATTATC GCTCGGCCCT TGATCTGCTG GAAGCCTATG CCCACGACGC CATGATGAAT
GGCCTCGCTC TGGATCGCCA CGGCGAAGAA AAAATGGTCG AAATGTTTGC AAACAACATC
ATGGCCGCCG GGCAGGTGTT TCTGGAAAAC CCCCATGAAA CCCCTTTCAT TCCAAACTGG
AACCGCATCC ACTCGGCAAA TCCGACCTTT CTGCAGGATC TCAAGGCTGC AGCGCTTGCG
GATGAGGCCG AATACAGCCC AAGTCCCTGA
 
Protein sequence
MADFHQNGNI AQFHNLRSRP VEELEYELTT FAQTRKISLI LPCLYSELEG PALRPILDEL 
ARVPYLHHIV IGLDRASEAE YRHARTYFAS LPQSHRVLWN DSPRMKQLGA RLAAQGLAPP
EPGKGKNVWS CIGYLLAQAE SSVVAIHDCD ISTYSKEMLA RLVYPVAHPA FSYQLSKGFY
ARVGGGKLNG RVSRLLVSPL LIALKRTIGD RDYIDYLRSF RYPLSGEMAL RAPLLPDLRI
PSDWGLEIGV LSEAWRNLGR KAVCQVEIAD NYDHKHQSLS PEDASAGLNR MSTDICKAIF
RKLAADGTVF TPNVFRTLKA TYYRSALDLL EAYAHDAMMN GLALDRHGEE KMVEMFANNI
MAAGQVFLEN PHETPFIPNW NRIHSANPTF LQDLKAAALA DEAEYSPSP