Gene Information |
Locus tag | TM1040_0420 |
Symbol | |
ID | 4076180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 430753 |
End bp | 431982 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005715 |
Product | cell wall biogenesis glycosyltransferase |
Protein accession | YP_612415 |
Protein GI | 99080261 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
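The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 430753
end_bp = 431982
reported_gene_length = 1230      # bp
reported_protein_length = 409    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.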
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.414661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.295666 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACT TCCACCAGAA TGGCAATATC GCCCAATTCC ACAACCTGCG CAGCCGCCCG GTCGAGGAGC TGGAGTATGA GCTCACTACC TTTGCCCAGA CCCGCAAGAT CTCCTTGATC CTACCCTGCC TCTATTCCGA ACTCGAAGGC CCGGCCCTGC GCCCGATCCT CGACGAACTC GCCCGTGTTC CCTACCTCCA TCACATCGTA ATCGGCTTGG ATCGCGCCAG CGAGGCCGAA TACCGCCACG CCCGTACCTA TTTTGCCTCC CTGCCCCAAA GCCACCGCGT CCTGTGGAAT GACAGCCCCC GCATGAAGCA GCTGGGCGCG CGTCTGGCTG CACAGGGGCT CGCGCCGCCC GAGCCGGGCA AGGGCAAGAA TGTCTGGTCC TGCATCGGCT ATCTTCTGGC CCAGGCCGAA AGCTCGGTGG TCGCGATCCA CGATTGCGAC ATCAGCACCT ACTCCAAGGA GATGCTGGCA CGGCTGGTTT ATCCCGTGGC GCATCCCGCC TTCTCCTATC AGCTCTCAAA AGGGTTTTAT GCGCGGGTGG GCGGCGGCAA ACTCAATGGT CGTGTCTCGC GCCTGCTGGT CTCGCCGCTG TTGATCGCGC TGAAGCGCAC CATCGGAGAT CGCGACTATA TCGACTACCT GCGCAGCTTT CGCTACCCGC TCTCGGGGGA GATGGCCCTG CGCGCACCAC TTTTGCCGGA CCTGCGCATC CCGTCCGATT GGGGGCTGGA GATCGGCGTT CTGTCCGAAG CCTGGCGCAA TCTTGGCCGC AAAGCCGTTT GTCAGGTGGA GATCGCCGAC AACTACGACC ACAAACACCA GTCCCTCAGC CCCGAAGATG CCTCTGCGGG GCTCAACCGC ATGTCAACAG ACATCTGCAA GGCGATCTTT CGCAAGCTCG CCGCCGACGG CACGGTCTTC ACCCCCAATG TCTTTCGCAC CCTCAAGGCC ACCTATTATC GCTCGGCCCT TGATCTGCTG GAAGCCTATG CCCACGACGC CATGATGAAT GGCCTCGCTC TGGATCGCCA CGGCGAAGAA AAAATGGTCG AAATGTTTGC AAACAACATC ATGGCCGCCG GGCAGGTGTT TCTGGAAAAC CCCCATGAAA CCCCTTTCAT TCCAAACTGG AACCGCATCC ACTCGGCAAA TCCGACCTTT CTGCAGGATC TCAAGGCTGC AGCGCTTGCG GATGAGGCCG AATACAGCCC AAGTCCCTGA
|
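The GC content field is presumably the percentage of G and C bases in the gene sequence. The sketch below shows one way to compute that percentage from a sequence written in space-separated blocks, as in this record. The function name gc_content is hypothetical, and the example input is only the first three blocks of the gene sequence, so its result is not expected to equal the value reported for the whole gene.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First three blocks of the gene sequence, used here only as sample input.
sample = "ATGGCTGACT TCCACCAGAA TGGCAATATC"
print(round(gc_content(sample), 1))
```

Passing the full Gene sequence field to the same function would give the figure to compare against the reported GC content.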
Protein sequence | MADFHQNGNI AQFHNLRSRP VEELEYELTT FAQTRKISLI LPCLYSELEG PALRPILDEL ARVPYLHHIV IGLDRASEAE YRHARTYFAS LPQSHRVLWN DSPRMKQLGA RLAAQGLAPP EPGKGKNVWS CIGYLLAQAE SSVVAIHDCD ISTYSKEMLA RLVYPVAHPA FSYQLSKGFY ARVGGGKLNG RVSRLLVSPL LIALKRTIGD RDYIDYLRSF RYPLSGEMAL RAPLLPDLRI PSDWGLEIGV LSEAWRNLGR KAVCQVEIAD NYDHKHQSLS PEDASAGLNR MSTDICKAIF RKLAADGTVF TPNVFRTLKA TYYRSALDLL EAYAHDAMMN GLALDRHGEE KMVEMFANNI MAAGQVFLEN PHETPFIPNW NRIHSANPTF LQDLKAAALA DEAEYSPSP
|
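The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. This is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two tables are understood to differ in permitted start codons, which does not matter here because this gene starts with ATG). The names CODON_TABLE and translate are hypothetical.

```python
# Codon table in the usual compact layout: bases ordered T, C, A, G,
# with the first base varying slowest and the third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # drop the spaces between blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":            # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)

# First and last three 10-base blocks of the gene sequence in this record.
# Both 30-base stretches are in frame because each starts at a multiple of 3.
first_blocks = "ATGGCTGACT TCCACCAGAA TGGCAATATC"
last_blocks = "GATGAGGCCG AATACAGCCC AAGTCCCTGA"

print(translate(first_blocks))  # MADFHQNGNI
print(translate(last_blocks))   # DEAEYSPSP
```

The two outputs correspond to the first block and the final residues of the Protein sequence field, and the last codon, TGA, is the stop codon.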