Gene Information |
Locus tag | Dshi_1110 |
Symbol | tolA |
ID | 5711078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1134607 |
End bp | 1135662 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641267021 |
Product | hypothetical protein |
Protein accession | YP_001532453 |
Protein GI | 159043659 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
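The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is an illustrative check, not part of any database API: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end in a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1134607, 1135662)
print(length)                  # 1056
print(protein_length(length))  # 351
```

Both results match the Gene Length (1056 bp) and Protein Length (351 aa) fields listed in the record.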
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00592448 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.281955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTGCGCC TGCCGCATAT CGAGACCGGG ACCTATATCT CCGGCGCGGG GCATCTGGCG CTCTTGGCCT GGCTCGCCCT GGGCGGACTT TTTTACTCCG CGCCCGAATT GCCGGTGCCT TCGGCGGCTG ATGTCGTGTT GTTCAGCGAG GCGGAGTTTG CCGCGATGAC CCGGGCGCCG GAGGTTGCCG AGCCCGCGCC CGCGCCACCC CCCGTGCCCG CGCCGGCCCC AGAACCCGCC CCACCCCCCG AACCTGCGCC AGCGCCGGAA CCCGTCCCGC AGCCGGAACC CGAACCCGTG CCGATCCCCG AGCCGGAGCC CGCGCCGCCG CCCCCGGCAG AGCGCGTGGC ACCCGAGCCC GTGCCGCAGC CCGAGCCGGA GGCGCAGGTC GCGCCAGAAC GGCAGGAGGC CGTCACCCCC GACAATTCCG GGGCCGAGGC CGTGCCTGCG GAAGAGGCCA CGGCCCCCGA GGCCGCGACC ACACGCATTA TCACCGAGGC GACCGAGACC GATCCCGAAA GCCAGGCCCC GGACTTGATC GCCAGCCCGC GCCCCTCGGC GCGTCCCGAC CGCCCCCGGC CCGTGCCGGT GGAGGCCCCG CCCACACCCG AAGCGCCGCC GGAAACGCCC GCCGAGACTG CGCCGGACCC GGTCGCCGAT GCGGTCGCGG CTGCCGTGGC AGAGGCCGCG GAGACCCCGA GCGCGGTGAG CCGCCCGGAT GTGCCCGTCG GTCCGCCACT CACCGCGTCC GAACGGGACG GTCTGCGCGT CGCTGTTCAG CAATGCTGGA ACGTGGGTTC GTTGTCGTCG GATGCGCTGC GCACCACGGT CACGGTCGCA GTGGAGATGG AGCAATCGGG CCGCCCCGTG ATCAATTCCA TCCGCATGAT CGGCTCCGAA GGCGGCTCGG ACGCGGCGGC GCGCCAGGCC TTCGAGACGG CACGCCGGGC AATCATCCGC TGCGGCAGCG CCGGTTTCGA TCTGCCGGTT GAGAAATATG CCCAGTGGCG CGATATCGAG ATGACATTTA ACCCCGAAAG GATGCGGATC CGATGA
Protein sequence | MVRLPHIETG TYISGAGHLA LLAWLALGGL FYSAPELPVP SAADVVLFSE AEFAAMTRAP EVAEPAPAPP PVPAPAPEPA PPPEPAPAPE PVPQPEPEPV PIPEPEPAPP PPAERVAPEP VPQPEPEAQV APERQEAVTP DNSGAEAVPA EEATAPEAAT TRIITEATET DPESQAPDLI ASPRPSARPD RPRPVPVEAP PTPEAPPETP AETAPDPVAD AVAAAVAEAA ETPSAVSRPD VPVGPPLTAS ERDGLRVAVQ QCWNVGSLSS DALRTTVTVA VEMEQSGRPV INSIRMIGSE GGSDAAARQA FETARRAIIR CGSAGFDLPV EKYAQWRDIE MTFNPERMRI R
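The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a GC-percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the method the database itself uses to compute the GC content field is not stated in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = "ATGGTGCGCC TGCCGCATAT"
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 60.0
```

Passing the full gene sequence string to `gc_percent` gives a value that can be compared with the GC content field of the record.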