Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1957 |
Symbol | |
ID | 5712951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2051005 |
End bp | 2052432 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641267882 |
Product | hypothetical protein |
Protein accession | YP_001533299 |
Protein GI | 159044505 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00211883 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00150316 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCGATA CTCAGGAATG GACACCGAAC GTTCAGCAAG TTGGTGGCAA TCAGAACGAA GCCATGAACT ACGTCCGCAG TCTGCTGCGC GAGGGCCGCA CGGACGAGGC GCGGGCCGAG TTGCAGATGA TGATCGAGGC AGACCCCCAG GACACCCGGG CGATGATGGC CTTCGCGATG TCGCTGGTGC GCGAGCAGCG GCTCGAAGAG GCCGCGCCGT ATGTCGAGCG CGCGCTCGAG GTCGAGCCGG GCAATACCAC CGCCGCCCTG ATGGGCGCCC AGATCGGCGT GCGCGGCGGC AATGCCGAAT ATGCCGAGGC GCATTACCAC AAGGCGCTGC AGGCCGATCC GCGCAACATG CGCGCGCTGA TGGGGTTGGC ACGGCTGCAC GGCCAGAACC AGAAGCCCGA GGCGGCGATC GAGGTGCTGC AGACCGCGCT GGAGGTGGAT CCGCAATCCG CCCGTGTCCG GCGCCAGTTG GCCACACTGC TGCAGCGGAC CGAGAAGACC GAAGAGGCCA AGGCGCAGCT GCGCGCGGCG CTCACCGCGA ACCCGAACGA CCAGGGCGCG TCGGTGCAGC TGGCCAATAT CTGCATGCGC GCCGGGGATA CCGCCGAGGC GATCCAGGTG CTCGAGACCG CGCTCGAAGC CCAGCCCGAC AATCGTCGCC TGACCATGTC GCTGGGCCGG ATGCGCCTGC GGGCGGAGGA TTACGCCGGG GCCGAGGCGA CCTTGCGGCC CCTGACCACC GGGCAGCGCG GCGGCATGGC GCGGATCGCG CTGGTCCAGG CGCTGATCCC GCAGGGCAAG CTGACCGAGG CGCGGACGCT GCTGGCAAGC TCCTCGCGCG GGGCGCGGAC GCCCTCGCTG GTGCATCGTC TCTATGGCGA CGCGTTCGTG GCCGAAGAGA AATGGAGCGC GGCGGAGAAA TCCTATCGCG CGGCGGTCTC GGCCCTGCGC GAAGGCGGCG ACGAGATGCT GGCCAGGATC GACGCCCAGA AGGCCGCCAA CCCCAAGGCG ACGGGCGCGG ACCTGATGAA GATCTACACC GACGCGTTCG AGGCGCGCCG GGCCGAGCAG GTCGCGCAGC GCCAGGCCCA GGATCCGGCC GAGGCGCGCG AACGGCGCAG GGCCGCCCGG GCGGAGCGGC GCGACGGTCC CAATGCGGAG CGGCGTCGCC AGGTGTTGCA GCGGCTTGCC CAGCAGCGGC GCGCCAACAA TGCGACCGGC ACCGCGGCGG GGCCCCAGGC CGGCGGCGGA TTGCGCGCCC GCATCCAGGA GCGGCGCGCC CAGCAAGCGG CCGGGACCGC CCCGGCGGCC GCCGGCGGCG TGACCGGGGA GGTGATCCCG CCGCGCGGTG GCGGGGGCGG CCGGTTGCGC AACCTGATCG CCCGGCGGCG GGGCACCCCC CCGGGCGCCC AGTCCTGA
|
Protein sequence | MSDTQEWTPN VQQVGGNQNE AMNYVRSLLR EGRTDEARAE LQMMIEADPQ DTRAMMAFAM SLVREQRLEE AAPYVERALE VEPGNTTAAL MGAQIGVRGG NAEYAEAHYH KALQADPRNM RALMGLARLH GQNQKPEAAI EVLQTALEVD PQSARVRRQL ATLLQRTEKT EEAKAQLRAA LTANPNDQGA SVQLANICMR AGDTAEAIQV LETALEAQPD NRRLTMSLGR MRLRAEDYAG AEATLRPLTT GQRGGMARIA LVQALIPQGK LTEARTLLAS SSRGARTPSL VHRLYGDAFV AEEKWSAAEK SYRAAVSALR EGGDEMLARI DAQKAANPKA TGADLMKIYT DAFEARRAEQ VAQRQAQDPA EARERRRAAR AERRDGPNAE RRRQVLQRLA QQRRANNATG TAAGPQAGGG LRARIQERRA QQAAGTAPAA AGGVTGEVIP PRGGGGGRLR NLIARRRGTP PGAQS
|
| |