Gene Dshi_1957 details


Gene Information

Locus tag: Dshi_1957
Symbol:
ID: 5712951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2051005
End bp: 2052432
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 72%
IMG OID: 641267882
Product: hypothetical protein
Protein accession: YP_001533299
Protein GI: 159044505
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
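
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2051005
END_BP = 2052432
STOP_CODON_BP = 3  # assumption: the CDS ends with exactly one stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_BP) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1428
print(protein_length(length_bp))  # 475
```

Under these assumptions both results agree with the Gene Length (1428 bp) and Protein Length (475 aa) fields.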


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00211883
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.00150316
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCGATA CTCAGGAATG GACACCGAAC GTTCAGCAAG TTGGTGGCAA TCAGAACGAA 
GCCATGAACT ACGTCCGCAG TCTGCTGCGC GAGGGCCGCA CGGACGAGGC GCGGGCCGAG
TTGCAGATGA TGATCGAGGC AGACCCCCAG GACACCCGGG CGATGATGGC CTTCGCGATG
TCGCTGGTGC GCGAGCAGCG GCTCGAAGAG GCCGCGCCGT ATGTCGAGCG CGCGCTCGAG
GTCGAGCCGG GCAATACCAC CGCCGCCCTG ATGGGCGCCC AGATCGGCGT GCGCGGCGGC
AATGCCGAAT ATGCCGAGGC GCATTACCAC AAGGCGCTGC AGGCCGATCC GCGCAACATG
CGCGCGCTGA TGGGGTTGGC ACGGCTGCAC GGCCAGAACC AGAAGCCCGA GGCGGCGATC
GAGGTGCTGC AGACCGCGCT GGAGGTGGAT CCGCAATCCG CCCGTGTCCG GCGCCAGTTG
GCCACACTGC TGCAGCGGAC CGAGAAGACC GAAGAGGCCA AGGCGCAGCT GCGCGCGGCG
CTCACCGCGA ACCCGAACGA CCAGGGCGCG TCGGTGCAGC TGGCCAATAT CTGCATGCGC
GCCGGGGATA CCGCCGAGGC GATCCAGGTG CTCGAGACCG CGCTCGAAGC CCAGCCCGAC
AATCGTCGCC TGACCATGTC GCTGGGCCGG ATGCGCCTGC GGGCGGAGGA TTACGCCGGG
GCCGAGGCGA CCTTGCGGCC CCTGACCACC GGGCAGCGCG GCGGCATGGC GCGGATCGCG
CTGGTCCAGG CGCTGATCCC GCAGGGCAAG CTGACCGAGG CGCGGACGCT GCTGGCAAGC
TCCTCGCGCG GGGCGCGGAC GCCCTCGCTG GTGCATCGTC TCTATGGCGA CGCGTTCGTG
GCCGAAGAGA AATGGAGCGC GGCGGAGAAA TCCTATCGCG CGGCGGTCTC GGCCCTGCGC
GAAGGCGGCG ACGAGATGCT GGCCAGGATC GACGCCCAGA AGGCCGCCAA CCCCAAGGCG
ACGGGCGCGG ACCTGATGAA GATCTACACC GACGCGTTCG AGGCGCGCCG GGCCGAGCAG
GTCGCGCAGC GCCAGGCCCA GGATCCGGCC GAGGCGCGCG AACGGCGCAG GGCCGCCCGG
GCGGAGCGGC GCGACGGTCC CAATGCGGAG CGGCGTCGCC AGGTGTTGCA GCGGCTTGCC
CAGCAGCGGC GCGCCAACAA TGCGACCGGC ACCGCGGCGG GGCCCCAGGC CGGCGGCGGA
TTGCGCGCCC GCATCCAGGA GCGGCGCGCC CAGCAAGCGG CCGGGACCGC CCCGGCGGCC
GCCGGCGGCG TGACCGGGGA GGTGATCCCG CCGCGCGGTG GCGGGGGCGG CCGGTTGCGC
AACCTGATCG CCCGGCGGCG GGGCACCCCC CCGGGCGCCC AGTCCTGA
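
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be stripped before any analysis. The sketch below is one minimal way to do that and to compute a GC fraction; the helper names are hypothetical, and only the first and last rows of the sequence are pasted in to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGTCCGATA CTCAGGAATG GACACCGAAC GTTCAGCAAG TTGGTGGCAA TCAGAACGAA"
last_row = "AACCTGATCG CCCGGCGGCG GGGCACCCCC CCGGGCGCCC AGTCCTGA"

seq = clean_sequence(first_row + " " + last_row)
print(seq[:3], seq[-3:])  # ATG TGA
```

Passing the full cleaned sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information table.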
 
Protein sequence
MSDTQEWTPN VQQVGGNQNE AMNYVRSLLR EGRTDEARAE LQMMIEADPQ DTRAMMAFAM 
SLVREQRLEE AAPYVERALE VEPGNTTAAL MGAQIGVRGG NAEYAEAHYH KALQADPRNM
RALMGLARLH GQNQKPEAAI EVLQTALEVD PQSARVRRQL ATLLQRTEKT EEAKAQLRAA
LTANPNDQGA SVQLANICMR AGDTAEAIQV LETALEAQPD NRRLTMSLGR MRLRAEDYAG
AEATLRPLTT GQRGGMARIA LVQALIPQGK LTEARTLLAS SSRGARTPSL VHRLYGDAFV
AEEKWSAAEK SYRAAVSALR EGGDEMLARI DAQKAANPKA TGADLMKIYT DAFEARRAEQ
VAQRQAQDPA EARERRRAAR AERRDGPNAE RRRQVLQRLA QQRRANNATG TAAGPQAGGG
LRARIQERRA QQAAGTAPAA AGGVTGEVIP PRGGGGGRLR NLIARRRGTP PGAQS