Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_4189 |
Symbol | |
ID | 5714720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009959 |
Strand | + |
Start bp | 34761 |
End bp | 35777 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641277084 |
Product | ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein |
Protein accession | YP_001542380 |
Protein GI | 159046712 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.623523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTTA GCAAGATGAC CGGATCGCTC CTGGCCGGCG CGGGCCTCGC CGCGCTGCTG GCCGGCAGCG CCACCGCCAA GGAGTTCGAC CTGAGCGGCA CCGAGGTGAC CATCGGCACC GCGCAGGCGC AGGTCCTGAA CCTCGGCACC CTGCGGATGA TCGAGATGCT CACCGAATGG GGTGCCGATG TCACCCGGGT GGAGCTGCCC AATATCTCGG GGCTCGAAGC CATTGTCGCG GACCGGATCG ACATCGCCTC GCGCAGCTCG GACGAGATCC TCGCGGGTCA GTCGCGCGGC GTCGACGTGG TGGCCTTCGC GGCGCCGATC TCCACCATGC ACTATGCCGT GGTCTCGACC CCGGGCATCG ACTCCCTCGA AGACCTGCGG GGCCAGTCCA TCGCCACCAG CGGCCCCGGC GGCTTCAACG GCATGCTCTT TCGCTACATG CTGACCCAGG CCGGGCTCGA GCCCGAGGTC GACGTGGCCA TCGTCCCCGT GGGCGGGTCC AGCGAGCGGG CCGCCGCGAT CATGGCCGGG CAGGTGCAGG CGACGGTCGT GTTCATCGAC AACTGGCTCG CCCTGCGGGA ACAGGGCGCG AATGCGCAGC TTCTGGGCTA CGTGGCCGAC CTGGTACCGG GCCTGTCCTC GCGCGCCGTC TTCGCCCCGC GCGACTACCT GGCCGCCAAC GAGGACCTGG TGACGGCCAT CGCCTGCGCG AACCTCGAAG TGAACGCCTG GATCAATTCC AACAAGGACG ACTTCGTCGC CTACGCCATG GACAATGTGC GCGGCGCCAA CGAGGACGCG GTCTCGGCCT TCTACGACGT GGCGATGGAG ATCAACATGT TCCCCACCGC GCCGCGCGAA CTGCTGGATG TCTCCGGCTA CCAGGCGCTG GCCGACCTGA TGTACGCGGG CGGGGAGCTG GACGACCAGC TCGATGCCAG CGGCTTCGTC GACTTCACCT ATGTGGACCG GGCCGCCGAA ATGGGCTGCG GCACCGGCGC GATGTAA
|
Protein sequence | MTLSKMTGSL LAGAGLAALL AGSATAKEFD LSGTEVTIGT AQAQVLNLGT LRMIEMLTEW GADVTRVELP NISGLEAIVA DRIDIASRSS DEILAGQSRG VDVVAFAAPI STMHYAVVST PGIDSLEDLR GQSIATSGPG GFNGMLFRYM LTQAGLEPEV DVAIVPVGGS SERAAAIMAG QVQATVVFID NWLALREQGA NAQLLGYVAD LVPGLSSRAV FAPRDYLAAN EDLVTAIACA NLEVNAWINS NKDDFVAYAM DNVRGANEDA VSAFYDVAME INMFPTAPRE LLDVSGYQAL ADLMYAGGEL DDQLDASGFV DFTYVDRAAE MGCGTGAM
|
| |