Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1941 |
Symbol | |
ID | 5712935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2030122 |
End bp | 2031135 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267866 |
Product | NMT1/THI5 family protein |
Protein accession | YP_001533283 |
Protein GI | 159044489 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000780337 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTTCCAAC GCTTCACACA GATCTGCGCG GCCGGGGCCG TGCTGGCCGC CGCCACCGCC GCCGGGGCCG AAACCGAGGT GCCCTTCGCC CTCGACTGGA AATTCGAAGG CCCCGCCGCC CCCTATTTCG TCGCCGTCGA CAAGGGCTAT TTCACCGATG CGGGCCTGTC GGTGGAAATC GCCGCCGGCC AAGGCTCCCT CGACGCGATC CCCAAGGTCG CCACCGGCGC CTTCCCCGTG GGCTTTGCCG ACATCAACTC GCTCATCAAG TTCCTCGACC AGAACCCGGG CGCGCCCGTG ATCGCGGTGA TGATGGTCTA TGACAAGCCG CCCTTCGCCA TCGTCGGGCG CAAGTCGCTC GGCGTCGAAG CCCCCGCCGA CCTCGAAGGC CGCGTGCTCG GCGCGCCGCC CCCCGACGGG GCCTGGGCGC AATTCCCGAT CTTCGCCGCG GAAAACGACC TCGACACCGA CGCCATCACG GTCGAACCCG TGGGCTTCCC GACCCGGGAG CCGATGCTGG CCCAGGGCGA GGTGGCCGCG GTCACGGGCT TCTCCTTCTC GTCCTACCTC AACCTCGTGC GCCTCGGCGT GCCGGAGGAC GACATCTCCA CCATCCTGAT GGCGGACCAC GGCGTGGACC TCTACGGCAA CGCGATCATC GCCAACACCG AATGGGCCGC GGAAAATGCC GAGCTGCTGA CCGGCTTCCT CGGCGCCGTG GTCAAGGGCT GGGCCGATGC CATCGCCGAC CCGGCGGCCG CGATCCCGTC CCTGATCGAA CGCAACCCCG CCGCCGATGC GGAGCTTGAG ACCCGCCGCC TGCAACTGGC GATCGACGCC AACGTGGCGA CCGACTACGC GCTGGCCAAC GGCATGGGCG GGATCGACGC GGACCGCATG GCCAACGCGA TCGAGCAGAT CAAGCTGACC TACGAGTTCC AGAACGCGCC CGACATGAGC CTCTATTTCA CCGACGCCTA CCTGCCCGGC GCCGAGATGC GGATGCTCAA GTGA
|
Protein sequence | MFQRFTQICA AGAVLAAATA AGAETEVPFA LDWKFEGPAA PYFVAVDKGY FTDAGLSVEI AAGQGSLDAI PKVATGAFPV GFADINSLIK FLDQNPGAPV IAVMMVYDKP PFAIVGRKSL GVEAPADLEG RVLGAPPPDG AWAQFPIFAA ENDLDTDAIT VEPVGFPTRE PMLAQGEVAA VTGFSFSSYL NLVRLGVPED DISTILMADH GVDLYGNAII ANTEWAAENA ELLTGFLGAV VKGWADAIAD PAAAIPSLIE RNPAADAELE TRRLQLAIDA NVATDYALAN GMGGIDADRM ANAIEQIKLT YEFQNAPDMS LYFTDAYLPG AEMRMLK
|
| |