Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4269 |
Symbol | |
ID | 5705774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4845279 |
End bp | 4846304 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641273688 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001539041 |
Protein GI | 159039788 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.933987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0127409 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGGC TGACCCGTAC GGTCGCCGCA GCCACGACGG CCGCTGCCCT GATGTTGGTC GGGGCATGTA GCGGCTCGGA CTCCACTGAC GACAAGGGGG GTGACAGCGG TGCGCTGGAG CAAGTAACCT ACCTCACCTC ATTTGGGAAC TTCGGCCGTG ATTCCTACGC CTGGGTGGCG AAGGAAAAGG GCTTCTTCCG GGACGCGGGC TTTGACGTCG ACATCAAGGC GGGGAAGGGC ACCGGTGCCG TTATCCAGAC GGTCTCCGGA GGCAAGGCGC ATTTCGGGCC GATCGACCTC ACCGGAGGTT TGCTCCAGTT TGGCAACGGC GAGGCAAAGG ACTTCGTCGT CGTGGCCGCG ATCCAGCAGC GCACCATGGC CGGCATCGCC ACCGTCGAGG GCACGAACAT CACCACCCCG AAGGATCTTG AGGGTAAGAA GATCGCGGAC GCCCCCGCCT CCGTGGTCCG CAACCTCTTC CCCACGTACG CCAAGATGGC CGGCGTCGAC GCGAGCAAGG TGACCTGGGT CAACGGTGCG CCGCAGGACC TGATGGGTAC CCTCGCCGCG GGCACCGTTG ACGGCATCGG GCAGTTCGTG GTTGGCCAGC CGACCATTGA GGCGGTGGCC AAGAAGAAGG CGATCATGCT GCCGTACAGC GAGTACATGC AGGATCTCTA CGGCAACGTG CTGATCACGT CGACAACGAT CGCCAAAGAG CAGCCGGACA TGGTCAAGCG TTTCCGCGAC GCTCTGCTCA AGGGCTTGGA CTACGCGTTG GCCAATCCGC AGGAGGCAGC TGAGCTGCTG AAGAAGAACG TGGACTCGAC GAACGTCGAC GCCGCCAGGT CGGAACTGGA ACTGATGGCC GGCTACGTGC GGTCCAGCAA CAGCGGTGCC CAGCTGGGCA CGGTGGACAG CGCCCGGGTG GCGCAGAGCA TTGCCATCCT GCAGGGCGCG GGCGCGCTCA AGCAGACCCT CGATCCCGAC GAGATCATCG ACTTCAGTCT CACGCCGAAG GCCTGA
|
Protein sequence | MSRLTRTVAA ATTAAALMLV GACSGSDSTD DKGGDSGALE QVTYLTSFGN FGRDSYAWVA KEKGFFRDAG FDVDIKAGKG TGAVIQTVSG GKAHFGPIDL TGGLLQFGNG EAKDFVVVAA IQQRTMAGIA TVEGTNITTP KDLEGKKIAD APASVVRNLF PTYAKMAGVD ASKVTWVNGA PQDLMGTLAA GTVDGIGQFV VGQPTIEAVA KKKAIMLPYS EYMQDLYGNV LITSTTIAKE QPDMVKRFRD ALLKGLDYAL ANPQEAAELL KKNVDSTNVD AARSELELMA GYVRSSNSGA QLGTVDSARV AQSIAILQGA GALKQTLDPD EIIDFSLTPK A
|
| |