Gene Information |
Locus tag | Noca_0538 |
Symbol | |
ID | 4596205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 572921 |
End bp | 573958 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639775152 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_921767 |
Protein GI | 119714802 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
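The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 572921
end_bp = 573958

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (1038 bp) and Protein Length (345 aa).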
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACTT CGATCAAGGT GGCGGCGGCC GCGTTCGCCG GGGTGCTTGC CTTGACCGGC TGCTCCGCCT CTGCGGACGG CGACGCCGAC GAGACCGTCA ACGTCAACGC GTTCTCGGTG ATGGAGGCGG CCAACGAGCC CGTCTTCGCC GACTTCGCGG CCACGGAGGA GGGCAAGGGG GTTCGGTTCG AGACGTCGTA CGGCGCCTCC GGTGACCAGA GCCGTGCCGT CGTGGCCGGC GCGGCCGCGG ACGTCGTGCA CTACTCCCTC GAGACCGACG TGACCCGGCT CGTCGACGAG GGCCTGGTGG CCGAGGACTG GAAGGACGAT GCGACCAACG GCATCGCGAC GTCGTCGGTC GTCGTGTTCG TGGTCCGCAA GGGCAACCCC GAGAACATCC AGACCTGGGA CGACCTGGTC AAGCCCGGCG TCGAGATCAT CACGCCCAAC CCGGGCTCGT CCGGCTCGGC CCGCTGGAAC ATCCTGGCCG CCTGGGCGCA CGTGACCGGA AACGGCGGCA GCGCTGAGGA CGCGAAGTCG TTCCTCACCC GGCTGCTCGA CAACACGATC GCCCTCCCGG GCTCTGGCCG GGAGGCCACC ACGGCCTTCA CCGACGGCTC CGGCGATGTG CTGCTCTCCT ACGAGAACGA GGCGATCCTC GCCAAGCAGA GCGGCGCGGA CGTCGACTAC GTGCTGCCGC CGGACACGCT GCTGATCGAG AACCCGGCCG CCGTCACCGT CGACGCGGAC GAGACCGCCC AGGCGTTCCT CGAGTTCATG ACCTCGCCGG AGGCGCAGGC CGACTACGCC CAGTCGGGCT TCCGCCCGGT CGTCGACGGC GTCGACATCG GTGCGGTCGA GGGCGCCAAC GATCCGTCGG ACCCGTTCCC CGCACCCGAC CGGCTGTTCA CGATCGACGG CGACTTCGGG GGTTGGGGTG AGGCAGCCGA CAAGTACTTC GGTGACGGCG AGGAGGGGCA CCCCCTCGGC ATCATCACCG AGCTGCAGCA GCAGACCGGC AAGGTGGGCG AGGAGTAA
|
Protein sequence | MKTSIKVAAA AFAGVLALTG CSASADGDAD ETVNVNAFSV MEAANEPVFA DFAATEEGKG VRFETSYGAS GDQSRAVVAG AAADVVHYSL ETDVTRLVDE GLVAEDWKDD ATNGIATSSV VVFVVRKGNP ENIQTWDDLV KPGVEIITPN PGSSGSARWN ILAAWAHVTG NGGSAEDAKS FLTRLLDNTI ALPGSGREAT TAFTDGSGDV LLSYENEAIL AKQSGADVDY VLPPDTLLIE NPAAVTVDAD ETAQAFLEFM TSPEAQADYA QSGFRPVVDG VDIGAVEGAN DPSDPFPAPD RLFTIDGDFG GWGEAADKYF GDGEEGHPLG IITELQQQTG KVGEE
|
| |
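The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might clean the gene sequence, estimate its GC fraction, and check that it is framed by a start and a stop codon. The function names are hypothetical, and the start codon set (ATG, GTG, TTG) is an assumption covering only the common bacterial initiators, not the full list for translation table 11.

```python
# Assumption: only the most common bacterial start codons are checked.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def is_framed_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and stops on codons."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Usage with the first two blocks of the gene sequence from this record.
prefix = clean_sequence("ATGAAGACTT CGATCAAGGT")
print(len(prefix), round(gc_fraction(prefix), 2))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information table.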