Gene Information |
Locus tag | BTH_II1166 |
Symbol | |
ID | 3845076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1358405 |
End bp | 1359466 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637838469 |
Product | regulatory protein NasS, putative |
Protein accession | YP_439363 |
Protein GI | 83716082 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.241228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACAG CGCGCATCAA CCAAACGGCT CCGGACGCGC CGGAACGCGG CCGCCTGCGC ATCGGCTTCG TCGCGCTGAG CGATGCCGCG CCGCTCATCG CGGCGAAGCG GCTCGAACTC GGCGAACGCT ACGGCCTCAC GCTCGAGCTG TGCCGGCAGC CGTCGTGGGC GAGCATCCGC GACAAGCTGC TGTCGGGCGA ACTCGACGCA GCGCATGCGC TGTATGGGCT CGTCTACGGC GTGCAGCTCG GCATCGGCGG GCCGCGCGCC GACCTGGCGG TGCCGATGGT GCTGAACCGC AACGGCCAGG CGATCACGTT CTCGAACCGG CTCGCCGACG CGTATCGCGC GTCGGGCGAC CTGAAGGCCG CGCTCGCGAC GCTCGGCCGG CGCCCCGTGT TCGCGCAGAC CTTCCCGACC GGCACGCACG CGATGTGGCT GTATCACTGG CTCGCGTCGC ACGGCGTCGA TCCGCTGCGC GACGTTCGCA GCGTCGTGAT TCCGCCGCCG GAGATGGTGG GCGCGCTCGC GGCGGGCGAG CTCGACGGGC TGTGCGTCGG CGAGCCGTGG AACGCGGTCG CGCAGGCGCG CGGCGCGGGC CGCACGGTCG CGACGACGAG CGAAGTGTGG CGCGATCATC CGGAGAAGGC GCTCGCGTGC CGGCGCGAAT TCGTCGCGCT GTATCCGAAT GCCGCGCGGC TGCTCGTGCG CACGCTGCTC GACGCTTGCG CATGGCTCGA CGATCCGGCG CATCGGATGC GGGCGGCCCA ATGGCTGGCG GAACCGGACG CGATCGGCGT GCCGCTCGAG CAGATCGCGC CGCGGCTGCT CGGCGACTAC GGCGCCGGGC CGTTCGCGCA GCCGCCCGCA CCGATCCGCT TCTACGCGCA CGGCACGGCG AACCGGCCGG CGGCGAGCGA CGGCCTGTGG TTCCTGTCGC AGTATCGGCG CTGGGGGATG TTGAGCGGCG ACGTCGACGA TGCGGCGATC GCGACCGGCG TCGCGCAGAC GGCGATTTAT GACGAAGCGG TCGCGCTCGC GGGGGCCCGG CGAGCCGATT GA
|
Protein sequence | MDTARINQTA PDAPERGRLR IGFVALSDAA PLIAAKRLEL GERYGLTLEL CRQPSWASIR DKLLSGELDA AHALYGLVYG VQLGIGGPRA DLAVPMVLNR NGQAITFSNR LADAYRASGD LKAALATLGR RPVFAQTFPT GTHAMWLYHW LASHGVDPLR DVRSVVIPPP EMVGALAAGE LDGLCVGEPW NAVAQARGAG RTVATTSEVW RDHPEKALAC RREFVALYPN AARLLVRTLL DACAWLDDPA HRMRAAQWLA EPDAIGVPLE QIAPRLLGDY GAGPFAQPPA PIRFYAHGTA NRPAASDGLW FLSQYRRWGM LSGDVDDAAI ATGVAQTAIY DEAVALAGAR RAD
|
| |
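The length fields in this record can be cross-checked against one another. The sketch below is a minimal consistency check, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 1358405
END_BP = 1359466
GENE_LENGTH_BP = 1062
PROTEIN_LENGTH_AA = 353

# Last bases of the gene sequence as listed in the record ("...CGAGCCGATT GA").
GENE_TAIL = "CGAGCCGATTGA"

# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def ends_with_stop(sequence: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return sequence[-3:].upper() in STOP_CODONS_TABLE_11


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(ends_with_stop(GENE_TAIL))
```

Under these assumptions the three fields agree: the coordinate span matches the 1062 bp gene length, and 1062 bp corresponds to 353 codons plus one stop codon. Because the gene is on the minus strand, the listed gene sequence is presumably the reverse complement of the replicon sequence over that span.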