Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A3088 |
Symbol | |
ID | 4905199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 3001222 |
End bp | 3003054 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640146191 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001077117 |
Protein GI | 126456065 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.447137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACCGAAC GCGAACGCGC CGCGCGCGGG CGCAACGACG CGCGGCGCGG CGGCGGCGAT AGCGGCGGCG ACGACGAGGC GTGGCGGCGC TGGCTGCCGG GCATCGCGAC GTTGCGCACG TACCGGCGCG CGTGGCTCGC ACGGGATCTG TACGCGGGCG TCGCGCTGAC CGCGGTGCTC GTGCCGGTCG GCATGAGCTA CGCTCAGGCG GCGGGCCTGC CGGTCATCGC CGGCCTGAAC GCGTCGATCG CCGCGCTCGT CGGCTACGCG ATCTTCGGGC CAAGCAGGAT CCTGGTGCTC GGCCCCGATT CGGCGCTCGC CGCGCTGATC GCCGGCGCGA TCGCGCCGCT CGCGCACGGC GAGCCCGCGC ACGCGGTCGC GCTCGCGGCC GCGCTCGCGT TGATGTCCGG CGGCTTTTGC GTGCTCGCCG GCCTGCTGAA GCTGGGCTTC GTCACCGATC TGCTGTCCAA GCCGATTCAA TACGGCTATC TGAACGGGCT CGCGCTGACG CTGATCGCGA GCCAGCTACC GAGCCTGCTC GGCACCGTGC CGCTCGGCGG CACATTCGTC GACCAGGTCG CGAGCCTCGC CGCCACCGTC GCGCAAGGCC GGATCGATTT CGCGTCGCTC GCGCTCGGCG GCGGCTGTCT CGCCGGCATC GCGCTGCTGC GGCGCGTCGC GCCCGCGTGG CCGGGCATGC TGATCGCGGT CGCCGGCGCG TCGATCGTCG CCGCGTGGCT CGGCGCGGCG CCGGACGCGG GCGGCGCACA CGCGCATGTC GCCTACGCGC ATGTCGCAAA CGCGCATGTC GCTCTCGTCG GCTCGCTCGC CGGCACGCTG CCGCCGCTCG GCCTGCCGTC GATCTCGCTC GCCGACGCGA GCCGGCTCAT CGCCGGCGCG CTCGCGATCG CGATGGTGTC GGTCGCCGAC ATCAGCGTGC TGTCGCATGT GTTCGCGCAG CACGACGGCA GCGAAACGGA CCGCAATCAG GAACTGTGCG CGCTCGGCGC GGCGAACCTG CTCGCCGGCA TGCTGCGCGG CTGCGCCGTC AGCAGCAGCG CGTCGCGCAC GCCCGTCGCG CTCGCGGCCG GCGCGCGCAC GCAGTTGACG AGCCTCGTCG CGGCCGCGTG CATCGCGCTG CTGCTTGTCG CGCCGACGCT GCTCGCCCGC GTGCCGCTCG CGGCGCTCGC GGCCGTCGTC GTCTATTCGG CGAACGCGCT CGTCGACGTG CGTGCGATCG TTCGGCTCTA TCGCGTGCGC CGCGGCGAAT GCGCCGTATC GGTGCTCTGC TTCGCGGGCG TCGTGCTGCT CGGCGTCGTG CCGGGCATCC TGCTCGCCGT CGGGCTGTCG CTGCTGTCGT TCGTCTGGCG CGCGTGGCAC CCGTACGACG CGGTGCTCGG CCGCGTCGAG GGCATGCACG GCTATCACGA CGTGTCGCGC CACCCGGGCG CGGCCCTCAC GCGCGGCCTC GTCGCGTTTC GCTGGGACGC GCCGCTGTTC CATGCGAACG CGACGATCTT TCGCGATCAC GTGCGCGACG CGATCGCCGA GGCCGACGCG CCGGTGCGCT GCGTCGTGAT CGCCGCCGAA CCGATCACCG ATGTCGACGT CACCGCCGCC GACATGCTCG CGACGCTGCG CGACGAGCTC GCCGCGCGGC GGATCGCGCT GGTGTTCGCG GAAATGAAGG GGCCGGTCAA GGACCGGTTG CGCACGTACG GGCTCTTCGA GAAGATCGGC GCCGATCATT TTTTTCCGAC GGTGACGGAC GCGATCGAGC ATTTCACGCG GATGCGCAAG GACGTGGCGA CCGCGCGGCG GGCGCGGCGT TAG
|
Protein sequence | MTERERAARG RNDARRGGGD SGGDDEAWRR WLPGIATLRT YRRAWLARDL YAGVALTAVL VPVGMSYAQA AGLPVIAGLN ASIAALVGYA IFGPSRILVL GPDSALAALI AGAIAPLAHG EPAHAVALAA ALALMSGGFC VLAGLLKLGF VTDLLSKPIQ YGYLNGLALT LIASQLPSLL GTVPLGGTFV DQVASLAATV AQGRIDFASL ALGGGCLAGI ALLRRVAPAW PGMLIAVAGA SIVAAWLGAA PDAGGAHAHV AYAHVANAHV ALVGSLAGTL PPLGLPSISL ADASRLIAGA LAIAMVSVAD ISVLSHVFAQ HDGSETDRNQ ELCALGAANL LAGMLRGCAV SSSASRTPVA LAAGARTQLT SLVAAACIAL LLVAPTLLAR VPLAALAAVV VYSANALVDV RAIVRLYRVR RGECAVSVLC FAGVVLLGVV PGILLAVGLS LLSFVWRAWH PYDAVLGRVE GMHGYHDVSR HPGAALTRGL VAFRWDAPLF HANATIFRDH VRDAIAEADA PVRCVVIAAE PITDVDVTAA DMLATLRDEL AARRIALVFA EMKGPVKDRL RTYGLFEKIG ADHFFPTVTD AIEHFTRMRK DVATARRARR
|
| |