Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1442 |
Symbol | |
ID | 7315151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1538888 |
End bp | 1541191 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643616332 |
Product | sulphate transporter |
Protein accession | YP_002513514 |
Protein GI | 220934615 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.928724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAAA AACCCAGTAA CGGCGTCAAG GGCCTCAGGC ACTGGCGTTA TGACCTGAGT GCCGGCCTGC AGGTGGCGCT GGTGTCCCTG CCGCTGTCCC TCGGGATTGC CATCGCCTCC GGTGCGCCGC CGGTGACCGG GGTGATCTCG GCCATCATCG CAGGCCTGAT CTTTCCCTTC CTGGGCGGTG CCTACGTCAC CATCAGCGGC CCGGCCGCGG GGCTCGCGCC GGCACTGCTC TCGGGCATGC TGCTGCTGGG CGGCGGCGAT CTTGCCGCCG GCTACCCGCT GTTGCTGGTG GCCATCTGCC TTACCGGCCT GCTGCAGATC CTGCTGGCCT TCATGAACGC GGGGCGTTTT GCCATCTTCC TCCCGGTGAC CGTGGTGGAG GCCATGCTCG CCGCCATCGG CATCATGATC ATCCTCAAGC AGATCCCGCC CTTGTTCGGT GATCTGTCCC CGGTGGCACC CACCACCGTG GATTCCATCC TCAAGCTGCC TCACACGCTG CTCAACATCG AGCCGACGGT GTTCCTGATC GGCGCCGTGT GCCTGTTCCT GATGTTCTAT CTGAACGCGA CGCGCCGGCA GTGGATGAAA CGCATCCCGC CGCCCCTGTT CGTGGCCCTG CTGGGACTGG TGCTGGCCAC CGTGTTCGGC CTGGAGACGG TGTATCTGAT CACCATGCCG GAGAGCCTGC TGCAAGGCGG CATCACCGTG CCCGCCTTCG GCGAGGTGAT GAACCGGCCG GAACTCTGGA TCAACCTGCT CATCGTGGTC ATCACCCTGA CGCTCATTGA CGGCATCGAG TCCCTGGCCA CCATCGCGGC CGTGGACAAG ATCGATCCCT ACCAGCGCAA GTCCAACCCC AACATCACCC TGCGCGCCAT GGGCGTCTCC AACATGCTGT CGAGCCTTGC GGGCGGTCTG ACCATCATCC CGGGCGGCGT GAAGAGCCGT GTGAACATTG ATGCCGGCGG CCGCACCCTG TGGGCCAACT TCTACAACGC CATCTTCCTC ATCCTGTTCC TGCTGGTCGC CACGGATCTC ATCGCCATGA TTCCCCTGGC CGCCATCGCC GCGATCCTGA TCTACGTGGG CTGGCGCTTG TGCGAGTACA AGGTGTTCCG CAAGACCTAC GCCATCGGCA GGGACCAGAT GGTGATCTTC CTGATCACCG TAGGTGCGAT TCTGGCCACG GATCTGCTGT GGGGCATCCT CATCGGCATG GCTGCGGAAG TGCTGATGCT CGCCTATCTG CTGACGCCCT CGTTCCGGGT GGTGCTCACC GGGAGACTGA GCTTCTCCCA GTCTCTGGAG TTGCTGGCAC GTAACTTCAT GGGTCTGTTC AGAAATCCCA TCATCAAGGT GCACGGTGAC ACCCGTGACG GACGGCCCCA CTACGATGTC TCCGTGGGTT CCCTGGTGTG TTTCAACCTG CTGCCCTTTG ACAAACTCCT GCAGCAGCTG CCCGAGGATG CCGGTCTCTC CATCATCGTC ACGGAGTCCG GACGCATCAT CGACCACACG GCCATGGAGT ACCTGCACCA GTTGCAGGAG GAGTCCCTGC GCGCACAGCG CCCCTTCGAG CTGCTGGGTA TGGAGAAGTA CTACCAGTTC ACCCAGCACT CCCTGTCGGC GCGCATGCAC GATGCCGCGC TGGTCAAGCG GGAGGCGCGG CTGTCCGCCC GCGCGGAGGA GATGCACCAG TTCGCGCGTC TGCATGGTCT GCAGTTCGAT CCCGCCACCC TGGCGGAACT CAACGCTCAC GATTTCGTCT ACCTGCGCCG CGGCGATCAC AAGCAGGAAC GCAACGTGAT GACCGGCCAC TATCGTGACT GGATGCTGCG CGCTTTCGAT TACAGTCATA CCGCCGCGCC GGATTACTAC GCGGAACACC AGCACACCCT GGTGATCGTC AAGCCCGCAG AGCCGGTCTC GGGTCTGCCC GACATGGTGC TGGCACCGGG GCACTACCTG AAGAGCTACC TGGTGAATTA CCAGGAACTG GAATTGCCCG AGTCCCTGGG TTTCCCCGAG GGCTATCGCC TGTATGCACC GAGTACCCAC GATGGCGTGA TCGCGGCAGC GGGGCGTTTG CGTGGTTTCC TGGCACGCCA CCCCGGTGTG TATCTCGAAG TGCGCAAGAA CGCCGTGCTC ACCTTCCGGC CCGATCGGGA ACTGGAGATG CCGGAGGGGA TCGAGGAACT GCTGGAAATC ATCGAGTACT GTTTGGAGGG TTCTCCATCC CCCGCTGATG ACAACGTATC CGGCGCGGAC CCTGCGGAGG AGGGGGCATC CTGA
|
Protein sequence | MDEKPSNGVK GLRHWRYDLS AGLQVALVSL PLSLGIAIAS GAPPVTGVIS AIIAGLIFPF LGGAYVTISG PAAGLAPALL SGMLLLGGGD LAAGYPLLLV AICLTGLLQI LLAFMNAGRF AIFLPVTVVE AMLAAIGIMI ILKQIPPLFG DLSPVAPTTV DSILKLPHTL LNIEPTVFLI GAVCLFLMFY LNATRRQWMK RIPPPLFVAL LGLVLATVFG LETVYLITMP ESLLQGGITV PAFGEVMNRP ELWINLLIVV ITLTLIDGIE SLATIAAVDK IDPYQRKSNP NITLRAMGVS NMLSSLAGGL TIIPGGVKSR VNIDAGGRTL WANFYNAIFL ILFLLVATDL IAMIPLAAIA AILIYVGWRL CEYKVFRKTY AIGRDQMVIF LITVGAILAT DLLWGILIGM AAEVLMLAYL LTPSFRVVLT GRLSFSQSLE LLARNFMGLF RNPIIKVHGD TRDGRPHYDV SVGSLVCFNL LPFDKLLQQL PEDAGLSIIV TESGRIIDHT AMEYLHQLQE ESLRAQRPFE LLGMEKYYQF TQHSLSARMH DAALVKREAR LSARAEEMHQ FARLHGLQFD PATLAELNAH DFVYLRRGDH KQERNVMTGH YRDWMLRAFD YSHTAAPDYY AEHQHTLVIV KPAEPVSGLP DMVLAPGHYL KSYLVNYQEL ELPESLGFPE GYRLYAPSTH DGVIAAAGRL RGFLARHPGV YLEVRKNAVL TFRPDRELEM PEGIEELLEI IEYCLEGSPS PADDNVSGAD PAEEGAS
|
| |