Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0137 |
Symbol | |
ID | 7316143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 147500 |
End bp | 149896 |
Gene Length | 2397 bp |
Protein Length | 798 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643615025 |
Product | biofilm PGA synthesis protein PgaA precursor |
Protein accession | YP_002512226 |
Protein GI | 220933327 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.764186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCCAT GGGCCCTGTT GGCTCCCGGG TATGCCAGCA GCCTTGACCA GCAGCGGGAA TCGGCCGTCG AACTCGCCCG TGAGGGACGC ATAGACGAGG CCGCAGATCG ACTGCGGACG CTGGATAATC AATTCCCGAA CCAGCCTCTT CTGCAGGCAG ACCTGATCGT CGTGCTGCGC CTGTCTGGCG ACAATGCCGG GATTGCGGCA CGAACGGCGG GCCTCGATCC CGAATCCGTG CCGGACTATG CCCACATGAG CTGGGCTGCT GCCCTGCGTG ACCAGGGTGA ATTCACGGCG GCGGCGGATG TCCTGCGCGA GTCCAGGAAG CGGGTGGGCA CCTCCGCGCA GATCCTCTTT GCCGTGACCA GCGCCGAGGC TGGCGACATC CCGGCGGCAC GCCAGGCCAT TGCACACATT CACCCCTTGC CCGAATCGGC CCATGAACTG GCCCTGATGG CCTATGCCCT GCGCCAGTCG GGTGCATCGC ACGAGGCACT GGCGCTGGCC AATCGAGCGC GTGCCCGCGA CCCACAATAT GCCCTGGCCT TTCAGGAACA GGCCATGGCC CTGTGGATGC TGGGGGCCAG CAACCGGGCG TTTGCCGCGA TGCAGGAACG TCCGGATCTG TTCGAGGATG ACATTCGCCA TCAAGCCGAT GCAGAGGCAA TCGCAACGGA CATCCGGCAG GCGCTCGACA TCCGCGCCGA GATGGAGGCC GCAGGCCGCC ACCGTGAGCG AAACCGGGCG CTGGATCAGG CCCTGGCGAG GGTAGACGGA TTTCTTTCGC GTGTCCCTGA TACGCACCCT CAACACCTTC GGGTCCGCTT CGACCGGGTT GCATTACTGC GCGAACTCGA GCGAATGCCC GAGGCCATCG ATGCATTCGA GGCATTACCG GAGCCGACAG CAGCCCCGCC CTTTGTGCGC CGGGCAGCAG CGGATGCCTA TCTGGCCGAG CAACAGCCGC GAGCGGCCCT GCCCCTGTAC CAGTCCCTGA TACCGGAGGA TGAGCCGCCG GAGGCCTCGC TGTTGCTGGA TCTCTATTAC ACGCACATCG CACTCGAGGA TTATGCGGCA GCAGCGGATC ACCTGGATGA GGCCTACCGT CACACCCCGG TGTGGCTGAG TGTTGGCCCG GACCGGGACC CGATCCCCAA CTGGGAGCGG GTAGACATTG ATCACCTTCA CGCCTACGAC GCAGCCTATC GGCAGGATCT TGGCCTGGCC TGGGAACGCA GTAGCGAATT GGTGCGTCAG GCGCCTGCGC ACGCGGGGCT GCGCAACACC CAGGCACGCA TCGCCCGCTG GCGAGGCTGG CCGGCCCGCT CGCGTGAGAT CACCGAAATC GCCGAACGCT GGGCGCCCGA TGCGCGGGAT ACACGGCTGA ATCGGGCGGA AAATGCGCGG GACCTGGGTG AGCACGATGC CTGGAAGCAA GAGCTTGCCT CCCTGAGGGC CGACTACCCG CGCAGCCGGG ATGTGCAGCG TCTATCCGCA GAACTGGAGG ACCGTGACCG CCCCTCCATC GAGAGCGAGC TGATTTTTGG CCGAACCTCG GGTGGCGATG GCCTGGTCAG CGGCGATCGC GACCGGACGT GGCGTACGCG GCTGAATTCG CCCTGGTCCG CTCATGCGCA CCGGGTCTTT CTGGAACACC GTGACTCTAC AGCTACCTTC GATGAGACGC AGGCACGTCA TGAACGCATC GGCGCAGGTG TCGAGTGGGC TGCCCGACGC AAGCAGGCCT GGGTACGTCT CGATCGTGAT TTGACGAACG ACACCAACCC CGGTGTCGCG GCCGGGTGGT CGCAGTGGCT CAATGACCAC TGGCGTTTCG GCATCGAAGC CGACAGCGTC TCGATGGAAA CACCCTTGCG TGCAATCGAT GCGGGCCTGG AAGGCTGGGC GGTCTCGGCT TCCGTGGACT GGCGTGCCCA CGAATCGCTG TCAGCCTACA CCAGGCTCGG CCTGCTCTCG ATCGATGACG GCAATCGCCG CACTTCGCTG GGCACTGGGG TGACCCATCG GGTGTTCGCC AACGCCCACC ATGCCACCGA CCTGGGCGCC GATGTCTATT TCCAGAACAA CAGCCAGCCG GGCGGGCCCT ATTTCAACCC GGAGCAGTCC GCCAGCCTGT CCATCCGGGT GGACCACCAG TGGATAACCT GGCGCCACTA TGACCGCTCC TTGACCCAGC ACTTCCATGC CGCAGCAGGG GCCGGCTACC AGTCCGGGTT CGGCAGCGAT CCGGCCATCG CACTGCGCTA CGAGCATCGC TGGAGCCTGG ATCGCCGCTG GGGCTTCGAT TACGGCGTGG GCTGGGGCTC CACCACCTAC GATGGCGACC GGGAAAACCG TCTGTTCGGG CTTGTGCGCC TGCGAGGGAT CTTCTGA
|
Protein sequence | MLPWALLAPG YASSLDQQRE SAVELAREGR IDEAADRLRT LDNQFPNQPL LQADLIVVLR LSGDNAGIAA RTAGLDPESV PDYAHMSWAA ALRDQGEFTA AADVLRESRK RVGTSAQILF AVTSAEAGDI PAARQAIAHI HPLPESAHEL ALMAYALRQS GASHEALALA NRARARDPQY ALAFQEQAMA LWMLGASNRA FAAMQERPDL FEDDIRHQAD AEAIATDIRQ ALDIRAEMEA AGRHRERNRA LDQALARVDG FLSRVPDTHP QHLRVRFDRV ALLRELERMP EAIDAFEALP EPTAAPPFVR RAAADAYLAE QQPRAALPLY QSLIPEDEPP EASLLLDLYY THIALEDYAA AADHLDEAYR HTPVWLSVGP DRDPIPNWER VDIDHLHAYD AAYRQDLGLA WERSSELVRQ APAHAGLRNT QARIARWRGW PARSREITEI AERWAPDARD TRLNRAENAR DLGEHDAWKQ ELASLRADYP RSRDVQRLSA ELEDRDRPSI ESELIFGRTS GGDGLVSGDR DRTWRTRLNS PWSAHAHRVF LEHRDSTATF DETQARHERI GAGVEWAARR KQAWVRLDRD LTNDTNPGVA AGWSQWLNDH WRFGIEADSV SMETPLRAID AGLEGWAVSA SVDWRAHESL SAYTRLGLLS IDDGNRRTSL GTGVTHRVFA NAHHATDLGA DVYFQNNSQP GGPYFNPEQS ASLSIRVDHQ WITWRHYDRS LTQHFHAAAG AGYQSGFGSD PAIALRYEHR WSLDRRWGFD YGVGWGSTTY DGDRENRLFG LVRLRGIF
|
| |