Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_3181 |
Symbol | |
ID | 7315553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3330405 |
End bp | 3333671 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643618080 |
Product | helicase domain protein |
Protein accession | YP_002515237 |
Protein GI | 220936338 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC CCGAACACCC CGACAACGAG CAGATCGATC TGTTCTCCCC GTCAGCGGAA GGGCCCGCCG ATGCCTGGCC GGAACATGCG CGCTTCCCCT GCAACCGCCC CGGCCATACG GTCGGCGAGG TGGTACTGGA GGATCTGCGC ACATCCCAAA ACCCGCTAAT CGTCAGCGGT TATGCATCGC TGGAGCGCAT CCTGCGCCTG CTGGCGGATC TGAAACCCGA GCAAGGGCCC GTGCGCTTGC TGTTCGGTGT CGAACCCCAC GTGACGGGCC GGGAACGCTT CACCCTGCAA TACACCGACT TTCCCGGTGA GGTCCGGGAC TACTGGCTGT CACGGGGTGT CTCGGTGGAA GCCAGCGCAT CGCTGATCCA CGCCATCGGA CTCATCGAGG CAGGCCGGGT CGAGGCCCGA TACCTGGGCG GCAGCCGCAA GCGGCTGCAC GCCAAGATCT ACTGCGGCGA TGCGGGTGTT ACGCTCGGCT CCAGCAACTT CACGGACTCG GGTCTGCGCC ATCAGCTGGA GGCCAACGTG CGTTACACGG CGGCGGAACC CCGTCGCTTT TACGAGGCCC GTCGCATCGC AGAGAATTTC TGGGACGAGG GCCTGCCATA CGGAGATGAA CTGCTCGCCC TACTGCACCG CCTGCTGGCG GTGGTCGGCT GGCAGGAGGC GCTCGCCCGG GCCTGCGCGG AACTGCTAGA CGGCGAGTGG GCACGCAACT ACCTGGGCTC CCTACGACTT GAGGACGCGG GTCTCTGGCC CTCCCAGATC CAGGGCATCG CCGAGGCCCT GTGGCTCATG GAGAACGTGG GCAGCGTGCT GGTTGCCGAC GCCACGGGTT CCGGTAAGAC CCGCATGGGC GCGCACCTGA TCCGCGCCGC CATGGACCGC ATCTGGGGCA GCGGGCGCAT GCGCAAGGGA CGCCCGCTGA TGGTCTGCCC GCCGGCGGTG CGGCATGCCT GGGAACGGGA GGCGCGGCTC TGTGATCTTC CCCTGGAGAC ACTCTCCCAC GGCGTGTTGT CCCGGGCCGC AGGCAACGCC GGTGAACAGG TGGAGGACGC CCTGCGGCGC GCACAGGTAC TGGCGGTGGA TGAGGCCCAC AACTTTCTCA ACCTGGGGTC GCGGCGCACC CGCATGGTGC TCTCCAACAT GGCGGACCAC ACACTGCTTT TCACCGCCAC CCCCATCAAC CGCAGCGTGA TCGACCTGCT GCGTATCGCC GACACTCTGG GCGCCGACAA TCTGGCCCCG GAGACGCTCA CCATGTTCGA GGGCTTGCTG CGCAGCCGTC GCCTGGACCG CAGCCTGACG CCCGGTGAAC TCTCGGCCCT GCGCCAGGAA CTGGAGCGTT TCACGGTTCG GCGCACGAAG CGGGAACTCA ACGCCCTGGT GGACCGGGAT CCCGAGGCCT ACCGGGACGC TCGCGGCAAA CCCTGCCGCT ACCCCCGGCA CCATTCACAC ATCTACCCCT TGAACGAGCC TGCCGCCGAC CAGGCCCTGG CCGGTGAGAT ACGCGCCCTG GCGGGAAAAC TTCGGGGCCT GCTCTATCTG CGCGGCGCCA TCGAGATGCC GGCGGTACTC AAGCGGGAAC ACTGGGATAC GGCCCGTTAT CTGGACAGCC GCCTGAAATC CGCCCACATG CTCCCCGCCT ACGTGGTGAG TGCGGCGCTG CGTTCCTCCC GCGCTGCCCT GGCGGAACAT CTCAAGGGCA CAGCCTTTGC CCTGGAATAC TTCGGGCTCA CAGGGCAGTT CAACAAACCG CCTACCGGAG ACGTACTGGG CAAGCTCCGG GAGATCCGCG GGCAGATCCC CGAAAACCGT CTGGGCATCC CCCTGCCGGA CTGGCTCTGC GACCCTGAAG CCCATGCCCA GGCCTGCGAC GAGGAAGCCG CACTCTACGA GGCCATGTGG TCGGCACTCA CCCGTCTCAC CGATGGACGC GAACGGGCCA AGGCGGCGTT GCTGACCCGG CTGTTATCCG ACCACCACCT GGTGCTCGCC TTCGACAGCC GCCCCATCAC CCTGGCGGTC ATCCGCAGCC TGCTGACTCA GGAGAGCGAC GCCCGCGTGC TGCTGGCCAC CGGCGACAGC GGCTCCGACC GGAGCGAGAT CGCCGAGGTG CTGCGCCCCG GCTCCGCCCA TCAGCGGGTG GTAGCCCTGT GCTCCGACAG CCTTTCCGAA GGCGTGAACC TGCAACAGGC CAGCACCATC GTGCACCTGG ACATGCCCAG CGTGGTGCGC ATCGCCGAGC AACGGGTAGG TCGCGTGGAT CGCATGGACA GCCCTCACGA GGCCATCGAC GCCTGGTGGC CCGACGATGC CTCCAGCTTC GCCCTGAGTT CCGACGAGCG CTTCATCGAG CGATTCGAAA CGGTGGAGAA CCTGCTGGGC TCCAACCTGC CCCTGCCCGA GGAGATGCTG GGCAAGGGCC ACGGCGCGGT GCGTACCGAG GAATTGATCC GGGAGTTCGA GGAACAGGTC GACGCGGATC CCTGGGACGG CATTCGCGAC GCCTTCCAGC CGGTGCGCGA CCTGTTGGAA GGCCCCAAGG CCCTGGTCTC CGAGACCGTG CGCGACGAAT TCCGTGGGCG GAAGGCCACG GCACAGACCC GCATCGCAGC CGTGCACGCG GCGGAGCCCT TTGCCTTTTT CTGCCTCGCC GGCGACGGAG ACCGTGCGCC CAAGTGGATC CTGCTACGTC GGCGCGCAGG CCGGACCAAA CCCGCACTGG ACACGGATCT GCACCGGGTC GCCGGCCACC TGCGACGCCT GCTGGGACCC GCAACCGAGA CCCTGCCCCT GGACGATGCC GTGCAGACCC AGCTGAGCGA TTTCTTCCAG GACCTGCAAC ACGCCGAACG GGAGACCCTG CCGAGGCGCA AGCGTTGTGC ACTGGACGAG ATGCAAAGAG TGCTGGAGGC ACATGCCCGG CAAAGCCATG ACCCGCGGAT GCGCCGGGTG GTGAACGCCC TGCTGGTGAC CCTGCGCGCC CCGGACCCCG AGCAGATGCC CGATTGGGAC GCCCTGGCGG AGCGCTGGCT GGACCTGATC CGGCCTGTGT GGTACCGGCG ACTGGCCGAG CGCAAACGGG CCCGGCCCCT GCTGCTCAAG GATATCCGCC AGGACCTGCA AGCCGAGGAT GCCCTGGATG CAGCCACCCT GGAGGCCGCG TTCAAAGACC TCCCCCTGGC AGGTCGGCTC GAAGCACGTC TGGCCGCCTG CGTCATCGGC GTCACTCAAG ACGCTAAACC CGGCTGA
|
Protein sequence | MSQPEHPDNE QIDLFSPSAE GPADAWPEHA RFPCNRPGHT VGEVVLEDLR TSQNPLIVSG YASLERILRL LADLKPEQGP VRLLFGVEPH VTGRERFTLQ YTDFPGEVRD YWLSRGVSVE ASASLIHAIG LIEAGRVEAR YLGGSRKRLH AKIYCGDAGV TLGSSNFTDS GLRHQLEANV RYTAAEPRRF YEARRIAENF WDEGLPYGDE LLALLHRLLA VVGWQEALAR ACAELLDGEW ARNYLGSLRL EDAGLWPSQI QGIAEALWLM ENVGSVLVAD ATGSGKTRMG AHLIRAAMDR IWGSGRMRKG RPLMVCPPAV RHAWEREARL CDLPLETLSH GVLSRAAGNA GEQVEDALRR AQVLAVDEAH NFLNLGSRRT RMVLSNMADH TLLFTATPIN RSVIDLLRIA DTLGADNLAP ETLTMFEGLL RSRRLDRSLT PGELSALRQE LERFTVRRTK RELNALVDRD PEAYRDARGK PCRYPRHHSH IYPLNEPAAD QALAGEIRAL AGKLRGLLYL RGAIEMPAVL KREHWDTARY LDSRLKSAHM LPAYVVSAAL RSSRAALAEH LKGTAFALEY FGLTGQFNKP PTGDVLGKLR EIRGQIPENR LGIPLPDWLC DPEAHAQACD EEAALYEAMW SALTRLTDGR ERAKAALLTR LLSDHHLVLA FDSRPITLAV IRSLLTQESD ARVLLATGDS GSDRSEIAEV LRPGSAHQRV VALCSDSLSE GVNLQQASTI VHLDMPSVVR IAEQRVGRVD RMDSPHEAID AWWPDDASSF ALSSDERFIE RFETVENLLG SNLPLPEEML GKGHGAVRTE ELIREFEEQV DADPWDGIRD AFQPVRDLLE GPKALVSETV RDEFRGRKAT AQTRIAAVHA AEPFAFFCLA GDGDRAPKWI LLRRRAGRTK PALDTDLHRV AGHLRRLLGP ATETLPLDDA VQTQLSDFFQ DLQHAERETL PRRKRCALDE MQRVLEAHAR QSHDPRMRRV VNALLVTLRA PDPEQMPDWD ALAERWLDLI RPVWYRRLAE RKRARPLLLK DIRQDLQAED ALDAATLEAA FKDLPLAGRL EARLAACVIG VTQDAKPG
|
| |