Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1784 |
Symbol | |
ID | 7317594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1899181 |
End bp | 1902537 |
Gene Length | 3357 bp |
Protein Length | 1118 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643616676 |
Product | hypothetical protein |
Protein accession | YP_002513853 |
Protein GI | 220934954 |
COG category | [R] General function prediction only |
COG ID | [COG2251] Predicted nuclease (RecB family) |
TIGRFAM ID | [TIGR03491] RecB family nuclease, putative, TM0106 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.424401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAAAG TTCAGAATAA TCTGATTTTC TCAGCTTCAG ACCTGAGCTA TTTCCTCGAA TGCCCCCATC GCACTACGCT TGACCGCCTG AGCCTCGACC AGCCTATGGA CAAGGCTGTA GCCGGTGAGG AGACGAGACT CATCCAAGAG AAGGGTGTCG AGCATGAGCG GGCCTACCTG GAATCACTGA AATCCAGTGG TCGGAATGTC GTTGAGATAG ATGACGCGCT CGGTTTGGAT GAGCGGTGTG CAGCCACTCA GGAGGCCATG CGGGGTGGAG CAGATATCAT CTACCAGGCG GTCTTTATGA ATGACCGCTG GTTGGGCTAT GCGGATTTTC TCAAGCGTGT AGAAGGGGCT TCGGAGCTGG GCGATTACAG CTACGAACCG GTGGACACCA AACTTTCCAC ACAACCCAAA ACCAAGCACC TGATCCAGCT ATGTGTCTAT TCTGATCTGC TGCAGGACGC CCAGGGAACA GTGCCAGAGT CTATGCACCT GGCACTTGGC AATGGCGAGA GCCGCAGTTT CCGGGTTGAG GACTATCGGC ATTACTACGC CCGATCTCGG GATCAATTCC TCGGCTTTAT TGAGAACCCG ACAGAAACCC GGCCGGAACC CTGTGATTTC TGCAACTTCT GCCACTGGCA TGAGCGCTGT GAAAAGCAGT GGGAAGTTGA GGATCATCTG AGCCTCGTTG CCAATATCAC TAAAGGCCAG ATCCGCAAGT TGCGGGATGC AGGTATCCAT ACGGTGGCCG ACCTCGCAGG CCACGATCAG TCTGTAGCTA TTTCTGGCAT GAACAAAGAG GTCCTGGAGC GGCTTCGTGA ACAGGCCTCC CTTCAGGTCC AGGGCAGGAC TTCCGGCAAA CCGGTCTACC GTCTTTTAAA GCAGGATCCA GATGGCCGAA AAGGTTTCTT CCGGATGCCT GAACCGTCGG AGGGCGATCT CTTCTTTGAC ATGGAGGGCG ACCCGCTCTA TCCGGAGGGG CTTGAGTACC TCTTCGGTGT CTATTACCTG GAGAACGGGG AGTGGCGATT CACCGCGTTC TGGGCCCATG ATCACGATGC CGAGAAGAAG GCCCTGGAAG ATTTCGTCGA TTTCGTTGTC GAGCGACTGA AGCAGTATCC GGATGCCCAC ATCTACCACT ACAACCACTA TGAGGTGACC GCGATCAAGC GCCTCATGAG TCGCTACGCC ACCCGTGAAC GGGAGGTGGA TGATCTGCTG CGTCGCGAGC GTTTCGTGGA CCTTTTCAAG GTGGTTCGGG AGTCGATCCG TGTCTCAGAG CCATCCTACT CCATCAAAAA TCTTGAGCAT TTCTACATGG ATGCCCGTGA TGCGGACGTC AAAACGGCCG TGGGTAGTAT CGTCTGGTAT GAACACTGGC GCGAGAGCCG GGACGACGAC CTGCTGGAAC AAATCCGCAA GTACAACGAG GATGACTGCC GTTCCACGCT CCTGCTCCGG GACTGGCTCC TTGGGCTCCG TCCTTCCAAT CTACCCTGGT TCAGCGGGGA GGTGCGGGGG GAGTCCGAGG GGGTGGATAG GATCACCGAG CATGAACAAC GTCTGGCCCG ATATGAAAAG GCCCTTCTGG GCAACAAGGC GGAAGATCCC GAGACCCTCC ATCACAACAC GTTGATCTAT CAGTTGCTGG ACTTCCATCG CCGAGAGGCG AAGCCCCAGT GGTGGGCCAT GTTTGCCCGT CAGGATATGG AAACCGCTGA TCTGATCGAG GATTCAGAGT GCCTCGGCGG CCTTGAACTC GTGTCGACAG AAAAAGCCAC CACCAAGTCA ATGGATTGCG TCTATCGGTT CCCGCCCCAG GAGACCAAAC TCAAGGCTGG CGATGTGGTT CACGTGGCTG AATCAAGTGA TCGTCTGGGG ACGATTCGTT CGCTGGATGA CGAAAATGGC ACCGTTACGA TCAGGACAAC CTTCGCCGAG ATCCCCGAGA GTCTTTCGAT CGGGCCAGGT GGTCCTGTCG AAACCCGGGT GCTGAGCGAG GCGTTGTTCC GCTACGCCGA TGCGCATCTT GCGAAAAAGG CCTGTTACCC AGCGCTCGAT GCCTTCCTCA CCCGTAAGCC CCCGCGTCTG AAGGGCAGGG AGGCCCCTGA GCCGCTGGCG CCCCATGGCC ATCTGGCTCA GATTCTCGAT GCCGTGGAAC GGCTGGATGG CAGTCACCTG TACATCCAGG GGCCGCCGGG TGCGGGCAAG ACCTACACCG GCTCCCACCT GATTGTCGCC CTGCTGCGCA AGGGGTTCCG CATCGGCGTG ACCTCGAACA GTCACAAGGC GATCGACAAC CTGCTCGAGG CCGTGGAAAA GGTGGCCCAG AAAGAAGGTG TGGCGTTCAG GGGCGTGAAA AAGACCACCC AGAAGAACGA TCTGGGTTTT GAGGGGCACC AGATCGAGGC CTGTACTTCC AATCCTGACA TCGAGAGCGA TGAGGACTAT CAACTGGTGG CTGGGACGGC ATGGCTGTTT GCCCGGGAGG GCCTGGATCA GCGCTTTGAT TACCTCTTCA TTGATGAGGC AGGGCAGGTG TCACTGGCCA ACCTGGTGGC CATGGGGACC AGTGCGCGGA ACCTGGTATT GCTCGGCGAT CAGATGCAGC TCTCCCAACC GATCCAGGGA TCGCATCCGG GCCACTCGGG CGACTCGGCC CTGGACTATC TGCTGAACGG GCGTTCGGTG ATCGAGCCGG AGCGGGGTGT CTTCCTGGAA ACCACCCGGC GCATGCATCC CGATGTGTGT CAGTTCATCT CGGACGCCAT CTATGCCGGT TGTCTCATGC CCCATGAGGA TAATCACAAG CAGCGCCTGA TGCTCAGCTC CACGGCTCAC CCGGAGCTGG TCCCTAACGG AATCCGCATG GTCGAGGTGA TGCATTCGGA TCGCGGTCAG CAGTGCCCGG AAGAGGCGGA TGTAATTGAG GCCATGTACC AGAGCCTGCT CAAGCAGAGT TACGTAGACA AGCACGGCAA GGAGCACCCG ATGAGGCAGG AGAACATCCT CGTGATCTCG CCCTACAACA TCCAGGTGAA CCTGCTTAAA GGCAGGCTTC CGCCTGGGGC CAGGGTCGGC ACCGTGGACA AGTTCCAGGG GCAGGAGGCC GAGGCTGTGT TGATCTCCAT GGCGACCTCT GACGCCGACA ACATGCCCCG CAACGCGGAT TTCCTTTTCA GCCGAAACCG GCTGAATGTG GCCATCTCCC GTGCCCGCTG TCTGGCTGTG GTGGTGGCAA ATCCTGCCTT GCTGAACGTG CCCTGCAAGA GTGTCACGGA GATGGAGTTG GTCAATACCT TGTGCTGGCT GAAGTCATAC TCCGATCAGC TCCGCGCTCG CCATTGA
|
Protein sequence | MHKVQNNLIF SASDLSYFLE CPHRTTLDRL SLDQPMDKAV AGEETRLIQE KGVEHERAYL ESLKSSGRNV VEIDDALGLD ERCAATQEAM RGGADIIYQA VFMNDRWLGY ADFLKRVEGA SELGDYSYEP VDTKLSTQPK TKHLIQLCVY SDLLQDAQGT VPESMHLALG NGESRSFRVE DYRHYYARSR DQFLGFIENP TETRPEPCDF CNFCHWHERC EKQWEVEDHL SLVANITKGQ IRKLRDAGIH TVADLAGHDQ SVAISGMNKE VLERLREQAS LQVQGRTSGK PVYRLLKQDP DGRKGFFRMP EPSEGDLFFD MEGDPLYPEG LEYLFGVYYL ENGEWRFTAF WAHDHDAEKK ALEDFVDFVV ERLKQYPDAH IYHYNHYEVT AIKRLMSRYA TREREVDDLL RRERFVDLFK VVRESIRVSE PSYSIKNLEH FYMDARDADV KTAVGSIVWY EHWRESRDDD LLEQIRKYNE DDCRSTLLLR DWLLGLRPSN LPWFSGEVRG ESEGVDRITE HEQRLARYEK ALLGNKAEDP ETLHHNTLIY QLLDFHRREA KPQWWAMFAR QDMETADLIE DSECLGGLEL VSTEKATTKS MDCVYRFPPQ ETKLKAGDVV HVAESSDRLG TIRSLDDENG TVTIRTTFAE IPESLSIGPG GPVETRVLSE ALFRYADAHL AKKACYPALD AFLTRKPPRL KGREAPEPLA PHGHLAQILD AVERLDGSHL YIQGPPGAGK TYTGSHLIVA LLRKGFRIGV TSNSHKAIDN LLEAVEKVAQ KEGVAFRGVK KTTQKNDLGF EGHQIEACTS NPDIESDEDY QLVAGTAWLF AREGLDQRFD YLFIDEAGQV SLANLVAMGT SARNLVLLGD QMQLSQPIQG SHPGHSGDSA LDYLLNGRSV IEPERGVFLE TTRRMHPDVC QFISDAIYAG CLMPHEDNHK QRLMLSSTAH PELVPNGIRM VEVMHSDRGQ QCPEEADVIE AMYQSLLKQS YVDKHGKEHP MRQENILVIS PYNIQVNLLK GRLPPGARVG TVDKFQGQEA EAVLISMATS DADNMPRNAD FLFSRNRLNV AISRARCLAV VVANPALLNV PCKSVTEMEL VNTLCWLKSY SDQLRARH
|
| |