Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1626 |
Symbol | |
ID | 7316939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1741710 |
End bp | 1744487 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643616518 |
Product | hypothetical protein |
Protein accession | YP_002513696 |
Protein GI | 220934797 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCTTT CCGTCCTGCT GCGCATGCAG GACCAGATGA CGGCACCGGT GAAGCGCGCC CAGGAGAGCC TGGGCGCGTT CAAGCGTAAT GCCGAATCGA TCGGCCGCCA GGTGGGCTTC GAGCGGGTGA CGGAGAATTT CAGCCGGGCC GGCGAGGCGG GCCGCGAGCT GTACGACACC GCGGGGCGCG TGGGTCGCCG CCTGGCGGCG GTCAGCGCCG CCGGCGCGGC GGCGCTGACC GGCATCACGG TGGCGGTGGC CAACAGCGCC GACGAGCTGG CCAAGTTCGC CGACCAGATG GACATCCCGG TGGAGAGCCT GCAGGAGTGG CAGTACGCCG CCGAGCGCAT GGGGGTGAGT CAGTCGACCT TCAACAGCTC GCTTGCGGCC TTCTCGCGGC GCCTGGCCGA TGCCCGTGAC GGCACCGGGG CGCTGCATTC GCGACTGAAG GAGACCAACC CCGAGCTGCT CGAGCAGCTG ACCCATGCGC GCGACACCAA TGAGGCGCTG AGCCTCTACA TGCGCGCCAT GGAGGGCGCG GCAGACCAGA CCGAGCGTAA CGGCATGGCG GCGGCGGCGT TCAGCCGGGC CGGCATGGAC ATGTCGCGGA TGGTGCGCGA CGGCGCCGAC GAGATCGACC GGCTGCGCGA GCGGGCGCGG GATCTGGGCT TCATCCTTTC CGACGAGACG GCGCGCCAGG GCGAGCAGTT CGCCGACCAG ATGCTGGACA TGCGCAAGTC GATGCTGGGG CTGCGCAATA TCATCGGCGG TGCGCTGATG CCGGCGTTCG GCGACCTGAT GGACCAGATC ACCGCCGGGG TGCTGTACGT GCAGCCGCTG GTGCGCGAGT GGGCCGAGGG CTTCGCTGCC GACCTGCCGG GGCGGATTGC CTGGCTGCGC GGCGAGTTCG ACGCGCTGAT GGTGACGATC AACCCGGCGC TGGAGCTGGG CGCGGCGCTG GTGGAGCGCT TCGGGGCGCT GCACGTGGCG GCCGGGGCGC TGGCACTGAT GATCGGCGCG CCCTTGATCG TGCCGCTGGT GAAGTTCACC GCGGCGATGG TGTCGCTGGG CTGGGCGCTG GGCCGGGTGG GCTTCGCCCT GGTGGGGCTG GCGGTGAAGG CCCTGCCGGC GGTGCTGGCC GGGCTGAAGA CGCTGGCGCT GGCGGCCATG GCGCACCCGG TGCTGGCGAC GGTGGCGCTG CTGGCCGGCG GTGCGGCGCT GCTGATCGCC AACTGGGACC GGGTGGGCCC GTTCTTCCGC GACACCTGGG CGACGGTGAG CGACGCCACG CGGCGCGGCT GGGAGGCGGT GACCGAGTGG CTGGGCTTCG ATCCGCTGGC GATGCTCGAG CCGCTGTGGC GGCCGATCGG CGACTGGCTG GGCCGCCAGC TGGACGCCTG GAAGCGGCTG CTTTCCGGCG AGTGGGGCGC GGTGAAGGAG ATCTTCCGCT GGAGCCCGCT GGGGCTGCTG GTGAGCGGCT TCGGCCGGGC GCGGGAGTGG CTGGCCGGGC TGACCTGGCG CGATGCCGCC GAGGGCGGCT GGAGCGCGCT GAAGGCGCTG TACCGCTGGT CTCCCCTGGG CGTGATGCAG CGAGGCTTTG GCCGGGCGCT GGAGTACCTG GGTGGCGTGG ACTGGCAGGC GGCGGCCGAG GCTGGCTGGA GCGCGCTGAA GGCGCTGTAC CGCTGGTCTC CCCTGGGCGT GATGCAGCGC GGCTTTGGCC AGGCGCTGGA CTACCTGGGC GGGGTGGACT GGCAGGCGGC GGCCGAGGGT GGCTGGAGCG CGCTGAAGGC GCTGTACCGC TGGTCTCCCC TGGGCGTGAT GCAGCGGGGC TTTGGCGAGG CGCTGGCCTG GCTGCGCGGC GTGGACTGGC AGGCGGCGGC CGAGGCTGGC TGGAGCGCGC TGAAGGCGCT GTACCGCTGG TCTCCCCTGG GCGTGATGCA GCGAGGCTTT GGCCGGGCGC TGGAGTACCT GGGCGGGGTG GACTGGCCGG CGGCGGCCGA GGCTGGCTGG AGCGCGCTGA AGGCGCTGTA CCGCTGGAGC CCGCTGGGGC TGTTGCAGCG GGGCTTTGGC GGGGCGCTGG CCTGGCTGCG CGACCACTGG GACGCGTTGA CCGCCGAGGC GGGCTGGGCC TGGGACGTGC TCAAGGGGAT TCTGGCCTGG TCGCCGCTGG CGACGATCGA GGCGGCCTGG GGCGGCGTGG CCGATTGGTT CGGCGGCATG TGGGACGGCA TCACCGAGCG CGCCGAGCGG GCGATCGGCT GGATCGCCGA CCGCCTGGAG TGGGTGGGCA ACGCCTTTTC GCGGGCGGCC GGGGCGCTGG GCATCAGCCG TAACGACGAG CCGGTGGTGG CGGTGGCCGG CCCGGGGGCG CCGACGCGGA TGCCGCGCCG CGACGCCGAG CCGGGCGGCG ACACGCCCCC CGCCGAGCCC CACCCCCTGG CGCGCTCGCG CGAGCTGCTG GGTAGCGAGC CTGCCCCGGC GGCGCAGGAT GAGGCGCCGG CCTGGGTGCG GGAGTGGCAG CGCCAGCGCG ACGCCGAGGC CGCGCAAGCC CCCGAGGCCG CCGAACCGCT GACCATGGCG CGGCGCGGAG CGCCGGAGCA CGAACGCCGC CAGGGCGGCG GCACCACGGT GCAGCGGGTG GACTTCCGCC CGCAGATCAC GATCGAGGCC GGGGCCAACG CCAACGCCGA GGACCTGGCC AACCTGTTGG ACGAGCGGCT GTCCCGCTAT CGCGACGAGA CGATGTGGGA GCAGGCGCAC GGCGTGGAGG CCGACTGA
|
Protein sequence | MALSVLLRMQ DQMTAPVKRA QESLGAFKRN AESIGRQVGF ERVTENFSRA GEAGRELYDT AGRVGRRLAA VSAAGAAALT GITVAVANSA DELAKFADQM DIPVESLQEW QYAAERMGVS QSTFNSSLAA FSRRLADARD GTGALHSRLK ETNPELLEQL THARDTNEAL SLYMRAMEGA ADQTERNGMA AAAFSRAGMD MSRMVRDGAD EIDRLRERAR DLGFILSDET ARQGEQFADQ MLDMRKSMLG LRNIIGGALM PAFGDLMDQI TAGVLYVQPL VREWAEGFAA DLPGRIAWLR GEFDALMVTI NPALELGAAL VERFGALHVA AGALALMIGA PLIVPLVKFT AAMVSLGWAL GRVGFALVGL AVKALPAVLA GLKTLALAAM AHPVLATVAL LAGGAALLIA NWDRVGPFFR DTWATVSDAT RRGWEAVTEW LGFDPLAMLE PLWRPIGDWL GRQLDAWKRL LSGEWGAVKE IFRWSPLGLL VSGFGRAREW LAGLTWRDAA EGGWSALKAL YRWSPLGVMQ RGFGRALEYL GGVDWQAAAE AGWSALKALY RWSPLGVMQR GFGQALDYLG GVDWQAAAEG GWSALKALYR WSPLGVMQRG FGEALAWLRG VDWQAAAEAG WSALKALYRW SPLGVMQRGF GRALEYLGGV DWPAAAEAGW SALKALYRWS PLGLLQRGFG GALAWLRDHW DALTAEAGWA WDVLKGILAW SPLATIEAAW GGVADWFGGM WDGITERAER AIGWIADRLE WVGNAFSRAA GALGISRNDE PVVAVAGPGA PTRMPRRDAE PGGDTPPAEP HPLARSRELL GSEPAPAAQD EAPAWVREWQ RQRDAEAAQA PEAAEPLTMA RRGAPEHERR QGGGTTVQRV DFRPQITIEA GANANAEDLA NLLDERLSRY RDETMWEQAH GVEAD
|
| |