Gene Information |
Locus tag | Tgr7_1641 |
Symbol | |
ID | 7316954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1752609 |
End bp | 1754180 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643616533 |
Product | hypothetical protein |
Protein accession | YP_002513711 |
Protein GI | 220934812 |
COG category | [S] Function unknown |
COG ID | [COG4383] Mu-like prophage protein gp29 |
TIGRFAM ID | |
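The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon while Protein Length does not. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table in this record.
start_bp = 1752609
end_bp = 1754180
reported_gene_length = 1572    # bp
reported_protein_length = 523  # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # 1572 523
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.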
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGCGC TGGTGGATCA ACACGGCCGG CCAATCCGCC GGCAGGAGCT GGCCGGCGAG GTGCAGACCG CCACGGTGGC GTATCTGCGC CGCGAGTTCG CCGACCACCC GAGCCGCGGG CTGACCCCGG CCAAGCTGGC GCGCATCCTG GAGGATGCCG AGCAGGGCGA CCTGGTGGCT CAGGCGCGCC TGGGCGAGGA CATGGAGGAG AAGGACGCCC ATCTCTACGC CGAACTGTCC AAGCGCCGCC GGGCGCTGCT GGGGCTGGAC TGGGACCTGC GCCCGCCGGA GGGCGCCACC GAGCGCGAGC GCCGCGAGAC CTCGCGCATC GAGGCGCTGA TTCGCGAGCT GGACTGGGAG GCGATCGTCT ACGACGCGGC GGCGGCGATC CTGTACGGCT ACAGCTGCCA GGAGATCGAC TGGGAGCGCA GCGCCGGCGA GTGGCGGCCG CGGGCGATCG ACTACCGTCA GCCCGACTGG TTCATGGTGC AGCCCGAGGC GCGCGACACC CTGCGCCTGC GCAGCGCCGG CGGCCAGGGC GAGGCGCTGC GGCCGTGGGG CTGGGTAGTG CACGTGCACA AGGCCAAGAG CGGCTACCTG GCGCGCGGCG CGCTGACCCG GGTGCTGGCC TGGCCCTTCC TGTTCCGCAA CTTCAGCGCG CGGGACATGG CCGAGTTCCT GGAGATCTAC GGGCTGCCGC TGCGCCTGGG GCGCTACCCC AACGGCGCCA ATGAGCAGGA GAAGGCCACG CTGATGAAGG CGGTGGTGAA CATCGGCCAT GCCGCCGCGG GGATCGTGCC CGATGGCATG CAGGTGGAGT TCAAGGAGGC CGCCAAGGGG CAGAGCGACC CCTTCATGGC GATGATCGGC TGGGCCGAGC GCTCGGTGAG CAAGGCGATC CTGGGCGCCA CCCTGACCAG CCAGACCGGC GAGAGCGGCG GCGGCTCCTA CGCCCTGGGC GAGGTGCACA ACGAGGTGCG CCACGACATC CTGCGCAGCG ACGCGCGCCA GATCGCCCGC ACCCTGACCC AGGCGCTGGT GGTGCCCCTA GTGCGTCTCA ACACTCGCCT GACGCGCATG CCGCAGTGGG TGTTCGACAC CGAGCAGCCG GAGGACCTGA AGGCCTACGC CGACGCCCTG CCGAAACTGG CCAAGGTGAT GCGCATCCCG GCGCGCTGGG CCTACGACAA GCTGCGCATT CCCCAGCCGG AGGGCGACGA GCCGGTGCTG GAGGTGGCCG CCCCGGCGCT GCCCGGGCGC GCGGCGCTGC GCGATCGGCG CGCCGCCCCG CCGGCGCGGG CGGCGGCGCG CACCGCTGGC GACGACACCC TGGTGAGCGA GTGGGCCGAT CGGCTGGAGC GCGACCACGC GGCGGCCTTC GACAACCTGC TGGCCCCGCT GCGCGGCCTG CTGGGCGAGG TCGACTCCCT GGCCGAGCTG CGCGAGCGCC TGGCCGACGT GTACGACGAC ATGCCCGAGG AGCAGCTGGC GCAGCTGCTG CATCGCGCCC TGGCCGCCGC CGAGCTGGCC GGGCGCGACG AGGTGGCCGA GCAGGACGGG GGCGAGGCAT GA
|
Protein sequence | MVALVDQHGR PIRRQELAGE VQTATVAYLR REFADHPSRG LTPAKLARIL EDAEQGDLVA QARLGEDMEE KDAHLYAELS KRRRALLGLD WDLRPPEGAT ERERRETSRI EALIRELDWE AIVYDAAAAI LYGYSCQEID WERSAGEWRP RAIDYRQPDW FMVQPEARDT LRLRSAGGQG EALRPWGWVV HVHKAKSGYL ARGALTRVLA WPFLFRNFSA RDMAEFLEIY GLPLRLGRYP NGANEQEKAT LMKAVVNIGH AAAGIVPDGM QVEFKEAAKG QSDPFMAMIG WAERSVSKAI LGATLTSQTG ESGGGSYALG EVHNEVRHDI LRSDARQIAR TLTQALVVPL VRLNTRLTRM PQWVFDTEQP EDLKAYADAL PKLAKVMRIP ARWAYDKLRI PQPEGDEPVL EVAAPALPGR AALRDRRAAP PARAAARTAG DDTLVSEWAD RLERDHAAAF DNLLAPLRGL LGEVDSLAEL RERLADVYDD MPEEQLAQLL HRALAAAELA GRDEVAEQDG GEA
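The sequence fields can be checked against the Translation table and GC content fields with a short script. This is a sketch under stated assumptions: the gene sequence is taken to be listed in coding orientation (it begins with ATG even though the strand is "-"), and the standard codon assignments are used because translation table 11 shares them with the standard code and differs in which start codons it permits. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments (assumption stated in the text above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in this record and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGTAGCGC TGGTGGATCA ACACGGCCGG"
print(translate(first_blocks))    # MVALVDQHGR
print(gc_fraction("ATGGTAGCGC"))  # 0.6
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and the complete protein sequence to be compared in the same way.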