Gene Information |
Locus tag | Tgr7_1792 |
Symbol | |
ID | 7317602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1910946 |
End bp | 1912103 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643616684 |
Product | hypothetical protein |
Protein accession | YP_002513861 |
Protein GI | 220934962 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01888] CRISPR-associated protein, Cmr3 family |
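
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1910946
END_BP = 1912103
GENE_LENGTH_BP = 1158
PROTEIN_LENGTH_AA = 385


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, End bp minus Start bp plus one gives the 1158 bp gene length, and 1158 / 3 codons minus the stop codon gives the 385 aa protein length.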
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.900228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAACA CCCTGAGCAT CAGCGCGCTC GACACCCTGT TCTTCCGTGA ATCCCGTCCC TTCGATGCGG TCGGTGGCAG CGAGCTGGCG AGTGTGTTCC CGCCGCCACC CCGCGCCCTG GCCGGTGCCG TGAAGGCGCG GATTGCCGAG GCCCTGGGGG TGGACTGGGC GGCTTTTCAC AAGGACCCTG ATGGCTATGA AATCGGAGGA CATTGCCTCA AGGACCTGAT CGGCGACGGC CAGGACCCGG GCGTCCTGCG GCTGCGCGGC CCCTGGCCCT GCCTTGCGGG CGAACGCCTC TACCCGGCGC CGCTCTACCT GCTGGCCGCC GATCAGTTCA AACGCCTGGC CAGGCTGCAG ATCGGCAAGG CTGCCAAAAC CCATCTCGGC ACCGTGCGCC TGCCCACCCT GCCTGGCGGC GAGCGCGGTC TGAAGCCTTT GGACGATCAC TGGCTGACCG CCGACGGTCT GGCACAGGTG TTGTCCGGCG CTTGCCCCGC CCCGGACCAG ATCCACCACT CGGCGGAGCT GTTCAGCGCG GAACCGCGCC TCGGTATCGG CCGTAACAAC CAGACCCGCA CCGCCCAGGA GGGCCTGCTC TACCAGACCC GCCATCTGCG CCCCGTGCCG GAGATGCATC TGGAACTGGA CATCGACGGC CTGCCCGAGG GCCTGCCGGC GTTCGACGGC CTCGTGCGCC TGGGCGGTGA GGGCCGCATG GCCCATCTCG ACATGGGCAA GCCGGCCGCC GCCCTGCCCG CGCCGCCGCG ACCTGATGCC GACACGGTGG GCCTGATCCT CACCCTGCTC ACCCCGGCCC GGGTCGACCC CGGCACCTGG TTGCCGCCGG GCTTTGACAA GGTCGAAACG AAAGACGCGC TCTGCTTCCG GGGCGAACTG AGCGGCATCA GACTGAGCAT CCATGCCGCC GTGATCGGCA AGCCCCTGCG AGAAGGCGGC TGGGACATGG CGCGGCGCGA ACCGCGCCCC GTCGCCAGCC TGCTGCCCGC CGGCAGCGCC TGGTACTGCC GGGTGGAAAG CGGCACGGTG GCTGCTGCCA TCGACGCACT GCACGGCACC CAGGTGGGCG GCGACACCGA ACTGGGGCGC GGCCTGCTGG CCTGCGCCCT GTGGCAAAAA TCCGAAGATC AGAATTGA
|
Protein sequence | MANTLSISAL DTLFFRESRP FDAVGGSELA SVFPPPPRAL AGAVKARIAE ALGVDWAAFH KDPDGYEIGG HCLKDLIGDG QDPGVLRLRG PWPCLAGERL YPAPLYLLAA DQFKRLARLQ IGKAAKTHLG TVRLPTLPGG ERGLKPLDDH WLTADGLAQV LSGACPAPDQ IHHSAELFSA EPRLGIGRNN QTRTAQEGLL YQTRHLRPVP EMHLELDIDG LPEGLPAFDG LVRLGGEGRM AHLDMGKPAA ALPAPPRPDA DTVGLILTLL TPARVDPGTW LPPGFDKVET KDALCFRGEL SGIRLSIHAA VIGKPLREGG WDMARREPRP VASLLPAGSA WYCRVESGTV AAAIDALHGT QVGGDTELGR GLLACALWQK SEDQN
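
The sequences above are displayed in space-separated blocks of ten characters, which must be removed before any computation such as the GC content reported in the Gene Information fields. The following is a minimal sketch of that clean-up and of a GC fraction calculation; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two display blocks of the gene sequence in this record.
    first_blocks = "ATGGCTAACA CCCTGAGCAT"
    print(clean_sequence(first_blocks))
    print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `gc_fraction` in the same way would give a value to compare against the GC content field.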