Gene Information |
Locus tag | Tgr7_1490 |
Symbol | |
ID | 7317976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1595263 |
End bp | 1596243 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643616381 |
Product | hypothetical protein |
Protein accession | YP_002513561 |
Protein GI | 220934662 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
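The coordinates and lengths in the table above agree with one another if the start and end positions are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon that is not counted in the protein length. The following is a minimal sketch of that arithmetic; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(1595263, 1596243)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length of 981 bp and protein length of 326 aa.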
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00153565 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTG CCGCCCAGAT CCTTGAGCCG GCCGTCACCT GGCTGATGCG TCGCGAACAT CCCGCCGGCC GCGAAATCAC CCTGCACCGC CGGCGCATCT ACATCCTGCC CACCCGCCAG GGCCTGGTGT TCGGCCTGAT GCTGGTGGTG ATGCTGCTGG GCGCCATCAA TTACAGCAAC AGCATGGCCT TCCTGCTCAC CTTCCTGCTG GCAGGCCTGG GCGCCAATGG TATCTGGCAT ACCCACCGCA ACCTGCTGGG ACTGCGCATC ACGCGCCTGC CCCTGGAGCC GGTGTTCGCC GGCGAAACCG CCTACTGCCG CTACCGCATC GAGAACCCGA GCCACGTCCC CCGTCGCGGC CTGGTGCTGC ATCGCCGGGA TGTCCAGGGT GCCGCCATCG CGGCACAGGC CCGGGCGGAG GCCGTGGCAG AGCTGCCCAT CGTCACCACC CGCCGCGGCA GTCTGCGCCC GGGGCGTTTC AGCCTGTCGA CCACCTATCC CTTGGGCCTG TTCCGGGCCT GGTCCTGGAT CCAGTTCGAC GAGGCCCTGA CGGTCTACCC GCACCCCCTC GCCGTGGACG AGCGCTTCAC CGATGCGGGC AGCAGCGAGG GCGAAGGCCG GCCCGATCCG GGACTGGGCG AGGACTTCTC CGGCCTGCGC GAATACCAGC CCGGCGACTC CCCGCGGCGG GTGGACTGGA AGGCCCTGGC CCGCACCGGC GATCTCTATA CCCGCAACTT CGAATCCCCC CGGGGTGGCG AGCTGTGGAT CGACTGGCAT GCCCTGCCCG CCACGGACAC CGAGACCCGG CTGTCGATGC TCTGTCACCA GGTGCTGCTG GCCCACCAGC AGGGGATCCG TTACGGGCTG CGCCTGTCCG GCCTGGAACT GGAGCCCGAC CAGGGTGCGG ACCACCGGCA CCGCTGCCTC TCGGCCCTGG CCCATTACGG TCAGCCGGCT CAAGGCAGTC CGGTATCATG A
|
Protein sequence | MKLAAQILEP AVTWLMRREH PAGREITLHR RRIYILPTRQ GLVFGLMLVV MLLGAINYSN SMAFLLTFLL AGLGANGIWH THRNLLGLRI TRLPLEPVFA GETAYCRYRI ENPSHVPRRG LVLHRRDVQG AAIAAQARAE AVAELPIVTT RRGSLRPGRF SLSTTYPLGL FRAWSWIQFD EALTVYPHPL AVDERFTDAG SSEGEGRPDP GLGEDFSGLR EYQPGDSPRR VDWKALARTG DLYTRNFESP RGGELWIDWH ALPATDTETR LSMLCHQVLL AHQQGIRYGL RLSGLELEPD QGADHRHRCL SALAHYGQPA QGSPVS
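As a rough illustration of how the gene and protein sequences relate, the sketch below translates a nucleotide string codon by codon and computes its GC fraction. It rests on several assumptions: the gene sequence is given 5' to 3' in coding orientation (as it is here, beginning with ATG), the spaces are only display grouping, and the amino acid assignments are those of NCBI translation table 11, which match the standard code for internal codons. Alternative start codons are not handled, and the function names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_codons = "ATGAAGCTTG CCGCCCAGAT CCTTGAGCCG"
print(translate(first_codons))  # prints "MKLAAQILEP"
```

The printed peptide matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to the same functions would be the natural way to check the complete translation and the reported GC content.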