Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0411 |
Symbol | |
ID | 7317241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 445076 |
End bp | 446731 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643615295 |
Product | Integrase catalytic region |
Protein accession | YP_002512496 |
Protein GI | 220933597 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00539616 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAACG TAATCGCCAC CCCCTATACC CCGCGCCCAT TACCACAACC CGCTCGATCG GACGACGAGT GGGCGCGGCT CCCGGAGTAC AAGCGTGAGC GTGCCGAGAT GCGGCTGGCG TTGATCAAGG TGGAGCTCGA CAAGGTGGTC GCGGGCGTCG TGTCCGCAAA GTCCGCGGCC CAACACCTCC GGGCGATGAT CGACGCCCGC CGGGTGCCGG AGCACACCCT GGCTATCGCC AAGAAGGTGG GGCGGGCGGG CGAACCGCCC AGCTGGCAGA GCATCGAGCG ATGGCTCAAC GGCTACCGTG ATGCTGGCAT GTTAGGCCTC GTGGATCGAT ACAAGGGCCG GCAAAGGCGT GCCCTCGGGT GCGAGGCTCG CATCCTTTAT TACCTCCGGC ACGGCGCGAA GAACGACGCG GGCAACATCA CAAAGTTCCT GCAACAGGAA GGATTCGATG TCACCTATGG ACAGGTCCGG AGGTACATCA AGACGCTCCC CGCAACCGAG ACGCATCACC AGAGCCGCGT CGGGCAGCTC GAATACAACA GCCGGATGCG CGGTTACGTG CGACGCGCCA CAGCCCACCT TCCAGTGGGT GCGTGCTATC AGTCCGACGG CAACCTGATG CCCATACACC TGGCGCATCC GGTGACGGGC AAACCCTGGC GGCCGGAGAT GACCCCCTGC TTGGATGTCG TGAGCCGGTA CTGGGTCGGG TATTACATCG CCGAGTCGGA GTCGGCGATC GGAACGATGC ACGCCCTCGG GGATTCGATC CGGCGTGAGA ATCACGTCCC TCTGGATTTT CAGGCCGACA ACGGCCCCGG CTTCAAGGCC GGACTCGTGC AGCGCTTCCT GGAAAACCTC GGGATCACGC CCCACCACCC GCGCCCACGC AACCCCAAGG ACAACGGCTA TGTGGAGCGG TTTCACGGCA TCCTGAAAAA CGAGTGCCTG AAGCGGCTGC CGGGTTACTG CGGCAAGGAC GCGAATCCGG ACATGGTGAA GGCCTTCTTG CGCGACGTGC GCAACAAGGA ACAGCAGCTG CTCACTCTCG CTCAGTTCTA CGAAATCGTG GAAGAGTTCC GTCGCTGGTA CAACCACGAG AGGGCGCACG GCGAGATCGA CTGCGCGCCG GCCGCCCTCT GGGCGAGGCT GGAGCGCAAC CCCCCGATTG ACCTGGATGC CGCGATGTTC TGGGACCGGG AGGAACGTAC GGTTTCTGAT TGCCGGATCC GCTTCGCCAA CCGCGAATAC AGCGCGCCCG AGCTGATCCA GTGGAATACC AAGAAGGTCT TCGTCGAGTT CAATCTGCGC TCGGATGCGA TGGTGCGCGT GCTCGATCTG TCGGGCCGAT GGATATGCGA TGCGCCCCGC ACCAAGGAAT CCCCATACCG CTTCAGCTCG ATCATGGAGG ACCGCAAGCA GTTCAACCTG CGCCAGCGGC TCAAGACGAT CGAGCGCAAT CGCGAAGAGA TGGAGGCCCG GGCCGGCCTG GCCGTCACCC ACGACCAGGT GCTGGAACGC ATGGCCGAGC TGGAGCAACC GGACGGAGCT GCTCTGGAAA AGAAAACGGG AAGCACGTCG GCCAACGTGC CCCCCGTCGA GTCCCGCTCG GAGATCGAGC TGGACATCCT TAACACCGAC TACTGA
|
Protein sequence | MGNVIATPYT PRPLPQPARS DDEWARLPEY KRERAEMRLA LIKVELDKVV AGVVSAKSAA QHLRAMIDAR RVPEHTLAIA KKVGRAGEPP SWQSIERWLN GYRDAGMLGL VDRYKGRQRR ALGCEARILY YLRHGAKNDA GNITKFLQQE GFDVTYGQVR RYIKTLPATE THHQSRVGQL EYNSRMRGYV RRATAHLPVG ACYQSDGNLM PIHLAHPVTG KPWRPEMTPC LDVVSRYWVG YYIAESESAI GTMHALGDSI RRENHVPLDF QADNGPGFKA GLVQRFLENL GITPHHPRPR NPKDNGYVER FHGILKNECL KRLPGYCGKD ANPDMVKAFL RDVRNKEQQL LTLAQFYEIV EEFRRWYNHE RAHGEIDCAP AALWARLERN PPIDLDAAMF WDREERTVSD CRIRFANREY SAPELIQWNT KKVFVEFNLR SDAMVRVLDL SGRWICDAPR TKESPYRFSS IMEDRKQFNL RQRLKTIERN REEMEARAGL AVTHDQVLER MAELEQPDGA ALEKKTGSTS ANVPPVESRS EIELDILNTD Y
|
| |