Gene Information |
Locus tag | Tgr7_1040 |
Symbol | |
ID | 7316617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 1129571 |
End bp | 1130728 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643615926 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_002513114 |
Protein GI | 220934215 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
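The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the stop codon while Protein Length does not. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1129571
END_BP = 1130728
GENE_LENGTH_BP = 1158
PROTEIN_LENGTH_AA = 385


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes an intact, unspliced CDS whose length is a multiple of three.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1130728 - 1129571 + 1 gives 1158 bp, and 1158 / 3 - 1 gives 385 aa, matching the table.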
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.343931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAT TGACGCCGCG AGGCTGGCAA GTCGATGCGC TCGCTAATTG GGAACAAGCT AACCATCGCG GCATCGTAAG TGTCGTTACG GGCGGCGGTA AGACGGTGTT TGCGCTGTCA TGCGTCGACC GCATTCGGCC GGTCGCGACA CTCATCGTCG TCCCGACCTC GGCGTTGCTT GAGCAATGGT GGGAAGAGGC AGCAAGCTAC TTCGATCTTG ATCTTGATGA AATAAACATC ATCACTGGGA GCCTGCGCTT CCGCCTTGGG GCGATTAACA TCGCAGTGCT CAACACGGCC GCCAAGTTGC CTGAACGCAT GCAGGAGCAG CATAAGTGCT TCCTAATAGT CGATGAGTGT CACAAGGCTG CATCTGAACA GTTTCGTTCG GCACTTCAAG TTCCAACCTT CGCGTCTTTG GGCCTTTCGG CAACTCCTGA ACGGCAATAT GACGACGGCC TGAAGGATGT GCTCGTTCCA GCACTCGGCG AGATCATCTA CAGCTATGAC TACACAGATG CACTGCGTGA CGGTGTAATC GTGCCCTTCG AACTGAAGAA CATCGTCTTC GAGTTGGAAG CAGGCAGACA AGCCGAATAT GACAAGCTCA GCCGAGCCAT AGCGCGCTCC ATCAGCCAGC ATGGCACCGA AGCTGAGGAA ACGATTGCAC TGTTCCTGAA GCGAGCCCGC GTCTTGAATC TCAGCCTGAA CCGAATTCGC TTGGCGCTTA AGTTGGTGGC TGCAAATCGC GGCAAGCGTA CCCTGATCTT TCATGAAGAT ATCGAAGCCT GCGATCTGAT CCACGGCGTG CTATCCGAAA ACGGCGTCAA GAGTGGCGTG TATCACTCTA AGCTGCCGCT CCGCGCGAAA GCCGCGATGT TGGGACAGTA CCGACGCGGT GAGATCGATG TGTTGGTAAC GTGTCGTGCG CTCGACGAAG GCTTCAACGT GCCCGAAACT GAAATCGGTA TCATTGCTGC TAGCACGGCG ACGCGGCGCC AGCGTATCCA ACGACTCGGT CGGGTCGTGC GACCCGCGAA GGGAAAAAAC CGTGCTTCAA TCTATACGCT CGTTGCGACC GGTCTAGAAA TTCAGCGCCT CAAAGAGGAA GAGGAGCGCC TTGAGGGCGT AGCAACCGTG ACGTGGAGCC GGGCATGA
|
Protein sequence | MPKLTPRGWQ VDALANWEQA NHRGIVSVVT GGGKTVFALS CVDRIRPVAT LIVVPTSALL EQWWEEAASY FDLDLDEINI ITGSLRFRLG AINIAVLNTA AKLPERMQEQ HKCFLIVDEC HKAASEQFRS ALQVPTFASL GLSATPERQY DDGLKDVLVP ALGEIIYSYD YTDALRDGVI VPFELKNIVF ELEAGRQAEY DKLSRAIARS ISQHGTEAEE TIALFLKRAR VLNLSLNRIR LALKLVAANR GKRTLIFHED IEACDLIHGV LSENGVKSGV YHSKLPLRAK AAMLGQYRRG EIDVLVTCRA LDEGFNVPET EIGIIAASTA TRRQRIQRLG RVVRPAKGKN RASIYTLVAT GLEIQRLKEE EERLEGVATV TWSRA
|
| |
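The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is illustrative only and not part of the original record: it joins the blocks, computes a GC percentage, and checks that a sequence looks like a complete CDS. The stop codons are those of translation table 11. The start codon set is a deliberate simplification covering only the three most common bacterial start codons, and all function names are hypothetical.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Simplification (assumption): only the most common bacterial start codons.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def join_blocks(blocked: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an upper-case nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of three and the ends are start/stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # First three blocks of the gene sequence above, used only as a short sample.
    sample = "ATGCCAAAAT TGACGCCGCG AGGCTGGCAA"
    seq = join_blocks(sample)
    print(len(seq), round(gc_percent(seq), 1))
```

Passing the full Gene sequence field through `join_blocks` and `gc_percent` gives a value that can be compared with the GC content reported in the Gene Information table. The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is needed for this check even though the gene lies on the minus strand.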