Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0981 |
Symbol | |
ID | 7315012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1060539 |
End bp | 1062452 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643615866 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_002513056 |
Protein GI | 220934157 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.38316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAACGACT TGGTCAAAAA CCTGATTATC TGGGCGGTCA TTGCCGTGGT GCTGATGTCC GTGTTCAACA ACTTCAGCCC GCGCACGTCG ACGCCGCAGT CCCTGTCCTA TTCCCAGTTC ATCAGCGAAG TGAAGAGCGG GCGCATCAAG AGCGTCTACA TCGAGAACAA CACCATCGAG GGGCGCACCA TCAACGGTGA GCGCTTCACC ACCTACAGCC CCAACGATCC GGGCCTGATC GGTGACCTGC TGAACAACAA CGTGGAGATC CTGGCCCAGG AGCCCCAGCG TCGTTCCCTG CTGATGGACA TCCTGATCAG CTGGTTCCCG ATGCTGCTTC TGATCGGTGT GTGGATCTAC TTCATGCGTC AGATGCAGGG CGGGGCCGGT GGCCGGGGGG CCATGTCCTT CGGCAAGAGC AAGGCCAAGA TGATGAGCGA GGACCAGGTC AAGGTCACCT TCGCCGATGT GGCCGGCTGC GACGAGGCCA AGGAAGAGGT GGCCGAGCTG GTGGAATTCC TGCGCGATCC CAGCAAGTTC CAGAAACTTG GCGGCAAGAT TCCCCGTGGT GTGCTCATGG TCGGTTCCCC GGGTACCGGT AAGACCCTGC TGGCCAAGGC CATCGCCGGC GAGGCCAAGG TGCCCTTCTT CAGCATCTCC GGCTCCGACT TCGTGGAGAT GTTCGTGGGC GTGGGCGCCA GCCGCGTGCG CGACATGTTC GACCAGGGCA AGAAGCACGC CCCCTGCATC ATCTTCATCG ACGAGATCGA CGCCGTGGGC CGCCACCGTG GCGCAGGTCT GGGCGGCGGT CACGACGAGC GCGAGCAGAC CCTGAACCAG CTGCTGGTGG AGATGGATGG CTTCGAGGGC ACCGAGGGCG TGATCGTGAT CGCCGCCACC AACCGTCCCG ACGTACTCGA CCCGGCGCTG CTGCGTCCCG GCCGCTTCGA CCGCCAGGTG GTGGTGCCCC TGCCGGACGT GCGCGGCCGC GAGCAGATCC TCAAGGTGCA CATGCGCAAG GTGCCCCTGG CCGAGAACGT GCGCCCGGAC CTCATCGCCC GAGGCACCCC CGGTTTCTCG GGTGCGGACC TGGCCAACCT GGTCAACGAG GCGGCCCTGT TCGCCGCCCG GGGCAACAAG CGCCTGGTGG ACATGCACGA TTTCGAGCGC GCCAAGGACA AGATCATGAT GGGCGCCGAG CGCAAGTCCA TGGTCATGAA CGACGCGGAG AAGAAGCTCA CCGCCTACCA CGAGGCTGGT CACGCCATCG TTGGTCGCCT GGTGCCCGAG CACGACCCGG TCTACAAGGT GAGCATCATT CCCCGCGGCC GCGCCCTGGG TGTGACCATG TTCCTGCCCG AGGAGGACCG CTACAGCCAC AGCAAGACCC GGCTTGAGAG CCAGATCTGC TCCCTGTTCG GCGGGCGTAT CGCCGAAGAG ATCATCTTCG GCTCCGACAA GGTCACCACC GGTGCCTCCA ATGACATCGA GCGGGCCACC GCCATTGCCC GCAACATGGT GACCAAGTGG GGCCTGTCCG ATCGCCTCGG GCCGCTGTCC TACAGCGAGG ACGAGGGCGA GGTGTTCCTG GGCCGCCAGG TGACCCAGCA CAAGCACATG TCCGACGAGA CGGCCCATGC CATCGACGAG GAGATCCGGC GCGTGATCGA TACCAGCTAC GATCGCGCCA AGAAGATCCT GGAGCAGAAC ATGGACAAGC TCCACGTGAT GGCCGAGGCC CTGATGAAGT ACGAGACCAT CGACGTGGAG CAGATCAACG ACATCATGGA GGGCAAGACC CCCCGTCCGC CCAGTGAATG GAGCGATGAC GAGCCCAAGG CGGGTGGCGG CACCCCCGCC GAGGATGAGC ACGCCAAGGG CGGCGTGATC GGCGGGCCAG CCAGTCAGCA TTGA
|
Protein sequence | MNDLVKNLII WAVIAVVLMS VFNNFSPRTS TPQSLSYSQF ISEVKSGRIK SVYIENNTIE GRTINGERFT TYSPNDPGLI GDLLNNNVEI LAQEPQRRSL LMDILISWFP MLLLIGVWIY FMRQMQGGAG GRGAMSFGKS KAKMMSEDQV KVTFADVAGC DEAKEEVAEL VEFLRDPSKF QKLGGKIPRG VLMVGSPGTG KTLLAKAIAG EAKVPFFSIS GSDFVEMFVG VGASRVRDMF DQGKKHAPCI IFIDEIDAVG RHRGAGLGGG HDEREQTLNQ LLVEMDGFEG TEGVIVIAAT NRPDVLDPAL LRPGRFDRQV VVPLPDVRGR EQILKVHMRK VPLAENVRPD LIARGTPGFS GADLANLVNE AALFAARGNK RLVDMHDFER AKDKIMMGAE RKSMVMNDAE KKLTAYHEAG HAIVGRLVPE HDPVYKVSII PRGRALGVTM FLPEEDRYSH SKTRLESQIC SLFGGRIAEE IIFGSDKVTT GASNDIERAT AIARNMVTKW GLSDRLGPLS YSEDEGEVFL GRQVTQHKHM SDETAHAIDE EIRRVIDTSY DRAKKILEQN MDKLHVMAEA LMKYETIDVE QINDIMEGKT PRPPSEWSDD EPKAGGGTPA EDEHAKGGVI GGPASQH
|
| |