Gene Information |
Locus tag | Acry_2959 |
Symbol | |
ID | 5159555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 3237413 |
End bp | 3240232 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640554889 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_001236068 |
Protein GI | 148261941 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
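
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon of the CDS is a stop codon that is not counted in Protein Length. Both assumptions are consistent with the values listed in this record, but the record does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3237413, 3240232)
print(length)                     # 2820, matches Gene Length
print(protein_length_aa(length))  # 939, matches Protein Length
```

Under these assumptions the listed Gene Length (2820 bp) and Protein Length (939 aa) agree with the coordinates.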
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.83474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAC CAGCGTCGAT CATGCGATCC GTCCGGTCGG TGACACTGTC ACTGGCCGTG GCGGGGATGG CGGGCTTCGC CCCCGCCGCC CATGCCGGGC AACCGCGCTC CTACCAGCGG GCGCAGGCGG CCGTGGCGCA CGGCAAGCTG CTCGATGCGC GGCTCGACCT GCTGAACGCC GTCCGCCGGC ATCCGCATGA CGGCGCCGCG CACGCCCTGC TGGGCGAGGT CTCGCTCCAG CTCGGCGATG CCGTCTCGGC CGAGCGCGAG GCGCGCAAGG CCATCGCCGC CGGCTACCAG CCGAACCGCA GCCTGGCCCT GCTGCTGCGC AGCTACGTGG CCCAGGGCAG GTCGATCGCC CTGCTGCACG ATTTCCCGAT CGGCTCGGCC ACCGGGGTCC ACGCCGCCGT CATCGCCGTG GGGCGGGCGC GCGCCAACCT GGTCCTCGCC CGCTTCGACC GCGCCGAGGC CGACCTCGCG CGCGCCGCCG GTTTCGCGCC CGATGCGCCG GGCCTGTACG AGACCCGGAT CGACCTCGCC ATCGCCCGGC ACGACCTCGC CGCGGCGCGG CGCATGATCG CCGCGCGGCT CAAGGTGACG CCGGCCGCGC CGGCCCTGCT GCGCCGCGAG GCGCAACTGG ACATCGGGGA GGGCCAGCCG GCGCGCGCCG CAACCATCCT GCAGCGCGCC AATGCCGCGG ACCCGCTCAA CACCCGCGGC AAACTGCTGC TCATCCAGGC TCACCTGGCC GCCGGCGCGC TGCCCGCGGC GCAGGCCGAG CTCAAGCGCG CGGCTTCCCT GCTGCCCCAT TCGGCCGGGC TCGCGTATTT CCAGGCGACG ATCGACGTCG ACGAACATCA CTGGCATCGC GCCAGCGCGC TGCTCGATCA TCTCGGCGGC ATCGCCACGC AGCTGCCCGG CATCCTGTAT CTTCGCGCGA GGACGGATGC GGCGCTCGGC CAGCCGGCCG CCGCGCTGAC CGCGGCCCAG AAGTTCGCGG CGCAGCAGCC CGGTGACAGC GCCGCGCAGC TGCTGGTGGC CACGCTGGCG GTGCGGAGCG GTCATTACGG CGTCGCCAGC CGCAGCCTCG ACAAGCTGGC GGCGGCCGGG AAGCTGGATG CCCAGGCCCT CGACCTGCGC ACGGCGATCG AGATCCACGA CCACAAGTGG AATCAGGCCG AACAGGACGC GCTTCAGGCC CTCAAGGCGG ATCCGAAGGA TCTCGCGGCG GGGATCAATC TGGCCGAGCT CGCCCAGCAG CGCGGCGACT TCAAGGCGGC CGAGCAGCGA TACCGCGCGG TGCTCGCCGG GGTTCCCGCA TCGGCGAGCG CCGTGCTCAC CTCGGTCGAG ATCCGGCTGG GATCGGCCGC GCTGCTGGCG AACGACGCCC CGACGGTAAC GCAGATGGTC GCCGCGCTGG ACAAGCAGGG CGCCACACCC GCCGCCGCGC AGCTGCAGGC GAAACTGGAC TTCCGCCGCG GCGATCTCGA CGCGGCGCGG GCCGACCTGA CCCGGCTGCT CGCCGCCCGC CCCGGCTCGG TGACGGCGCA GCTCGGCCTC GCGCGGGTCG ACCTGCTGAC GCAGCGGCCG GATGCGGCGC TGAAACGCCT GACCGCCGTG GTGAAGGCGC ATCCCGCGGA TCCGCGCCCC GTGCTCGCGC TGGCCGCCCT GCAACGGACG ATGAACCAGC CCGACGCCGC GCTGGCCACG CTCGATCGCG CGCATGACCG TGCCCCCGAC AATGTCGCGT TCGTGGCGGC GATCCTGCAG CAGCGGCTCG CGGCGAAGCA GTTCACCGAG 
GCCGGCAACC TGGTCTCGAC CCTTCCGGCC GCGATGCAGC GCCAGCCGCA GATCCTGCTC CTGCGCGCGA AGCTCGATGC CGCGCAGGGG CGCCTCGATC AGGCCGCCCT GAACCTGCAA TCCCTGCTGG CCGCGACGCC GGACGACGTG ACGTCGCGGC TGGCGCTCGC GCATATCGAC ATGGCGCGCA AGCAGCCGCA GGCGGCCGAA GCGGTGATCG AGGCCGGGCT CGCGCGTGAT CCGCGCCAGC TCGCCCTCAT GCAGGCCCGG GTCGGCCTCG CCCTGGCGCG GCACGGCGCC GGCGCGGCCG AGCAGGAGGC GAAGCGACTC GCCGCCGCGC CCGACCACAT GCCCGAGGCC GGCGCGCTGG AAGGCGATCT CGCCCTGTCG CTGCATCATT GGGCGAAGGC GGCGGCCGCG TTCAAGGCCG CCTATGACGC CAATCCGTCG TCGGCGCTGG CGGCGGGCGC GATCCGCGCC GACATCCAGG GCGGCCATCG CGACGCGGCG CTGGCCCTGC TCAAGGCGGC GAGCGCCCGG TTCCCCGACA GCGTGGCGCT GAGCGACATG CGGGGATCGG TCGCGCTGTC CCGGGGCGAC CTGAAGGCGG CCGCGGCGGA GTACGCACAC TCCCTGCGCC TGGCGCCGCA GGACGCGGTG GCGCTCAACA ATCTCGCCTG GATCGAGGCG AAATCCGGCA AGCCGGACGC CGAGGCCTTG GCCGAGCGCG CCTATGTCGG GGCGCCGACG CCGCAAACCG CCGACACACT CGGCTGGATC CTTCTCCACG GCACGCCCGG CAAGGCGCGG CGCGCGCGGG CGGCGGCGCT GCTCGATGCG GCGCACCGGG AAGACCCCGC CGACCCGTCG ATCGCCTATC ACGATGCCGC CGCGCTGGCC CGCACCGGCG AGCGGGGCAA GGCGATCGCG ATCCTCAAGC CGATCGTCAC GCCGAAGGCC TCGTTCGCCG ACCATGCCGC GGCGGCGGCG CTGCTGGCGA AACTGGAGCG GCAGGGCTGA
|
Protein sequence | MMKPASIMRS VRSVTLSLAV AGMAGFAPAA HAGQPRSYQR AQAAVAHGKL LDARLDLLNA VRRHPHDGAA HALLGEVSLQ LGDAVSAERE ARKAIAAGYQ PNRSLALLLR SYVAQGRSIA LLHDFPIGSA TGVHAAVIAV GRARANLVLA RFDRAEADLA RAAGFAPDAP GLYETRIDLA IARHDLAAAR RMIAARLKVT PAAPALLRRE AQLDIGEGQP ARAATILQRA NAADPLNTRG KLLLIQAHLA AGALPAAQAE LKRAASLLPH SAGLAYFQAT IDVDEHHWHR ASALLDHLGG IATQLPGILY LRARTDAALG QPAAALTAAQ KFAAQQPGDS AAQLLVATLA VRSGHYGVAS RSLDKLAAAG KLDAQALDLR TAIEIHDHKW NQAEQDALQA LKADPKDLAA GINLAELAQQ RGDFKAAEQR YRAVLAGVPA SASAVLTSVE IRLGSAALLA NDAPTVTQMV AALDKQGATP AAAQLQAKLD FRRGDLDAAR ADLTRLLAAR PGSVTAQLGL ARVDLLTQRP DAALKRLTAV VKAHPADPRP VLALAALQRT MNQPDAALAT LDRAHDRAPD NVAFVAAILQ QRLAAKQFTE AGNLVSTLPA AMQRQPQILL LRAKLDAAQG RLDQAALNLQ SLLAATPDDV TSRLALAHID MARKQPQAAE AVIEAGLARD PRQLALMQAR VGLALARHGA GAAEQEAKRL AAAPDHMPEA GALEGDLALS LHHWAKAAAA FKAAYDANPS SALAAGAIRA DIQGGHRDAA LALLKAASAR FPDSVALSDM RGSVALSRGD LKAAAAEYAH SLRLAPQDAV ALNNLAWIEA KSGKPDAEAL AERAYVGAPT PQTADTLGWI LLHGTPGKAR RARAAALLDA AHREDPADPS IAYHDAAALA RTGERGKAIA ILKPIVTPKA SFADHAAAAA LLAKLERQG
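
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be cleaned before any further processing. The following is a minimal sketch of that step, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its own GC content value was computed, so a result from this sketch may differ slightly from the listed figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence in this record, used as a short demo.
example = "ATGATGAAAC CAGCGTCGAT"
print(clean_sequence(example))       # ATGATGAAACCAGCGTCGAT
print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # GC fraction of these 20 bases only
```

Applied to the full Gene sequence and Protein sequence fields, `len(clean_sequence(...))` can be compared with the Gene Length and Protein Length fields as a sanity check on the copied text.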