Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_1049 |
Symbol | |
ID | 5876553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1082556 |
End bp | 1083752 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641541404 |
Product | carboxyl-terminal protease |
Protein accession | YP_001662684 |
Protein GI | 167039699 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000950158 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA AAAGATTTTA TATTTTGTTA GCAATGCTTT TGATAGTTAC AAATGTCATA ACTTTTGCAC TCGCAAATGT GGTGTCAGTG GCTCTTCCCA ATGGTAAAGT AATTGTTTCT CGCGAGGAGT ACCAGTTGAT AAAAAAATAT AGTAAACTTT TTGAAATTGA GAAAACCCTT GAAAATAGAT ATGTGGATAG AGTAAATTCT TCAATTCTTT TAGAAGGCGC TCTGAAAGGA ATGGCTAATT CTTTAGAGGA CCCTTATACT GTATACATGA ATAAAAAAGA ATTTTCTGAT TTTATGACTC AAACTACAGG TACTTATGGG GGAATAGGGA TAGTTGTAGC AGTTGATAAA GAAGACCATA TTGTGGTGGT TTCTCCAATA AAGAATACAC CGGGGGAAAG AGCAGGAATA AAATCCGGAG ATATAATAGT AGAAGTGAAC AATAAAAAAG TAAGTGGCAA AAATTTAGAT GAAGCAGTAG CTATGATGAG AGGACCTCAA GGAACAGAGG TAACCCTTAC TATAATGAGA GAAGGAAAAA CTTTTACTAA GACAATTACA AGAGAGATAA TAAAATTAGA AACAGTATAT GATGAAATGC TTCCTGATAA GATTGGGTAT ATTAAGATTA CAATGTTTGA CCAAAGCACA GCTGATGACT TTAAGGCAGC TCTTGATAAA TTAAAGTCTC AGGGTATGAA AGGACTTATA CTAGATTTAA GAGATAATCC TGGTGGACTT TTAGAGGAAA CTATAGATAT TTCTAATTTA ATTTTGCCAA AAGGGGTTGT TGTGACGACC AAAGGGAGAG TTGACAATAA AGAATATTAT TCTAAAGGAC CTGGTCTGGG ATTGCCACTT GCTGTGCTTG TAAATAAAGG CAGTGCTAGT GCCTCAGAAA TTTTAGCAGG TGCAATAAAA GATAGGAAAG TAGGAGTTTT AGTTGGATCA AATACTTTTG GAAAAGGACT TGTACAAACT ATTGTTGACT TTGGTGATGG TACAGGATTA AAATATACTA TTGCAAGGTA TTACACGCCA AATGGCACAA ATATTCAAGG CAAAGGAATT GAGCCCAACT ATGTAGTGGA GCTTCCTGAA AGTTACACTC TTCAAGATAC TCCTGACTTA AAAGGAGATA CTCAGCTTAT AAAAGCTTTT GAAATTGTAA AAAGTGAGAT AAAGTAG
|
Protein sequence | MAKKRFYILL AMLLIVTNVI TFALANVVSV ALPNGKVIVS REEYQLIKKY SKLFEIEKTL ENRYVDRVNS SILLEGALKG MANSLEDPYT VYMNKKEFSD FMTQTTGTYG GIGIVVAVDK EDHIVVVSPI KNTPGERAGI KSGDIIVEVN NKKVSGKNLD EAVAMMRGPQ GTEVTLTIMR EGKTFTKTIT REIIKLETVY DEMLPDKIGY IKITMFDQST ADDFKAALDK LKSQGMKGLI LDLRDNPGGL LEETIDISNL ILPKGVVVTT KGRVDNKEYY SKGPGLGLPL AVLVNKGSAS ASEILAGAIK DRKVGVLVGS NTFGKGLVQT IVDFGDGTGL KYTIARYYTP NGTNIQGKGI EPNYVVELPE SYTLQDTPDL KGDTQLIKAF EIVKSEIK
|
| |