Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_0731 |
Symbol | |
ID | 3762106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 803192 |
End bp | 804595 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637785447 |
Product | peptidase S1C, Do |
Protein accession | YP_391001 |
Protein GI | 78485076 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000553018 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAC AACTAAAATC TTTGTATGGT GCTTGGATGG CTCTGATGCT GGCCTTGAGT TTTTCCTCAG TACAAGCCTC TGGACCGACG GTTACATTGC CCGACTTTTC GCAGTTGGCC TCTGAAAACA GCCCGGTAGT GGTGAATATC AGTACGCTGA AAAAAATTGA AAGACCGGAT CATCCTCAGT TAAGAGGAAT GCCTGATGAG ATGCTACGCT ATTTTTTTGG AATTCCTGAA GGACAGGATC CAAGAGGCGA GCGTCAAGAA CAGGTGAGCT CACTGGGGTC AGGGTTTATT ATTTCATCGG ATGGTTACAT TATCACCAAT CATCATGTGG TTGCGGATGC GGATGATATT GTCGTGAAAT TGAGTAACCG ACAAGAATTA AAAGCAAAAG TTATTGGCAG CGATGAACGT TCGGATATAG CGGTGATAAA AGTGGATGCT AAAAATCTGC CTGTGGCTAA AATTGGAACG TCGAAAAATC TAAAAGTGGG GCAATGGGTG ATGGCGATTG GTGAGCCATT TGGCTTGGAT TACACCGTCA CGCATGGCAT TATCAGTGCA TTAGGGCGTT CGCTTCCAGA CGATACTTAT GTACCGTTTA TCCAAACAGA TGTTGCGATT AACCCTGGTA ACTCAGGTGG ACCATTGTTA AACACCAATG GAGAAGTCAT CGGGGTTAAT GCCCAGATTT ACAGTAATAG CGGCGGTTCA ATGGGGCTTT CATTTTCGAT TCCGATTGAT ATTGCGATGG ATGTTGCGCA ACAACTTAAA ACCAAAGGCC GTGTTGAGCG CGGGTATCTT GGCGTCGGCG TTCAAGAAGT TTCGGGCGAC TTAGCCAAAT CGTTTGATAT GAAAAGACCG ATGGGCGCGC TGGTCACGTC AACAGAAAAG GATTCGGCCG CCAGTGAAGC TGGGATTCAG CCGGGTGATA TTATTATCGA ATTTGCCGGT CGAACAATTC AAAAGTCATC CGATTTACCA CCAATTGTGG GGAACTCTGC CGTTGGAGAA TCGATCAAGG TTAAAATCTT AAGAAATGGA GATTATAAAA CGTTGACGGT TCGTTTGAAG TCGTTAGATG ATATGAAGTT AGCGGCAGCA GGCGCCGAAG CTGAAAATAC GACTTTGGGT GTGATGATGA AAGAAGTCAG CCCCAAAGTG CTTGACAAGT TGAATCTACC ATTTGGAATT GGCGTTTCTA AAGTCAAGCG AGGCAGTGCG GCAGACCGGG CGGGCATTAT CCCTGGGGAT ATTTTGGTGA CGATTAATTT CAAACCAATT AAGTCCATTA AGGCTTTGAA TGAAATTGTT GCCGCTGCGC CAAAAGGTCG TTCTCTTCCT GTGAGAGTGG TTAGAGGGAA GCGTTCTGTA TTTCTTCCTC TGGTATTAAA TTAA
|
Protein sequence | MKIQLKSLYG AWMALMLALS FSSVQASGPT VTLPDFSQLA SENSPVVVNI STLKKIERPD HPQLRGMPDE MLRYFFGIPE GQDPRGERQE QVSSLGSGFI ISSDGYIITN HHVVADADDI VVKLSNRQEL KAKVIGSDER SDIAVIKVDA KNLPVAKIGT SKNLKVGQWV MAIGEPFGLD YTVTHGIISA LGRSLPDDTY VPFIQTDVAI NPGNSGGPLL NTNGEVIGVN AQIYSNSGGS MGLSFSIPID IAMDVAQQLK TKGRVERGYL GVGVQEVSGD LAKSFDMKRP MGALVTSTEK DSAASEAGIQ PGDIIIEFAG RTIQKSSDLP PIVGNSAVGE SIKVKILRNG DYKTLTVRLK SLDDMKLAAA GAEAENTTLG VMMKEVSPKV LDKLNLPFGI GVSKVKRGSA ADRAGIIPGD ILVTINFKPI KSIKALNEIV AAAPKGRSLP VRVVRGKRSV FLPLVLN
|
| |