Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0433 |
Symbol | |
ID | 8413282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 500625 |
End bp | 502040 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 645022001 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003179455 |
Protein GI | 257784238 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000434978 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAACA AAACATATAG TTTAACCACA GATTCCGTTG TTTATAAGCG TTTCTCGCCA GTTAAGCGTG ATGGGTTTAT GTCTAAGCTT CGTAGCTCAA GCATAAAAAG AACAGCTTGT GCAATTGCCT TTGCTGTTGT ATGCGTTCCT ACAATTGCTT TGGGAGAGGG AGTTGCTAAA CTCCCTGCAG TGTCTGGTCA GGCAACTATT TCTCAAGCAG CGGCTCGTCC TGCTGCAGAA CCAACTGACA CCAAGACGGC AGAAACGGTA GCTTCTAAGG TTTTGCCTTC GGTCGTTTCT GTTAAAGCTA CTAGTGATTC ATCTGGTTCA ACCGGTTCTG GAGTTGTGTT GAACACAAAT GGAGATATTC TTACTAATTA CCACGTTATC GAGGGTATGG ATACCTTCTC TATTTCAATT AATGATAAAG AATATGACTG CACCGTTGTA GGTACAGATC CTACCTCTGA CCTTGCTGTC CTTCATGCTG ACCTCAAGGG AGATAGTGTA ACCCCAATTG AGATTGGCAA TTCCGATAAC CTGGCCCCTG GTAGTTGGGT TATGAGTGTT GGTAGTCCGT TTGGCCTTGA TCATTCTGTT TCTGCAGGTA TTGTATCTGC GCTTTCTCGT GGCGATATGC TTGAAACAGA AGGTGGAGAA ACCACTATTT ATGCCAACCT CATCCAGGTT GACGCTGCTA TTAACCCAGG TAATTCTGGC GGTGCTTTGG TTGACTCTAA TGGTCAGCTT GTAGGTATTT GTACGCTGTA CTCGTCTGAC ACTAAGTCGT TTGCTGGTAT TGGTTTTGCA ATTCCTATTA ACTATGCCAT TGATATTGCA AATCAGATTC TTTCTGGTCA GCCGGTAAAG CATGCTTATA TTGGTCTTTC TATGCAGACT GTTACTCCAC GAGCTGCAAA GCGCAATAAT CTTTCAGTAG ATTATGGTGC CTATGTGGCA GGTCTTCTCG ATGATAGTCC TGCAGGAAAC GCCGGTATTA AAAAGGGAGA TGTCATTATC TCTATTGGTG GAGAGCGCGT AATTTCTGCT GATGCAGCTA TCATTGCCGT TCGTTCTCAT AAGGTTGGAG AGACAGTTCC TGTAGAAATC ATGCGTGGAG AAGACCGTCT TACCATCAAT GTCACTCTTG GTTCTGATGA GACGCTTTCT TCTATGAAAG ATAAAGAAAA GAATAACAAG AACAATACAA ACGACACTGA TGATAAACAG GATCGTACTG AGGATGATGA TGATTCCTAT AGACATTATC GTCAGCAAAA TTGGTGGCAG GAATTCTGGG ATTACTTCAC TAATCCTTAT GGAGACAGCG ATACTGATTC CGATAATCCA TTTGAGAATT TTGTAGAGAG TATTACTAAT AGCATTGCAG AAGCAATTTA CGGAATTTTT GGTTAA
|
Protein sequence | MENKTYSLTT DSVVYKRFSP VKRDGFMSKL RSSSIKRTAC AIAFAVVCVP TIALGEGVAK LPAVSGQATI SQAAARPAAE PTDTKTAETV ASKVLPSVVS VKATSDSSGS TGSGVVLNTN GDILTNYHVI EGMDTFSISI NDKEYDCTVV GTDPTSDLAV LHADLKGDSV TPIEIGNSDN LAPGSWVMSV GSPFGLDHSV SAGIVSALSR GDMLETEGGE TTIYANLIQV DAAINPGNSG GALVDSNGQL VGICTLYSSD TKSFAGIGFA IPINYAIDIA NQILSGQPVK HAYIGLSMQT VTPRAAKRNN LSVDYGAYVA GLLDDSPAGN AGIKKGDVII SIGGERVISA DAAIIAVRSH KVGETVPVEI MRGEDRLTIN VTLGSDETLS SMKDKEKNNK NNTNDTDDKQ DRTEDDDDSY RHYRQQNWWQ EFWDYFTNPY GDSDTDSDNP FENFVESITN SIAEAIYGIF G
|
| |