Gene Apar_0433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0433 
Symbol 
ID8413282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp500625 
End bp502040 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content43% 
IMG OID645022001 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003179455 
Protein GI257784238 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000434978 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAACA AAACATATAG TTTAACCACA GATTCCGTTG TTTATAAGCG TTTCTCGCCA 
GTTAAGCGTG ATGGGTTTAT GTCTAAGCTT CGTAGCTCAA GCATAAAAAG AACAGCTTGT
GCAATTGCCT TTGCTGTTGT ATGCGTTCCT ACAATTGCTT TGGGAGAGGG AGTTGCTAAA
CTCCCTGCAG TGTCTGGTCA GGCAACTATT TCTCAAGCAG CGGCTCGTCC TGCTGCAGAA
CCAACTGACA CCAAGACGGC AGAAACGGTA GCTTCTAAGG TTTTGCCTTC GGTCGTTTCT
GTTAAAGCTA CTAGTGATTC ATCTGGTTCA ACCGGTTCTG GAGTTGTGTT GAACACAAAT
GGAGATATTC TTACTAATTA CCACGTTATC GAGGGTATGG ATACCTTCTC TATTTCAATT
AATGATAAAG AATATGACTG CACCGTTGTA GGTACAGATC CTACCTCTGA CCTTGCTGTC
CTTCATGCTG ACCTCAAGGG AGATAGTGTA ACCCCAATTG AGATTGGCAA TTCCGATAAC
CTGGCCCCTG GTAGTTGGGT TATGAGTGTT GGTAGTCCGT TTGGCCTTGA TCATTCTGTT
TCTGCAGGTA TTGTATCTGC GCTTTCTCGT GGCGATATGC TTGAAACAGA AGGTGGAGAA
ACCACTATTT ATGCCAACCT CATCCAGGTT GACGCTGCTA TTAACCCAGG TAATTCTGGC
GGTGCTTTGG TTGACTCTAA TGGTCAGCTT GTAGGTATTT GTACGCTGTA CTCGTCTGAC
ACTAAGTCGT TTGCTGGTAT TGGTTTTGCA ATTCCTATTA ACTATGCCAT TGATATTGCA
AATCAGATTC TTTCTGGTCA GCCGGTAAAG CATGCTTATA TTGGTCTTTC TATGCAGACT
GTTACTCCAC GAGCTGCAAA GCGCAATAAT CTTTCAGTAG ATTATGGTGC CTATGTGGCA
GGTCTTCTCG ATGATAGTCC TGCAGGAAAC GCCGGTATTA AAAAGGGAGA TGTCATTATC
TCTATTGGTG GAGAGCGCGT AATTTCTGCT GATGCAGCTA TCATTGCCGT TCGTTCTCAT
AAGGTTGGAG AGACAGTTCC TGTAGAAATC ATGCGTGGAG AAGACCGTCT TACCATCAAT
GTCACTCTTG GTTCTGATGA GACGCTTTCT TCTATGAAAG ATAAAGAAAA GAATAACAAG
AACAATACAA ACGACACTGA TGATAAACAG GATCGTACTG AGGATGATGA TGATTCCTAT
AGACATTATC GTCAGCAAAA TTGGTGGCAG GAATTCTGGG ATTACTTCAC TAATCCTTAT
GGAGACAGCG ATACTGATTC CGATAATCCA TTTGAGAATT TTGTAGAGAG TATTACTAAT
AGCATTGCAG AAGCAATTTA CGGAATTTTT GGTTAA
 
Protein sequence
MENKTYSLTT DSVVYKRFSP VKRDGFMSKL RSSSIKRTAC AIAFAVVCVP TIALGEGVAK 
LPAVSGQATI SQAAARPAAE PTDTKTAETV ASKVLPSVVS VKATSDSSGS TGSGVVLNTN
GDILTNYHVI EGMDTFSISI NDKEYDCTVV GTDPTSDLAV LHADLKGDSV TPIEIGNSDN
LAPGSWVMSV GSPFGLDHSV SAGIVSALSR GDMLETEGGE TTIYANLIQV DAAINPGNSG
GALVDSNGQL VGICTLYSSD TKSFAGIGFA IPINYAIDIA NQILSGQPVK HAYIGLSMQT
VTPRAAKRNN LSVDYGAYVA GLLDDSPAGN AGIKKGDVII SIGGERVISA DAAIIAVRSH
KVGETVPVEI MRGEDRLTIN VTLGSDETLS SMKDKEKNNK NNTNDTDDKQ DRTEDDDDSY
RHYRQQNWWQ EFWDYFTNPY GDSDTDSDNP FENFVESITN SIAEAIYGIF G