Gene Ajs_4033 details


Gene Information

Locus tag: Ajs_4033
Symbol:
ID: 4672417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax sp. JS42
Kingdom: Bacteria
Replicon accession: NC_008782
Strand:
Start bp: 4299554
End bp: 4301119
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 69%
IMG OID: 639841073
Product: peptidase M48, Ste24p
Protein accession: YP_988213
Protein GI: 121596317
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
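
The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4299554
END_BP = 4301119
GENE_LENGTH_BP = 1566
PROTEIN_LENGTH_AA = 521


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # prints 1566
    print(protein_length(GENE_LENGTH_BP))  # prints 521
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.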


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAGGTT CTCTACAAAA TTGGCTTCCA GCGCCCAGCT GGAAAGCGCT GACAGCTTCT 
TTTTTTATAG CATTTGGCTC CCTGCAAGCA CCGGCTGTGG TGGCACAGCC ATCGCTGCCC
ACACTGGGAG ACGGCCTGGA GATGACGACC AGCGCCGAGC GCAAGCTGGG CGACCGCATC
ATCCGCGAGC TGTACCGCGA CCCGGATTAC ATCGATGACG CGGTGCTGAC TGAGTATGTG
CAGAGCCTGT TCCTGCCACT GGTGCAGGCC GCCAAGGCGC GGGGGGAGCT GTCGCCCGAA
CTCGAGGAAC GCTTTGCCTG GGAGATCTTG CTGGGCCGCG ACCGTACGGT GAACGCATTT
GCCCTGCCGG GCGGCTACTT CGGCGTGCAC CTCGGGCTGA TCGGGGTGGT GAGCACGCGC
GACGAACTCG CGTCGGTGCT GGCGCACGAA CTCAGCCATG TCACCCAGCG CCACATTTCA
CGCCTGATCG CGCAGCAGGG GCGGCAGACG CCGCTGATGC TGGGCGCGCT GATCCTCGGC
GCGCTGGCTG CGAGCCGCAG CCCCGATGCG GCCCAGGCCC TGATGGTGGG CGGGCAGGCG
CTGGCCGTGC AGAGCCAGCT GAACTTCTCG CGCGACATGG AGCGCGAGGC CGACCGTGTG
GGCTACGGCC TGATGGCGCC CGCTGGCTTT GCGCCGCAGG GTTTCGTGGG CATGTTCGAC
AAGCTGCAGC AGGCCAACCG CATCAACGAC AACGGCAGTT GGCCCTACCT GCGCAGCCAC
CCCTTGACCA CGCAGCGCAT CGCGGACATG CACAGCCGCA TTCCGCCGGG TGCGGCGCAG
CCCGCGCGGG CTACGCTGGA GCACCTGATG ATGGCTGCGC GCGCGCGCGT GCTGATGCGC
CCCGGTGTCG ACGCTTGGCG ACAGTGGGTT GCCGAGCCGC AGGACGCCGG CTTTGCCGCA
CGGCCCGCAC CGCAGCAGGT GGCGGCCTGG TATGCGGCCA CCCTCAGCGC GGTGCAGTTG
GGCGACATGG CGGCTGCGCG CAAGGCGTTG CGTGCGCTGC AGTCCAGCGC AGGTACCGAT
GCTGCAGCGT TGCGCCAGGC GCGCCTGCTG GGTGCTGAGC TGGAGCTGGC TGCGGGCGAT
GCGCCCGCGG CGCTGACGTA CCTGCAGGGC AGCGTCGTCA CGGAAGGCAG CGGTGTGCCC
AGCGCGCCCG CGCCCGCACG GCCAGAGATG CTGCTGCGCA CCCAGGCCCT GCTGCGCACG
GGCGCGGCAG GCGAGATGGT GGGGCCCTTG CAGACCTGGG TGGCCACGCA TCCTCGGGAC
GCGACCGTCT GGCAGGCGCT GGCGCAGGTA TGGCAACAGC AGGGGCAGCC CCTGCGCGCC
GTGCGCGCCG AGGCCGAGGC ACACGCCGCA CGCTATGACT ACGCGGCCGC GGTGGATCGT
TTCAAGGCGG GGCAGGACCT TGCGCGGCGC AGCAGCGCGG CGGGGGATTA CATCGAAGCG
TCGATCATCG ACACGCGCCT GCGTGCCGTG GAGTCACTTC TTAAGGAACA GGCCGCCGAG
CGCTGA
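
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before it can be analysed. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the demo uses only the first line of the sequence, so its result is not expected to match the value reported for the whole gene.

```python
# First line of the gene sequence, copied as printed (blocks of ten bases).
FIRST_LINE = "GTGCAAGGTT CTCTACAAAA TTGGCTTCCA GCGCCCAGCT GGAAAGCGCT GACAGCTTCT"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and return one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(len(seq))                        # prints 60
    print(round(100 * gc_fraction(seq)))   # GC percentage of these 60 bases only
```

Passing the full gene sequence to clean_sequence and then to gc_fraction gives a value that can be compared with the GC content field.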
 
Protein sequence
MQGSLQNWLP APSWKALTAS FFIAFGSLQA PAVVAQPSLP TLGDGLEMTT SAERKLGDRI 
IRELYRDPDY IDDAVLTEYV QSLFLPLVQA AKARGELSPE LEERFAWEIL LGRDRTVNAF
ALPGGYFGVH LGLIGVVSTR DELASVLAHE LSHVTQRHIS RLIAQQGRQT PLMLGALILG
ALAASRSPDA AQALMVGGQA LAVQSQLNFS RDMEREADRV GYGLMAPAGF APQGFVGMFD
KLQQANRIND NGSWPYLRSH PLTTQRIADM HSRIPPGAAQ PARATLEHLM MAARARVLMR
PGVDAWRQWV AEPQDAGFAA RPAPQQVAAW YAATLSAVQL GDMAAARKAL RALQSSAGTD
AAALRQARLL GAELELAAGD APAALTYLQG SVVTEGSGVP SAPAPARPEM LLRTQALLRT
GAAGEMVGPL QTWVATHPRD ATVWQALAQV WQQQGQPLRA VRAEAEAHAA RYDYAAAVDR
FKAGQDLARR SSAAGDYIEA SIIDTRLRAV ESLLKEQAAE R
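
The protein sequence can be reproduced from the gene sequence using translation table 11. The sketch below is a minimal translator: the amino acid string is the table 11 assignment in NCBI codon order, and the first codon is always read as methionine, which is a simplifying assumption (here the gene starts with GTG, an alternative start codon in table 11). The function name translate_cds is illustrative.

```python
BASES = "TCAG"
# Amino acids for translation table 11 in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base up to the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a start codon and is read as Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate_cds("GTGCAAGGTT CTCTACAAAA TTGGCTTCCA"))  # prints MQGSLQNWLP
```

The output matches the first ten residues of the protein sequence above.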