Gene YpAngola_A3366 details


Gene Information

Locus tag: YpAngola_A3366
Symbol:
ID: 5801843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3582384
End bp: 3583991
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 49%
IMG OID: 641341187
Product: sulfatase
Protein accession: YP_001607709
Protein GI: 162418258
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
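
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. Neither assumption is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3582384
end_bp = 3583991
gene_length_bp = 1608
protein_length_aa = 535

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length)     # 1608
print(computed_protein_length)  # 535
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.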


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.897843
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTAC CCGCAGGAAA AAGAAGCCTG TTGGCAGGGA TGATCGCTGC CGCTGGTATG 
AGTATGACAC CTGTGACTCT GGCGGCACCG GCAGAAAAAC CCAATGTATT GCTGGTAATC
ATGGATGATC TGGGTACCGG GCAGTTAGAT TTCACCCTCA ATAATCTGGA TAAAAAAGCA
CTAAGCCAGC GCCCAGTTCC CGTACGCTAT CAAGGCGATC TGGACAAGAT GATCGATGCG
GCACAGCGGG CGATGCCGAA TGTGTCTTTG TTGGCCAAAA ACGGGGTCAA AATGACCAAT
GCGTTTGTGG CGCATCCGGT ATGCGGGCCT TCGCGCGCGG GTATTTATAC AGGTCGCCAC
CCAACCAGTT TTGGTACTTA CAGTAATGAT GATGCCATGC AGGGGATCCC ACTGGATATT
AAACTGCTGC CCGCCTTGTT TCAGGAGCAT GGCTATGCAA CCGCAAATAT CGGGAAATGG
CACAACGCAC GCATAGAGAA AAAAGCGTTC GTCGCCGATG AGGTCAAAAG CCGCGATTAT
CACGACAACA TGATCTCCGT CAGCGCCCCC GGATATGCAC CTGAAAAACG GGGTTTTGAC
TATTCCTACA GTTATTACGC CTCAGGCGCG GCATTGTGGC ACTCTCCAGC CATCTGGCAA
AACAGCAAAA ATATTGCCGC CCCAGGCTAT CTGACCCATA ACCTGACGGA TGAAACGCTG
AAATTTATTG ATGACTCAGG GAAAAAACCG TTTTTCATCA GCCTGGCTTA CAGCGTGCCA
CATATTCCAT TAGAGCAAGC ATCACCCGCG AAATATATGG ATCGGTTTAA TACCGGCAAC
GTTGAAGCAG ATAAATATTT TGCTGCCATT AATGCCGCAG ACGAGGGGAT TGGTAGAATT
GTTCAGCACT TACAAGAAAA AGGTGAGCTG GATAACACAC TGATTTTCTT CATTTCGGAT
AACGGGGCGG TTCATGAATC CCCAATGCCA ATGAATGGCA TGGACCGTGG ACATAAAGGA
CAAATGTATA ACGGGGGGGT GCATATTCCC TTCGTCGCTT ACTGGCCAAA ACAGATCCCC
GCAGGTACGC AAAGTGATGC ATTGGTGAGT GCATTAGATA TTTTACCGAC GGCATTGAAA
GCCGCGGGTA TTGCCATCCC AGCGGAGATG AGAGTGGATG GTAAAGATAT TCTGCCGGTA
CTGGCAGGTA AGGAACAAAC CTCGCCGCAT CAATATATGT ACTGGGCTGG GCCGGGGGCA
AAGCATTACA GCGATGAGAA TCAGTCATTC TGGCATGACT ACTGGAAATG GATCACTTAC
GAACATCAAC AGGCGCCTAA AAATGATCAT GTAGAGACAT TATCGAAAGC CTCTTGGGCA
ATCCGCGATC AGGAGTGGGC ACTCTACTTC TATGATGACG GCACCAATAC GCCAAAATTA
TTTAATGATA AGCATGATCC CATGGAATCA AAGGATTTAG CTGATCAGTA CCCTGAGCGT
GTCAGTGCAA TGAAAGCGGC ATTCTATGAT TGGATCAAAG ATAAACCCAA ACCCGTGGCT
TGGGGGCAAG ATCGCTATCA GATCTTAGCA AGCTCCGCGA AAAGTTAA
 
Protein sequence
MKLPAGKRSL LAGMIAAAGM SMTPVTLAAP AEKPNVLLVI MDDLGTGQLD FTLNNLDKKA 
LSQRPVPVRY QGDLDKMIDA AQRAMPNVSL LAKNGVKMTN AFVAHPVCGP SRAGIYTGRH
PTSFGTYSND DAMQGIPLDI KLLPALFQEH GYATANIGKW HNARIEKKAF VADEVKSRDY
HDNMISVSAP GYAPEKRGFD YSYSYYASGA ALWHSPAIWQ NSKNIAAPGY LTHNLTDETL
KFIDDSGKKP FFISLAYSVP HIPLEQASPA KYMDRFNTGN VEADKYFAAI NAADEGIGRI
VQHLQEKGEL DNTLIFFISD NGAVHESPMP MNGMDRGHKG QMYNGGVHIP FVAYWPKQIP
AGTQSDALVS ALDILPTALK AAGIAIPAEM RVDGKDILPV LAGKEQTSPH QYMYWAGPGA
KHYSDENQSF WHDYWKWITY EHQQAPKNDH VETLSKASWA IRDQEWALYF YDDGTNTPKL
FNDKHDPMES KDLADQYPER VSAMKAAFYD WIKDKPKPVA WGQDRYQILA SSAKS
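
The sequences above are shown in blocks of ten characters, six blocks per line. Before computing statistics such as length or GC content, the spaces and line breaks need to be removed. The following is a minimal sketch of one way to do that; `normalize_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool, and only the first line of the gene sequence is used as input here.

```python
def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGAAGTTAC CCGCAGGAAA AAGAAGCCTG TTGGCAGGGA TGATCGCTGC CGCTGGTATG"

dna = normalize_sequence(first_line)
print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this first line only
```

Passing the full gene sequence through the same helpers would give a whole-gene value to compare with the GC content field.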