Gene YpAngola_A3338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3338 
SymbolaguA 
ID5801815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3553146 
End bp3554261 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content53% 
IMG OID641341159 
Productagmatine deiminase 
Protein accessionYP_001607681 
Protein GI162418230 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID[TIGR03380] agmatine deiminase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGTGC AAGACACGAT GTTACAACAA CAGGCATTAC CCGGCACACC GCGGCAGGAT 
GGTTTCTTTA TGCCTGCAGA ATGGGCACCG CAGGATGCGG TATGGATGCT TTGGCCATAT
CGTCAGGATA ACTGGCGCGG CAAAGCTATT CCCGCTCAGC AGACATTCGC CAAAGTCGCC
GAGGCCATTA GCCGTGCTAC GCCAGTCTTT ATGGGCGTTC CCGCCGAGTT TATGGCACAA
GCCAAGGCAA CCATGCCGGC GAATGTGACC TTAGTGGAAA TGGCCAGTGA CGATGCCTGG
ATGCGTGATA CAGGCCCAAC GATGGTGATT AATGGTGCGG CCGAGCGCCG GGCCGTTGAC
TGGCAGTTTA ATGCATGGGG CGGCCTGAAC GGTGGCTTGT ATGCTGACTG GCAACAAGAT
GAAAAAGTTG CCGTTCAAGT CAGTGATTTT CTGAAAAACG CGCATTACAG TGCACCGTTG
ATACTGGAAG GTGGCTCCAT TCATACTGAT GGCGAAGGCA CGTTACTCAC CACTGCTGAA
TGTTTGCTAA ACCCGAATCG CAACCCACAT TTGAATCAAG CGCAAATCGA ACAGTTACTG
TGTGACTATC TGGGGGTGAC TCACTTCATT TGGTTGCAAG ATGGCGTGTA TAACGATGAG
ACCGATGGTC ATATCGATAA TATGTGCTGT TTTGTCCGCC CAGGCGAAGT AGCCCTGCAT
TGGACAGACG ACCAGCAGGA TCCACAGTAT GCACGTTCAG TCGCAGCGTT CGAGGTGTTA
TCCAATACCG TCGATGCCAA AGGGCGTAAA CTGAAAATCT GGAAGTTACC GGCTCCCGGC
CCGCTTTATA ACACGGAAGA GGAAACCTTC GATGTGTTGA CCAGCGATGC GGTTCCCCGC
ACGGCGGGTG AACGCCTGGC GGGCTCTTAT GTCAACTTCC TGATCAGTAA CCAGCAAATT
ATTTTCCCAC TGCTGGATAG CCGCACTGAC GGGCAGGCTA ACGATCTGTT ACAACAAATG
TTCCCCGGCT ATGCAATTGT CGGCGTGCCT GCCAGAGAGA TTTTACTCGG TGGCGGTAAC
ATACATTGCA TTACTCAGCA GATCCCAGCG GCTTAA
 
Protein sequence
MSVQDTMLQQ QALPGTPRQD GFFMPAEWAP QDAVWMLWPY RQDNWRGKAI PAQQTFAKVA 
EAISRATPVF MGVPAEFMAQ AKATMPANVT LVEMASDDAW MRDTGPTMVI NGAAERRAVD
WQFNAWGGLN GGLYADWQQD EKVAVQVSDF LKNAHYSAPL ILEGGSIHTD GEGTLLTTAE
CLLNPNRNPH LNQAQIEQLL CDYLGVTHFI WLQDGVYNDE TDGHIDNMCC FVRPGEVALH
WTDDQQDPQY ARSVAAFEVL SNTVDAKGRK LKIWKLPAPG PLYNTEEETF DVLTSDAVPR
TAGERLAGSY VNFLISNQQI IFPLLDSRTD GQANDLLQQM FPGYAIVGVP AREILLGGGN
IHCITQQIPA A