Gene YpAngola_A3334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3334 
Symbol 
ID5801811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3549130 
End bp3550413 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content56% 
IMG OID641341155 
Productallantoate amidohydrolase 
Protein accessionYP_001607677 
Protein GI162419379 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.976999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTAA CCCTACCGGA TATCGAGGCT GAACAGGCTG CATTACAGGT CTTGGCCCGT 
TGTGACGTAT TGGCGGCAAT CAGTGAATCC CCCGAAGGGC TGACTCGGGT CTATCTATCT
CCTGAACATT TACGGGCAAA TCGGCAAGTC GGTGAATGGA TGCAAGCGGT TGGTATGCAG
GTTTGGCAGG ATACCGTGGG GAATATTTGT GGCCGTTATG AAGGGCGACA ACCGGATGCC
CCCGCCATCT TGTTAGGTTC TCATCTGGAT ACGGTGCGCA ATGCGGGCCG CTATGATGGC
ATGTTAGGGG TATTAACTGC GCTGGAGGTC GTGGGGTACT TACATCGTCA TCAGCAACGT
TTACCGGTCG CCATTGAAGT GATTGGTTTT GCTGATGAAG AGGGAACGCG GTTTGGCATC
ACCCTGCTTG GCAGTAAAGG GGTGACGGGG CGTTGGCCTG TGGAGTGGTT AAATACGACC
GATGCCGATG GCATCAGTGT CGCGCAAGCG ATGGTTCGTG CCGGTTTAGA TCCAATGGAC
ATCGGGCAAT CTGCGCGTGC AGCCAATGCC TTCTGCGCCT ATCTTGAACT GCATATTGAG
CAGGGGCCGT GTTTAGAAAA CGCGGGTTTG GCGCTGGGTG TGGTGACGGA TATTAATGGC
GCGCGCCGTT TACAATGTCA GTTTACCGGG TTGGCGGGCC ATGCGGGGAC CGTACCGATG
GGGCAACGGC AAGATGCATT GGCCGGTGCC GCTGAATGGA TGTGTGTCGT AGAGGCATTG
ACTGCGGCTC AGGGGGAGCA TTTAGTGGCG ACGGTAGGGA CGTTGACGTG TCTGCCCGGT
GCAGTGAATG TAATCCCCGG TCAAGTGAGG CTGACACTGG ATATTCGCGG CCCAAATGAC
CGTGGGGTGA ATGATTTATT GACCCGTCTA TTGGCTGAGG CTGAAGCGAT CGCCACGCGT
CGTGGCATAA CGTTTGCCGC CGAAGGGTTT TACCGTATCA AGGCAACGGC TTGTGATAGT
GCTTTGCAGC AGTGCATCAG CCAGAGTATC AGCCAGGTGC AGGGCCGTTG TTTAGCGCTG
CCCAGTGGCG CGGGCCATGA TGCGATAGCC ATGGCGGAGT GCTGGCCAGT CGGGATGCTA
TTTGTCCGCT GTAAAGGCGG CGTCAGCCAT CATCCAGATG AGTCCGTCAC GAGTAGTGAT
GTTGCGGTGG CGATTCAGGC GTATCTGGAG GCTGTTCTTA CGTCTCCTTC GTCCCCTTCG
TCCTTGACGC CGCAGCGGTG TTAG
 
Protein sequence
MSVTLPDIEA EQAALQVLAR CDVLAAISES PEGLTRVYLS PEHLRANRQV GEWMQAVGMQ 
VWQDTVGNIC GRYEGRQPDA PAILLGSHLD TVRNAGRYDG MLGVLTALEV VGYLHRHQQR
LPVAIEVIGF ADEEGTRFGI TLLGSKGVTG RWPVEWLNTT DADGISVAQA MVRAGLDPMD
IGQSARAANA FCAYLELHIE QGPCLENAGL ALGVVTDING ARRLQCQFTG LAGHAGTVPM
GQRQDALAGA AEWMCVVEAL TAAQGEHLVA TVGTLTCLPG AVNVIPGQVR LTLDIRGPND
RGVNDLLTRL LAEAEAIATR RGITFAAEGF YRIKATACDS ALQQCISQSI SQVQGRCLAL
PSGAGHDAIA MAECWPVGML FVRCKGGVSH HPDESVTSSD VAVAIQAYLE AVLTSPSSPS
SLTPQRC