Gene YpAngola_A3370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3370 
Symbol 
ID5801847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3587694 
End bp3588860 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content57% 
IMG OID641341191 
Productputative N-acetylgalactosamine-6-phosphate deacetylase 
Protein accessionYP_001607713 
Protein GI162420948 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.639781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACTG CCTATCTCGC TGATCGCACT TTTACTCCGC AAGGCATTGA AACCGGCGTC 
GCGGTTATCG TCGAGCAAGG CGAGATTGTG GCAGTAACAC GCGAGCTGCC AGCGGATGCA
GAAATTGTGC ATCTGACAGG GAAAACCCTG ATCCCAGGTC TGATCGACAT TCATATTCAT
GGTCGCCAGG GGGCGGATGT CATGGATGCC TCTGCGGAGG CATTACGCAC CATTGCCCGT
GCCCTACCCC AAACCGGGGT TGTTGCCTGG GTCGGCACCA CCGTCAGTGC GCCGATACAG
GATATCTTTG CGGCTCTGGC GCAGGTGCGT GATTTTATTG CAGACCCGGA TAACGCACGT
GATACCCGCA CCGCAACCCT GCTCGGCAGT TTTCTGGAGG GGCCATATTT CACCGCGCCT
TTCCGGGGTT CACACCCAGA AAAGTATCTG ACGACACCAA CACCACAAGA GCTGGAGCAA
TTACGGCATT CGGCGGGTAA CACCTTGTTG CGTGCAGCTA TCGCACCTGA GTCACCCGAG
GCTTTGGCCG CGATCCGCTG GCTGGTGAAT CACGGGATCA AAACCTCTGT GGCGCACACT
GCGGCTAATT TTGAGCAAGT GACGGCGGCC TATCAGCAAG GTGCGGATTG CGGCGTACAT
TTGTATAACG GCATGTCAGG GTTGCATCAC CGTGAACCGG GCTGCTGTGG TGCCGTGTTG
TATCACGACA TGCTGGCAGA GCTGATTGCC GATGGCATTC ATGTGCATCC GGTGATGATG
AATCTGGCGT ATCGCATGAA AGGTTATCGC CGCATTGCAC TGATCACCGA CTGCATGCGC
GCAGGGGGGC TGGGTGAGGG GCGTTATTTA CTCGGCGCAC AGCATATCAC GGTACGTCAG
GGGGAAGCGC GCACCGATGA TGGTTCACTG GCAGGCAGTA CTTGTAGCTT GGATCAGGCG
CTGCGTAACA TGATCCAACA TGCGCAAGTC CCCGAGTGGG AAGCTGTACA AATGGCCAGC
GCAGTACCCG CCGCTTATCT GGGATTAGCG TCAACACTGG GTTCGATCCA GATGGGTGCA
CAAGCCAGCA TGGTGGTGAT GGAGAGTGAC TTTACCGTTG CCGCAACCCT GATTAAAGGT
GAATGGGCTT ATCGCCACTC AGCCTAA
 
Protein sequence
MRTAYLADRT FTPQGIETGV AVIVEQGEIV AVTRELPADA EIVHLTGKTL IPGLIDIHIH 
GRQGADVMDA SAEALRTIAR ALPQTGVVAW VGTTVSAPIQ DIFAALAQVR DFIADPDNAR
DTRTATLLGS FLEGPYFTAP FRGSHPEKYL TTPTPQELEQ LRHSAGNTLL RAAIAPESPE
ALAAIRWLVN HGIKTSVAHT AANFEQVTAA YQQGADCGVH LYNGMSGLHH REPGCCGAVL
YHDMLAELIA DGIHVHPVMM NLAYRMKGYR RIALITDCMR AGGLGEGRYL LGAQHITVRQ
GEARTDDGSL AGSTCSLDQA LRNMIQHAQV PEWEAVQMAS AVPAAYLGLA STLGSIQMGA
QASMVVMESD FTVAATLIKG EWAYRHSA