Gene YpAngola_A0336 details


Gene Information

Locus tag: YpAngola_A0336
Symbol: nagC
ID: 5798800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 349747
End bp: 350973
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 50%
IMG OID: 641338344
Product: N-acetylglucosamine repressor
Protein accession: YP_001604944
Protein GI: 162421808
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID:
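
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(349747, 350973)
print(length, protein_length_aa(length))  # prints "1227 408"
```

Under those assumptions the result matches the record: 1227 bp and 408 aa.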


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0129193
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG GCGGACAAGC ACAAATTGGT AACGTGGATC TGGTAAAACA ACTCAATGGA 
GCCGTGGTTT ACCGGCTAAT TGATCAGCAA GGCCCGATTT CTCGCATACA GATTGCCGAT
CTCAGCCAGC TAGCCCCCGC CAGTGTCACC AAAATCACCC GGCAATTGTT GGAGCGCGGG
CTGATCAAAG AGGTCGATCA GCAAGCCTCC ACCGGGGGGC GTCGCGCTAT CTCTATCGTG
ACGGAAAACC GCCAATTCCA TACCGTTGCA GTCCGTTTAG GTCGTAATGA TGCCACGATC
ACCCTCTTTG ACATGAGCGG TAAATCGCTG GGTGAAGAGC ACTATGCCCT GCCAGAACGA
ACACAAGAAA CGCTGGAACA CGCCTTATTT AATATCATCA GTCAGTTTAT TGACGCCTAT
CAGCGTAAAT TACGTGAACT GATTGCCATC GCGGTTATCC TGCCTGGGCT GGTTGAGCAA
AGCAAAGGTA TCGTGCGCTA TATGCCGCAT ATCAGTGTCA GTAACTGGCC GTTAGTCGAT
AATCTACAAG CGCGCTTTAA CGTCACCAGT TTTGTGGGTC ACGATATCCG CAGCCTGGCA
CTGGCCGAGC ACTATTTTGG TGCAACCCGT GACTGTGAAG ACTCCATTTT GGTTCGTCTA
CATCGAGGCA CGGGTGCCGG TATTATCGTT AACAGCCAAA TATTTTTAGG CAGCAACGGC
AACGTTGGCG AGATAGGCCA TATTCAGATT GATCCATTAG GTGATCGCTG CTATTGCGGT
AACTTTGGTT GTCTGGAAAC CGTGGCATCC AACGCCGCGA TTGAAAACCG CGTCAAGCAC
CTTCTCACCC AGGGTTATCC AAGTAAGCTG TCTCTTGATG ACTGCCATAT TGGTGCTATC
TGTAAGGCCG CAAACCGCGG TGACTTGCTG GCCTGCGAAG TGATTGAACA TGTTGGTCGC
TACTTGGGGA AAGCCATTGC TATCACCATA AACTTATTCA ACCCACAAAA AGTGGTGATT
GCCGGTGAAA TTATTGAAGC CGAGAAAATC CTACTACCCG CCATTCAGGG TTGCATTAAT
ACGCAAGTTT TGAAAAACTT CCGCCAAAAC CTGCCGATAG TGACATCACA ACTTAACCAC
CAGTCGGCTA TCGGCGCTTT CGCACTGGCT AAGCGCGCTA TGCTCAATGG TGTCTTGCTG
CAACGTTTGC TAGAAACTCA CCCGTAG
 
Protein sequence
MSTGGQAQIG NVDLVKQLNG AVVYRLIDQQ GPISRIQIAD LSQLAPASVT KITRQLLERG 
LIKEVDQQAS TGGRRAISIV TENRQFHTVA VRLGRNDATI TLFDMSGKSL GEEHYALPER
TQETLEHALF NIISQFIDAY QRKLRELIAI AVILPGLVEQ SKGIVRYMPH ISVSNWPLVD
NLQARFNVTS FVGHDIRSLA LAEHYFGATR DCEDSILVRL HRGTGAGIIV NSQIFLGSNG
NVGEIGHIQI DPLGDRCYCG NFGCLETVAS NAAIENRVKH LLTQGYPSKL SLDDCHIGAI
CKAANRGDLL ACEVIEHVGR YLGKAIAITI NLFNPQKVVI AGEIIEAEKI LLPAIQGCIN
TQVLKNFRQN LPIVTSQLNH QSAIGAFALA KRAMLNGVLL QRLLETHP
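
The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example only processes the final line of the gene sequence shown above rather than the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence as displayed in the record.
tail = clean_sequence("CAACGTTTGC TAGAAACTCA CCCGTAG")
print(len(tail), tail[-3:])  # prints "27 TAG"
```

Applying `clean_sequence` to the whole gene sequence and passing the result to `gc_fraction` would give a value that can be compared with the GC content field in the record.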