Gene YpAngola_A3149 details


Gene Information

Locus tag: YpAngola_A3149
Symbol:
ID: 5801623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3336752
End bp: 3337927
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 39%
IMG OID: 641340983
Product: putative sulfatase regulator
Protein accession: YP_001607511
Protein GI: 162421309
COG category: [R] General function prediction only
COG ID: [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase)
TIGRFAM ID:
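
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3336752
end_bp = 3337927

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # gene length in base pairs
print(protein_length_aa)  # protein length in amino acids
```

Under these assumptions the computed values agree with the Gene Length (1176 bp) and Protein Length (391 aa) fields.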


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.997665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATATCA CGGCTAAACC AACCAGCTAC CAGTGTAATT TAAAGTGTGA TTACTGCTTT 
TATCTCAGTA AAGAAAACAT TTTTCAGCAC AAAGGTTGGA TGACTGAAGA AACCCTTGAA
ACATTTATCG AGCGATATAT CAGTGCATCG GGGCATGATG TGTATTTCAC TTGGCAAGGG
GGTGAGCCTA CCATGGCTGG GCTGGATTTC TTTGAGAAAG CGATACAGTA TCAGAACCGC
TATAAAGGGA CTAAAAAAAT ACACAACGCT TTACAAACTA ACGGTATTTT ATTAGATGAT
GCATGGTGTC TCTTTCTAAG AGAAAATCAT TTTTTAGTCG GTGTGTCTAT TGATGGCCCT
AAAGAGTTAC ACGATCGCTA CCGTGTGACT CGCTCAGGCA AAGGATCGTT TGATAAAGTG
ATGGCAGGTA TTGAGCAACT CAAAAAACAT CAGGTAGAGT TTAATACATT AACCGTGATA
AATCGTATCA ATGTAAAATA CCCCCTTGAA GTTTATCGAA CGTTAAAATC TATCGGTGCT
AAACATATCC AATTTATTGA ACTGTTGGAA ACAACCGAGC CTAATATTGA TTTTTCAAAT
CAAAAAAGCA CGTTTGAACT TATTGAGTTC ACTGTCCCTG CGGTTGATTA TGGCCATTTC
ATGGCGGAGG TCTTCAAAGA ATGGGTTCGC CATGATGTAG GAACTCTCTT TATTCGCCAG
TTTGAGTCTT TTGTCAGCCG ATTTATTGGC AATGGGCACA CGAGCTGTGT TTTCCAAAAA
TCGTGCAAAA ATAATTTTGT GATGGAGTCC AATGGTGACA TTTATGAATG TGATCACTTT
GTCTACCCTG AGTATAAAAT AGGCAATATT TACCACGATA AATTGGATTC GTTGGCGAGC
GATAAATTAT CCGCGCAAAA AGAGGTGCTA TCTGAGTCAT GCCGTAAGTG TATGTATAAA
GCCATTTGCT ATGGCGGTTG CCCAAAACAT AGGATTGATC AGGACAGCGA TGGGATGAAA
TCCTATTTTT GCGCAGGGTA TAAAATACTC TTCTCGGTTA TGGTGCCTTA TATGAATGCG
CTGGCTGAAT TAGAAAAAAA TGGTATTCCA TTGGATAAGA TCATGGGTAT CGTCGATGAC
ATTGAATGTG GAATAAAATC ACAACAGCAG CATTAA
 
Protein sequence
MHITAKPTSY QCNLKCDYCF YLSKENIFQH KGWMTEETLE TFIERYISAS GHDVYFTWQG 
GEPTMAGLDF FEKAIQYQNR YKGTKKIHNA LQTNGILLDD AWCLFLRENH FLVGVSIDGP
KELHDRYRVT RSGKGSFDKV MAGIEQLKKH QVEFNTLTVI NRINVKYPLE VYRTLKSIGA
KHIQFIELLE TTEPNIDFSN QKSTFELIEF TVPAVDYGHF MAEVFKEWVR HDVGTLFIRQ
FESFVSRFIG NGHTSCVFQK SCKNNFVMES NGDIYECDHF VYPEYKIGNI YHDKLDSLAS
DKLSAQKEVL SESCRKCMYK AICYGGCPKH RIDQDSDGMK SYFCAGYKIL FSVMVPYMNA
LAELEKNGIP LDKIMGIVDD IECGIKSQQQ H
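
As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates DNA codon by codon. It assumes the gene sequence is given in the coding orientation and uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs only in the start codons it permits). The function name `translate` and the constants are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments are those of the
# standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record) is
    ignored. Assumes only the letters A, C, G and T are present.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGCATATCA CGGCTAAACC AACCAGCTAC"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the protein sequence (MHITAKPTSY); passing the full gene sequence should reproduce the whole protein under the same assumptions.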