Gene YpAngola_A3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3003 
Symbol 
ID5801475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3172565 
End bp3173875 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content38% 
IMG OID641340842 
Productradical SAM domain-containing protein 
Protein accessionYP_001607372 
Protein GI162421476 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00234995 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0000159012 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCGAAA CAATACCCAT TAAATACATT GATACAATCG AAATAAATAA ATTAAGCATA 
CAGGATCTGG AGGACCTCAG TTCCTGTAAA TATAAATTAC TATCTGCTTA TTTAATTGGA
AAAATACCTG CCAACAAAAT ATTAGATGCA GATGAATTAG AGATCTATCA TAACGAACGG
ACTAAAAAAA TATTGGCTAA AGCAATAAAA GCAAGAAAGC TAGTTATTGT ATTAAAAGCC
ACCCGATTAT GTAACCTTCG TTGTACCTAT TGCCACTCAT GGGCCGAAGG CCCTAATCAA
ACGATTTTAT TCAGCACGCT TATACACATA GTTCGCCAAA TTCTTGCCAT CCCAAATGTA
AACCGGTTCG AGTTTGTCTG GCATGGTGGC GAAGTCACGC TATTAAAACC TGCTTTCTTT
AAGAAGCTAA TTTGGTTACA AGAGCAATTT AAACGGCCAG AGCAATATAT CACTAATACG
ATGCAATCAA ATATAGTGAA TATCTCTGAT GAGTGGCTAA TATTTATTAA GGGGATAGGA
ATGAACGTAG GTATTAGCCT CGATGGCATA CCGGCGGTTA ATGACAAGCG TCGGGTGGAT
TTTCGTGGAA GAGGGACATC GGATCGTATA GCAAAAGGAA TAAAAAAACT TCAAAAATAT
GATATCTTAT ATGGTGCACT TATTGTAGTT GACCGCGAAG TATACCAAAC AGACATGAGG
GAAATGTTAG ATTATTTTAT TTTAATAGAA TTAAACGGCA TAGAATTTCT TAATATTGTA
CCCGATAATC GACTAACTGC AGGTGAAGAT ATTGGTAATA ACTTTATCAG CTATGCTGAG
TTCATTAAAT TTCTATCGGC ACTGTTTGTT ATTTGGATAA AAGGGTACCG AGAAAAAATA
CATATTGCTA TATTCGAGGA TTTTATCAGT GTACTGGAGC ATCCTGAAAA AAAACTGTCG
GCTTGTTATT GGTCAGGCAA TTGCTCACAA GAGATCATTA CGTTAGAGCC TAACGGTGAT
GTTTCTCCTT GCGACAAATA TAGAGGCGAT GCAGGCAGTA TTTATGGTTC ATTGCTGAAA
ACTGATTTAG CCGGGCTGTT AACTCAATCA TCGCACAATC AACAAGCGAT AGACGAAGAA
GTCGCTGCAA CGAGAAAAAT GCAACATTGT GAGTGGTTTT CGATCTGCCA TGGCGGTTGT
CCCCATGATC GAGTGATCAA CCGCAGGCAT ACCAAAGGAT ATAATGACGA GTGCTGTGGT
ACCGGAAAAT TACTGGCAAC GATTAAGGCT TATCTGGCAG ATATTCGTTG A
 
Protein sequence
MRETIPIKYI DTIEINKLSI QDLEDLSSCK YKLLSAYLIG KIPANKILDA DELEIYHNER 
TKKILAKAIK ARKLVIVLKA TRLCNLRCTY CHSWAEGPNQ TILFSTLIHI VRQILAIPNV
NRFEFVWHGG EVTLLKPAFF KKLIWLQEQF KRPEQYITNT MQSNIVNISD EWLIFIKGIG
MNVGISLDGI PAVNDKRRVD FRGRGTSDRI AKGIKKLQKY DILYGALIVV DREVYQTDMR
EMLDYFILIE LNGIEFLNIV PDNRLTAGED IGNNFISYAE FIKFLSALFV IWIKGYREKI
HIAIFEDFIS VLEHPEKKLS ACYWSGNCSQ EIITLEPNGD VSPCDKYRGD AGSIYGSLLK
TDLAGLLTQS SHNQQAIDEE VAATRKMQHC EWFSICHGGC PHDRVINRRH TKGYNDECCG
TGKLLATIKA YLADIR