Gene YpAngola_A3028 details


Gene Information

Locus tag: YpAngola_A3028
Symbol:
ID: 5801500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3199070
End bp: 3200323
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 48%
IMG OID: 641340865
Product: hypothetical protein
Protein accession: YP_001607395
Protein GI: 162419839
COG category: [S] Function unknown
COG ID: [COG5276] Uncharacterized conserved protein
TIGRFAM ID:
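
The coordinate and length fields in this record agree with each other, and that can be checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3199070, 3200323)
print(length, protein_length(length))  # prints "1254 417"
```

Both results match the Gene Length and Protein Length fields reported in the record.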


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0018316
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.016195
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAC AAGGTTCCTT GCCGATACCT GAATACAGCC GCAATATGCG GCTGATTGGT 
CACAGTGACC AAGGTGGCAA ACCAGACGGT GTACAAGTTA TGGTGCATAG GGGCTATGCC
TATGTCGGCC ATATGGTATC GAAAGGCGTA TCAATTATTG ACGTAAGGGA TGCCAAAAAC
CCGCGTCCCG CAGGTTTCAT CGCTGCGCCT CCGAATACCT GGAATGTGCA TCTGCAAACT
CATGATGATC TGCTGCTGGT GATTAATGCA CGCGACCTAT TTGCCGATGC CCGTTTTGCC
GAAGAGAAAG TCTATTACAC CCGTTCAGTC GCCGAAACAG TCAGCACGCG TCAGGAAGGC
CGCAATTGGA GCGCGGGTTT ACGTATTTTT GATATTTCAA CGCCAGACAA GCCACGAGAA
ATCAGTTTCT TACCGCTTGA TGGCATTGGT ATCCATCGAA TCTGGTATGT CGGTGGGCGT
TGGGCGTATG TATCGGCACT GTTAGATGGT TATAGCGATT ATATCTTCCT CACCATTGAT
CTGGCTGATC CGCAAAAGCC ACAGGTTGCC GGGCGCTATG CGTTACCTGG TATGGATACT
GCGGCAGGTG AACAGCCTAA CTGGCCAGCA GGCAAACGCT ATGCGCTGCA TCATGCCATT
ATTAGTGGCG ATACCGCTTA CGGCAGTTGG CGTGATGGTG GCCTAACCTT ATTGGATGTC
AGTGATCGTC ACGAACCGAA ACTAATAAGC CACCGTAATT GGAGCCCCCC CTTTGGCGGC
GGTACTCATA CTGCTTTGCC ACTCCCTGAT CGTGATTTAT TGGTAGTACT GGATGAGGCC
GTGCTGGATA ACCAAGAAGA TGGCGAGAAA CATATCTGGG TGTTTGATAT TCGTGAACCA
AGTAACCCGG TGAGTATTTC AACGTTTCCG GTTCCGGAAG AACGAGATTA TATAAAAAAA
GGCGCTCATT TTGGCCCACA CAATCTGCAT GAAAACCGCC CAGGTAGTTT TATCAGTTCA
TCTCTGATTT TTGCTACTTA CCAAAATGCT GGGGTCCGAG CTTACGATAT CAGTAACCCC
TATAATCCAA AAGAAACTGG CGCGTTAGTT CCAGCAGCCC CCGAAAAAAT GATGGATAAG
CGCCCTGGTC GCCCACAAAT TATTCAATCA TGTGATGTAT TTGTTGACAC TCAAGGCATC
ATCTACAGTA CCGATTATAA CGGCGGCCTC TCTATCATCG AATATTTAGG TTAA
 
Protein sequence
MATQGSLPIP EYSRNMRLIG HSDQGGKPDG VQVMVHRGYA YVGHMVSKGV SIIDVRDAKN 
PRPAGFIAAP PNTWNVHLQT HDDLLLVINA RDLFADARFA EEKVYYTRSV AETVSTRQEG
RNWSAGLRIF DISTPDKPRE ISFLPLDGIG IHRIWYVGGR WAYVSALLDG YSDYIFLTID
LADPQKPQVA GRYALPGMDT AAGEQPNWPA GKRYALHHAI ISGDTAYGSW RDGGLTLLDV
SDRHEPKLIS HRNWSPPFGG GTHTALPLPD RDLLVVLDEA VLDNQEDGEK HIWVFDIREP
SNPVSISTFP VPEERDYIKK GAHFGPHNLH ENRPGSFISS SLIFATYQNA GVRAYDISNP
YNPKETGALV PAAPEKMMDK RPGRPQIIQS CDVFVDTQGI IYSTDYNGGL SIIEYLG
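
The protein sequence can be derived from the gene sequence by codon translation. The following is a minimal sketch, not the method used to generate this record. It assumes the sequence is pasted in the space-separated 10-base blocks shown above, and it uses the amino-acid assignments that translation table 11 shares with the standard genetic code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G); table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGGCAACAC AAGGTTCCTT GCCGATACCT"
print(translate(first_blocks))  # prints "MATQGSLPIP"
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.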