Gene YpAngola_A3868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3868 
Symbol 
ID5802346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4100633 
End bp4102171 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content49% 
IMG OID641341660 
Producthypothetical protein 
Protein accessionYP_001608170 
Protein GI162419594 
COG category[R] General function prediction only 
COG ID[COG5529] Pyocin large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0748992 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0167221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG GTTATTATCT GGTTGTGGGC GATGAAACAA CTTGCGGGGG AGTTATCACC 
GAAGGGGAAC CGACCCACAC AATAATGGGA AGAGCCGTCG CCCGTGAACA AGACCGTATT
ACCTGCGGTA AGCATCCGGG AACCTACATC ATTGTCGGGC ACATTCCCGG AGACACAATA
CTGGGACGCA AGTTTGCGGG TTCACTACAC AGCACAAGTA ACTGTCCGTG CAAGGCAAGA
TTTATCCCTT CGATGATGAA TGATGCCTAT GAGTTAATCG GGTCACCTAC CTCCGGTCAG
TCCAGTATCG CACCCTCATT CACGGAGACC TCCGAATCAC CACCAGACGT AACCCCCGTT
TTTGCCAAGT CTTGTCTGCG GGAAAAGGGG TGTACCGATG CGGGTACCGA GGGTGAACCA
CACCGCAATT TCGGCAAAAT GGGCTTTTAT CAGACTATTC CCCCCTCACC AACCTCCCCA
ACTGATAATA ATGAAGTGGA TCAACGCGCT CAAACTGCCA GGCGTAAACC TGTGGAAACA
GAGCCTGATG TTAAGACACC GTGGTACAAG CGGGTGTTGG GGAGAACCAG TGCGGCTCCG
GCCGCGGTAG TTGTACCGGT ATCGACTGGG GCGGGTCTGG CGGCCTTAGA GCAAGCGGGT
ATTCAGGGAA TGCGGTTTGT CGGTGGTAAC CTGATGACGG CGGGACGCTG GATGGTGGGT
TCATCACCTG TCGGTGCTGC TATCATGGGA ATGATGCCGG GCACCCTGAA TGAAGGTGAA
ACCGATTTAC TCGACAAACT CAAGCTGGAA AACATTGCTA AGAATGGTGG CTCAGCACCA
ACACGCGTGC GTTTTCGCTG GGTTGATAGC GTAAATGGCC GGTTAAAAGC CGAGGGCTAT
CATATGAGTG CTGAAGGTGG ATTGGACAGA GTGCCAGTAC GCAAGATGAC GCTCAATGTC
TTTACCGGCA ATTATGAGTT TTGGGAAGAT GGCGCGGCAG GGCCAACCAT TCTGTGGACA
CCGAATGATC CCGGATTTAA AGCCCCATCG AATACCGGTA ATAACGACGG GCCGATCATT
CGCACTACGA TTACGGTATT ACCGATGCCG GAAGCTGACG AAACCGGCGA GCATTCAACT
ACATTACCCA TGCCAGAGGA GAAGGATTTC CGCGATTATA TTCTGATCCA TCCGTTGCTT
GATTTGCCAC CGTTGTATAT TTATTTGAGT AAAAATCCAG ACACGCCAAT TTGGACGAAA
ACCAAAAGGC TCGAACCGGT ATCTAATGCT TATGAACACT GGGTTAAGCA TGGTCATGAA
TTCTCAGACC AGCCTTTTAA TAATGCGAAA GAATATGTTG AGAGCACTCA TGATTTCGTT
AATAATCCTC CGGAAGGTAC ACTTACAAAA ACTCGCCCAA ATGGTGATGA ACTATTTTAC
CATCCACAGT TAAATACGTT TGCAGTAAAG ACAAAAGATG GAGTGCCTAA AACGATGTTT
AAGCCGTCAG ACAAAATGGG ATATTGGAAT AAACAATGA
 
Protein sequence
MAKGYYLVVG DETTCGGVIT EGEPTHTIMG RAVAREQDRI TCGKHPGTYI IVGHIPGDTI 
LGRKFAGSLH STSNCPCKAR FIPSMMNDAY ELIGSPTSGQ SSIAPSFTET SESPPDVTPV
FAKSCLREKG CTDAGTEGEP HRNFGKMGFY QTIPPSPTSP TDNNEVDQRA QTARRKPVET
EPDVKTPWYK RVLGRTSAAP AAVVVPVSTG AGLAALEQAG IQGMRFVGGN LMTAGRWMVG
SSPVGAAIMG MMPGTLNEGE TDLLDKLKLE NIAKNGGSAP TRVRFRWVDS VNGRLKAEGY
HMSAEGGLDR VPVRKMTLNV FTGNYEFWED GAAGPTILWT PNDPGFKAPS NTGNNDGPII
RTTITVLPMP EADETGEHST TLPMPEEKDF RDYILIHPLL DLPPLYIYLS KNPDTPIWTK
TKRLEPVSNA YEHWVKHGHE FSDQPFNNAK EYVESTHDFV NNPPEGTLTK TRPNGDELFY
HPQLNTFAVK TKDGVPKTMF KPSDKMGYWN KQ