Gene YpAngola_A1469 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1469 
Symbol 
ID5799937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1521101 
End bp1522507 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content55% 
IMG OID641339423 
Productputative DNA circulation protein 
Protein accessionYP_001605984 
Protein GI162420197 
COG category[R] General function prediction only 
COG ID[COG4228] Mu-like prophage DNA circulation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.539442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTCA TTGGCAACAC ATTATCAGCG TTATTGGGAG GCAGTGACGA CAGCTGGCAA 
TGGTCGGAAC ACCTTCATCG AGCCTCCTTT CGTGGCGTTC CCTTTGTGGT CGTCAGTGGG
CAAGGTACCT TTGGTCGCCG CCAGGTAACA CACAGCTACC CCTATCGCGA TACCAGCTAT
ATCGAAGATT TGGGCCGCAA TACGCGCAAA ATTGTTCTGA AAGGGATTTT GATACAAAAC
AGCCAGATCT ATACCGCACC TGATGTGATG ACTCAACGTG ACTCATTGAT TGCGGCTTGT
GAAATGTCGG GGCCGGGCAC TCTGGTCCAC CCGACACTGG GGGAAATGAC GGTCAGCATT
TCCGAGGCAG GGCTATTGAT CGATGATAGC TTCAGCAGTG AGCGGGTCTT TTCCTTTACC
TTAACCGCCA TCGAGTCTGG CCTGCGTGCC TTTGCTATTA CTGGCTCCGC AGAAATGGGC
GCATCCATTC AGTCCTCCTG GCTAGGGCTA AGTGCTAAAG CGGTTGCGGG CTTTATCTCA
ACGGTGAAAG GCGAAATGCG CTCAGCGACT CAGGCGATAA AAACTCTGAA AAATACCGCT
GCATTCTGGC GTCGGATGGT GACGGGCACG GCCAACGAAG CCAGTAATTT GGGCAACGCC
CTACGCTCAA CCTTTGGTCG CAACCGCTAT GGCCGCTATA ACCACGGCAC TGTCGGAGGC
AGCAGCACGG GAGCGACAAC GACGGTTAGC CAACAAAATG ACACGGCGGA TTTATCCACG
CTGGTGGCGC AACGGATGGC ACTGGTGGTT GAAGGACGGG CGGCGCTCGA CGCGGCGTTG
GACGAGTTAC TCGCCGCCAG CAGTATTGAA AGCCATGCCG ACAGTGTGCT GGCCGTGGTC
GATGCCCTGC TGGCGACGGG CATCAGTACG CGGGATATTA TCCGTATCAT GGAAACCCTG
GCGCTAGCCC ATGACGATAC TTTCCGTGCC AACGACAGTG ATAGGGCCGT CGCGGATGCC
AGCCACCACT TAATGGCCAC ATTATGCACT GGGGCGATGA TCCAAGTGGC AGCGCAATAT
CAACCGGAAA GCTATGACGA TGCGGTTGCG GTATTGGGCC GGGTTTGCCT GGTGATTGAC
AATACTGCAC TGGTCGCCGC CGACAGGGGG AATGATGAGA CCTATCGTGC GCTGGTGCAG
ATGCGTGAAT CTATCGTGAC CGTGCTACAG CAGGCGGGGG CCAATCTATC ACGGGTTGGC
GAGGTCAGTT TTAACCGTTC ACTACCGGCT TTGATGCTGG CAAACCGCCT CTATCAGGAT
GCGTTACGCG GCGATTCGCT GGTGAAAATG GCTAATCCTA TTCACCCGGC ATTTATGCCC
ATCCGATTTA AGGCGCTGAA TCTATGA
 
Protein sequence
MSLIGNTLSA LLGGSDDSWQ WSEHLHRASF RGVPFVVVSG QGTFGRRQVT HSYPYRDTSY 
IEDLGRNTRK IVLKGILIQN SQIYTAPDVM TQRDSLIAAC EMSGPGTLVH PTLGEMTVSI
SEAGLLIDDS FSSERVFSFT LTAIESGLRA FAITGSAEMG ASIQSSWLGL SAKAVAGFIS
TVKGEMRSAT QAIKTLKNTA AFWRRMVTGT ANEASNLGNA LRSTFGRNRY GRYNHGTVGG
SSTGATTTVS QQNDTADLST LVAQRMALVV EGRAALDAAL DELLAASSIE SHADSVLAVV
DALLATGIST RDIIRIMETL ALAHDDTFRA NDSDRAVADA SHHLMATLCT GAMIQVAAQY
QPESYDDAVA VLGRVCLVID NTALVAADRG NDETYRALVQ MRESIVTVLQ QAGANLSRVG
EVSFNRSLPA LMLANRLYQD ALRGDSLVKM ANPIHPAFMP IRFKALNL