Gene YpAngola_A2989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2989 
Symbol 
ID5801461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3153880 
End bp3157302 
Gene Length3423 bp 
Protein Length1140 aa 
Translation table11 
GC content59% 
IMG OID641340828 
Producthypothetical protein 
Protein accessionYP_001607358 
Protein GI162420423 
COG category[S] Function unknown 
COG ID[COG3523] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0000392845 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAGCGCA TTGCCTTACC GATTAAAAAG CCAGAAGTCT GGTTCTGGAT TGTTGCCCTG 
CTGTTTTTGC TGGCCGGTGC CGTGTTGTGC TGGCTGGTCT GGCAACACCC CGAACGGGTG
GGGTTAATTC CGGGGACACC GCAACGTGAC CGTTGGCTGA CGGGACTGGT GGTGGGAACG
GGGATCCTGA CCCTGTGCGC ATTACTGTCG TATGCGGGGA CCCGGTTATC CGGCAGAAAA
CACTTTGATG AGCTACGGCA GCAGGCACAG GGCGATGATG CGCCGTTGCC GGAAGACAAC
GCGCAGGCGG GTGAGACTCA GCAGGGTGAA CCGTCCCGCC TAAAAACCCG CCTGCGCCGC
CGTTACGGTC TGTTCTGGCG CGACAAAGTC CGGCTGCTGA TGGTGGTGGG TGAACCTGAC
GAAATCGCCG CGCTGGCCCC GCAACTGGCG GAGCAGGGCT GGCTGGAAGG GCAGCGCACG
GTGTTGATAC ACGGTGGCAG TTTACAACGT CCGGCGGATG AAACCGGGCT GAGTGAGTGG
CGCAAACTGC GCCGTGGCCG CCCGTTGGAT GGCATTGTCT GGGCGATGAC CGCCGCGCAG
AGTGGCACGC CACAGTGGAT GGATAACGGC CTGCGAACGC TGGAAAAAAT GGGTGCCGCC
TTGCGCTATC AACCGCCGGT GTATCTGTGG CAGGTCTGTG ACAGCGACTG GCCTCAGGAT
TCGCGGACGG AGCAGCCGGT CGGGGCGGTG TTCCCGGCGA AGGCTACGCC AGAAAAGGTG
GAGCAGCAAC TGCGTGCCCT GTTGCCTCAG TTACGTGAAC AGGGAATGCA GCAGCTTTCC
ATCGAGCCGC ACCATGATTT CTTGTTGCGT CTGGCGCAGT CACTGGATCA GGGCGATGCG
GCTCGCTGGC GGCAGCGTCT GACGCCGTGG TTTACCGAGT ATGCCGCGCG TATTCCCTTG
CGTGGGCTGA TGTTCAGTTT ACCCGATACC TCTGCATCGG CGGCGCAGGT ACATGAGAAA
AGCTGGACGG TACCGGCCAG TTGGCAGGGG GTACTGGATG ATTGCCGGGG GGCGCGTGGA
CGCCGTGTCG GGCTGCCGTG GGAGCAGACA CTGTGCTACA GCTTGCTGGC ACTGATTGTC
TTGTGGGGTG TGGGCAGCGT GGTGTCATTT GCGGTCAACC GCCACCAGAT GGTCTCTGCC
GCACAACAAG CACAGCAACT GGCGCAGTCG CACGCGGTCT CCGACCAGCA ACTGATGGCC
CTACAAGCCC TGCGAAATGA TATCGGGCGC TTACAATCTC GCGTGGCGCA GGGCGCGCCG
TGGTATCAGC GCTTTGGTCT GGACCATAAT GCGCCGCTGC TTGAGGTCCT GATGCAGTGG
TATGGTCAGG CCAACAACCG TATTCTCAGG GATGCCACCG CGCAGAGCCT GTATCAGAAA
CTCAGCGAAC TGGCGGAGCT GCCCGCCAAC AGCCCGCAAC GGGCAGCACG GGCAAAAACA
GGCTATGACC AGTTAAAGGC TTATCTGATG ATGGCCCGCC CGGAAAAAGC CGATGCTGTA
TTTTATGCTC AGGTGATGCA GACCACCGAG CCTACACGGG CCGGTGTCTC CCCCGGTCTG
TGGCAGAGTC TGGCGCCCGA CTTGTGGCAG TTTTATGCAC AAAATCTGCC CGCTCAGCCG
GACTGGAAAA TTACCCCGGA TACCGGGTTG ATCAGCCAGG TGCGGCAGGT GTTGCTGGGG
CAGATGGGTC AGCGCAATGC AGAAAGTACG CTGTATGAAA ACATGCTGCT GTCGGTGCGC
CGCAACTATG CCGACATGAC GCTGATGGAC ATGACCGGCG ATACCGATGC TCAGCGTCTG
TTTCAGACCT CGGAATCCGT GCCCGGTATG TTCACCCGCA AGGCCTGGGA TGAGCAAATC
CAGCAGGCGA TAGATAAAAC GGTGGCCTCC CGCCGCGAAG AAATTGACTG GGTATTGAGT
GATAACCGCC GGGCGATATC CGAAGATATC TCACCGGAAG CGCTGAAAAA ACGCCTGACC
GAACGTTATT TCACCGATTT TGCTGGCAGT TGGCTGAGTT TCCTCAACAG TTTGCACTGG
AATGAGGCGC ATAACCTGTC GGATGTGATT GACCAACTGA CCTTGATGAG TGATGTACGC
CAGTCGCCGC TGATTGCGCT AATGAACACG CTGGCGTGGC AAGGGCAGAC CGGGCAGCAG
AATCAGGCAT TATCGGATTC GCTGGTGAAG TCTGCCAAGG CGCTGATGAA TAAAGACCAG
GCTCCGGCGA TTGACCAGAG TGCCGGTGGG CCAGTAGGGC CACTGGACGA GACCTTTGGC
CCGTTGCTGG CACTGATGGG CAAAGGCGAT GCACAAAACA GGCTGTCGTC GGACAGCTCG
CTGAGCCTGC AAACGTTGCT CACCCGCGTG ACCCGGGTGC GGCTTAAACT CCAGCAGGTG
GTCAATGCTT CGGATCCGCA GGAGATGACA CAGGTGCTGG CCCAGACCGT TTTCCAGGGC
AAAAGTGTTG ATCTGACGGA CACGCAGGAG TACGGCAGCC TGATAGCCGC CAGTCTGGGA
GAGGAGTGGA GCAGCTTCGG GCAGACGATG TTTGTTCAGC CGCTGACGCA GGCGTGGGAA
ACCGTGCTGC AACCTTCATC GGCCAGTCTC AACGACCAGT GGAAAAACGC GGTGGTGGCC
AACTGGAAAT CGGCCTTTGA TGGCCGCTAT CCGTTTGCCG CCAGTAAAAG TGATGCTTCA
TTGCCGATGC TGGCCGAGTT TATCCGCAAG GACAGCGGGC GCATCGACAG CTTCCTGACC
CGTGAACTGG GCGGCGTGCT GCACAAAGAA GGAACGCGCT GGGTGCCGAA TAAGGTGAAC
AGCCAGGGGC TGACCTTTAA CCCGGCCTTC CTGGCGGCGA TTAATCAACT GAGCCAAATC
TCCGACATCT TGTTCACTGA CGGCAGTCAG GGGCTACGCT TTGAGCTGCT GGCCCGTCCG
GTGCCGAATG TGGTGGAAAC GAATCTGGCG ATTGACGGAC AGAAATTGCA TTATTTCAAC
CAGATGGAGA GCTGGCAGAG TTTCCGCTGG CCGGGGGAGA CCTATAAACC CGGCACGATG
TTGACCTGGA CTGGTGTGAG TTCAGGCGCC CGGTTGTACG GTGATTATCA GGGGACATGG
GGGCTGATCC GTTGGCTGGA GCAGGCAAAA CAGAAAAAGC TGGATGAAGG GCGCTATCAA
CTGACCTTCA CCACCGCAGA TAAACAGCCA TTGCAATGGA TATTACGCAC CGAGCTGGGC
AAAGGCCCGT TAGGCCTGTT GCAACTGCGT AATTTCACTC TGCCTGCGCA AATTTTTCTG
ATACAGCACG CGCCGCTTGC GATTGCTGAC ATGTCAGACG ATGAAGACAT GGCAGAAGAC
TGA
 
Protein sequence
MKRIALPIKK PEVWFWIVAL LFLLAGAVLC WLVWQHPERV GLIPGTPQRD RWLTGLVVGT 
GILTLCALLS YAGTRLSGRK HFDELRQQAQ GDDAPLPEDN AQAGETQQGE PSRLKTRLRR
RYGLFWRDKV RLLMVVGEPD EIAALAPQLA EQGWLEGQRT VLIHGGSLQR PADETGLSEW
RKLRRGRPLD GIVWAMTAAQ SGTPQWMDNG LRTLEKMGAA LRYQPPVYLW QVCDSDWPQD
SRTEQPVGAV FPAKATPEKV EQQLRALLPQ LREQGMQQLS IEPHHDFLLR LAQSLDQGDA
ARWRQRLTPW FTEYAARIPL RGLMFSLPDT SASAAQVHEK SWTVPASWQG VLDDCRGARG
RRVGLPWEQT LCYSLLALIV LWGVGSVVSF AVNRHQMVSA AQQAQQLAQS HAVSDQQLMA
LQALRNDIGR LQSRVAQGAP WYQRFGLDHN APLLEVLMQW YGQANNRILR DATAQSLYQK
LSELAELPAN SPQRAARAKT GYDQLKAYLM MARPEKADAV FYAQVMQTTE PTRAGVSPGL
WQSLAPDLWQ FYAQNLPAQP DWKITPDTGL ISQVRQVLLG QMGQRNAEST LYENMLLSVR
RNYADMTLMD MTGDTDAQRL FQTSESVPGM FTRKAWDEQI QQAIDKTVAS RREEIDWVLS
DNRRAISEDI SPEALKKRLT ERYFTDFAGS WLSFLNSLHW NEAHNLSDVI DQLTLMSDVR
QSPLIALMNT LAWQGQTGQQ NQALSDSLVK SAKALMNKDQ APAIDQSAGG PVGPLDETFG
PLLALMGKGD AQNRLSSDSS LSLQTLLTRV TRVRLKLQQV VNASDPQEMT QVLAQTVFQG
KSVDLTDTQE YGSLIAASLG EEWSSFGQTM FVQPLTQAWE TVLQPSSASL NDQWKNAVVA
NWKSAFDGRY PFAASKSDAS LPMLAEFIRK DSGRIDSFLT RELGGVLHKE GTRWVPNKVN
SQGLTFNPAF LAAINQLSQI SDILFTDGSQ GLRFELLARP VPNVVETNLA IDGQKLHYFN
QMESWQSFRW PGETYKPGTM LTWTGVSSGA RLYGDYQGTW GLIRWLEQAK QKKLDEGRYQ
LTFTTADKQP LQWILRTELG KGPLGLLQLR NFTLPAQIFL IQHAPLAIAD MSDDEDMAED