Gene YpAngola_A3743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3743 
Symbol 
ID5802220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3967690 
End bp3970065 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content54% 
IMG OID641341546 
Producthypothetical protein 
Protein accessionYP_001608058 
Protein GI162421540 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAAC CACTGAGCCG CATTATTGCA AGCGAACTGC AGGCCCGGCC GGAGCAAGTT 
ATCTCCGCTA TCCGCCTGCT TGATGAAGGT AATACCGTGC CCTTTATTTC ACGGTATCGT
AAGGAAGTTA CCGGCGGGTT AGATGATATC CAACTGCGTC AGTTGGAAAG CCGTCTGGGG
TATCTGCGTG AATTAGAAGA TCGCCGCCAA ACCATTCTTA AATCAATTGA AGATCAAGGA
AAACTCACCG ACCAGCTGGC CGGGGCGATC AACGCCACCC TAAGTAAGAC CGAGCTGGAA
GATCTGTATC TTCCTTATAA ACCGAAGCGC CGCACTCGCG GACAAATTGC CATTGAAGCC
GGGTTAGAAC CCCTGGCAGA GCGTTTATGG CAGGATCCAC AACAAGACCC TGAACACACC
GCGCTGGCCT ATGTTGATGC CGATAAAGGC GTCGCTGATA CTAAAGCCGC GTTGGATGGT
GCTCGCTATA TTTTGATGGA GCGGTTTGCC GAAGATGCCA CCCTGCTGGC GAAAGTGCGT
CAGTATCTGT GGAAAAACGC CCATCTGGTG TCAAAAGTCG TGGAAGGTAA AGAGCAGGAA
GGCGCTAAAT TCCGCGATTA CTTCGATCAC CACGAACCTA TCGCACAAGT CCCTTCTCAC
CGCGCATTGG CCATGTTCCG TGGCCGCAAT GAGGGGGTAC TGCAACTGGC CTTGGATCCT
GATCCGCAAT TTGACGAACC GCCGCGTGAA AGTCAGGGTG AACAGATCAT TATCAACCAT
CTTGATCTGC GCTTGAATAA TGCGCCGGCA GACGGCTGGC GTAAGGCGGT GGTCAACTGG
ACCTGGCGTA TCAAGGTGCT GTTGCATCTG GAAACCGAGC TGATGAGCAC CTTGCGTGAA
CGGGCTGAAG ATGAGGCTAT TAATGTCTTT GCCCGTAATA TGCAAGATTT ACTGATGGCC
GCGCCAGCAG GTATGCGCGC GACCATGGGC CTCGATCCCG GCCTGCGTAC TGGCGTGAAA
GTCGCGGTGG TGGATGCAAC AGGCAAGCTG GTCGCTTTCG ATACCATCTA CCCACACACC
GGCCAGGCAG CAAAAGCCGC CGCCGTTGTC GCCGCCCTGT GCATCAAACA CCAGGTTGAA
CTGGTGGCTA TCGGTAACGG TACTGCCTCA CGGGAAACCG AGCGCTTCTT TGTGGAGCTA
CAGCAACAGT ACCCGGCCGT CACCGCCCAA AAAGTCATTG TCAGTGAGGC CGGTGCCTCG
GTCTATTCAG CCTCTGAATT GGCCTCGCAA GAGTTTCCTG ATCTGGATGT CTCCATCCGT
GGCGCGGTTT CCATTGCCCG CCGTCTGCAA GATCCGTTGG CTGAACTGGT AAAAATCGAT
CCGAAATCTA TCGGTGTTGG TCAGTATCAG CACGATGTCA GCCAAAGCCA ATTGGCGAAA
AAGCTGGATG CGGTGGTGGA AGACTGCGTA AACGCCGTTG GCGTGGATTT AAACACGGCT
TCGGTGCCGT TACTGACACG TGTTGCCGGT TTGACGCGCA TGATGGCACA GAACATCGTG
AACTGGCGTG ATGAGAATGG CCGCTTCCGC AACCGTGAGC AATTACTGAA AGTCAGCCGC
CTCGGGCCGA AAGCCTTCGA ACAGTGTGCA GGCTTCTTGC GTATTAACCA CGGCGATAAC
CCCTTAGACG CCTCGACAGT TCACCCAGAA GCCTATCCGG TAGTTGAGCG TATTTTAGCG
GCCACCGAGC AGGCGTTGCA GGACTTAATG GGCAATGCCA ATGCGCTGCG CAACCTTAAT
GCTCGCGATT TTACTACTGA GCGTTTTGGC GTACCAACGG TAACCGATAT TCTGCGAGAG
CTGGAAAAGC CAGGCCGTGA CCCGCGCTCT GAATTTAAAA CAGCCACCTT CGCGGAAGGG
GTGGAAACAC TGAATGACCT GACACCGGGC ATGATCCTTG AAGGCGCGGT CACTAACGTG
ACAAATTTTG GTGCTTTTGT GGATATCGGC GTTCATCAGG ATGGTTTGGT GCATATCTCT
TCACTGGCCG ATAAGTTTGT CGATGATCCA CATAAAGTGG TGAAAGCCGG CGATATCGTC
AAAGTCAAAG TGATGGAAGT GGATCTGCAA CGTAAGCGCA TCGCCCTGAC CATGCGCCTT
GATGAGCAGC CAGGTGAAAC TCACTCCCGC CGATCCAACA ATGGCACGGG TAGCGAGCGC
ACCAATAATG ACAACCGCGG GGTAAATCGC CCACATAACG ACGCGAAAGG TCATAATGCG
CCTAACCGTG CTCCGGCCAA AGGGCGATCA GATAGCAGCT CGGCGGGTAA CAGCGCCATG
AGCGATGCGC TGGCGGCGGC CTTTAAAAAG CGTTAG
 
Protein sequence
MNEPLSRIIA SELQARPEQV ISAIRLLDEG NTVPFISRYR KEVTGGLDDI QLRQLESRLG 
YLRELEDRRQ TILKSIEDQG KLTDQLAGAI NATLSKTELE DLYLPYKPKR RTRGQIAIEA
GLEPLAERLW QDPQQDPEHT ALAYVDADKG VADTKAALDG ARYILMERFA EDATLLAKVR
QYLWKNAHLV SKVVEGKEQE GAKFRDYFDH HEPIAQVPSH RALAMFRGRN EGVLQLALDP
DPQFDEPPRE SQGEQIIINH LDLRLNNAPA DGWRKAVVNW TWRIKVLLHL ETELMSTLRE
RAEDEAINVF ARNMQDLLMA APAGMRATMG LDPGLRTGVK VAVVDATGKL VAFDTIYPHT
GQAAKAAAVV AALCIKHQVE LVAIGNGTAS RETERFFVEL QQQYPAVTAQ KVIVSEAGAS
VYSASELASQ EFPDLDVSIR GAVSIARRLQ DPLAELVKID PKSIGVGQYQ HDVSQSQLAK
KLDAVVEDCV NAVGVDLNTA SVPLLTRVAG LTRMMAQNIV NWRDENGRFR NREQLLKVSR
LGPKAFEQCA GFLRINHGDN PLDASTVHPE AYPVVERILA ATEQALQDLM GNANALRNLN
ARDFTTERFG VPTVTDILRE LEKPGRDPRS EFKTATFAEG VETLNDLTPG MILEGAVTNV
TNFGAFVDIG VHQDGLVHIS SLADKFVDDP HKVVKAGDIV KVKVMEVDLQ RKRIALTMRL
DEQPGETHSR RSNNGTGSER TNNDNRGVNR PHNDAKGHNA PNRAPAKGRS DSSSAGNSAM
SDALAAAFKK R