Gene YpAngola_A2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2094 
SymbolybtE 
ID5800564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2169703 
End bp2171280 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content59% 
IMG OID641340006 
Productyersiniabactin synthetase, YbtE component 
Protein accessionYP_001606552 
Protein GI162421769 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCTT CCTTTGAATC TCTGATTGAA CAGTATCCCT TACCCATTGC CGAACAGTTG 
CGCCACTGGG CGGCCCGTTA TGCCTCGCGA ATTGCCGTCG TTGATGCAAA GGGGTCGTTA
ACCTACAGCG CGCTTGATGC ACAAGTTGAC GAACTTGCCG CAGGTCTGTC ATCACTGGGT
TTGCGTTCGG GGGAGCATGT AATTGTGCAG CTTCCCAACG ACAACGCGTT TGTTACCCTG
CTGTTCGCCT TGTTAAGACT GGGCGTTATC CCCGTGCTGG CGATGCCCTC GCAACGGGCG
CTGGATATCG ACGCGCTGAT TGAGCTGGCG CAACCCGTCG CTTACGTTAT TCACGGGGAA
AACCACGCAG AGCTGGCCCG ACAGATGGCG CACAAACACG CCTGCTTGCG TCATGTTCTG
GTCGCTGGAG AGACCGTGAG CGACGATTTT ACGCCGCTCT TCTCCCTTCA CGGTGAGCGA
CAGGCATGGC CGCAGCCTGA TGTTTCCGCC ACCGCGTTGT TGTTGCTCTC AGGCGGCACA
ACCGGCACGC CCAAACTCAT CCCGCGCCGA CATGCCGACT ATAGCTATAA CTTCAGCGCT
TCTGCTGAAC TGTGCGGCAT CAGCCAACAG AGCGTGTATC TCGCCGTCCT CCCGGTGGCG
CATAACTTTC CGCTGGCCTG CCCCGGTATT CTGGGAACGC TTGCCTGCGG CGGAAAAGTG
GTGCTGACCG ACAGCGCCAG CTGTGATGAG GTGATGCCTT TAATCGCGCA GGAAAGAGTG
ACTCACGTCG CCCTGGTTCC GGCGCTGGCG CAATTATGGG TGCAGGCCAG GGAGTGGGAA
GACAGCGACC TTTCGTCGCT GCGCGTCATT CAGGCAGGCG GCGCCCGGCT CGACCCGACG
CTTGCTGAGC AGGTTATCGC CACCTTTGAC TGTACCCTGC AACAGGTTTT CGGTATGGCG
GAAGGCCTGC TCTGTTTTAC CCGACTGGAC GATCCGCATG CCACCATTCT CCACAGCCAG
GGGCGCCCGT TGTCCCCTCT GGATGAAATC CGCATCGTTG ATCAAGACGA GAACGACGTC
GCGCCGGGCG AAACCGGGCA ATTGTTAACG CGCGGCCCTT ATACCATTTC GGGCTATTAC
CGCGCCCCTG CCCACAACGC GCAGGCCTTT ACCGCGCAAG GGTTTTACCG CACAGGCGAC
AATGTCAGGC TGGATGAGGT GGGGAACCTG CACGTTGAGG GACGCATAAA AGAGCAGATC
AACCGCGCCG GAGAAAAAAT AGCCGCGGCT GAAGTGGAAT CGGCACTGCT GCGTTTAGCG
GAAGTGCAAG ATTGCGCGGT GGTCGCCGCG CCGGACACGC TGCTTGGCGA GCGGATTTGC
GCGTTTATCA TCGCGCAGCA GGTGCCAACT GACTATCAGC AGTTGCGTCA ACAACTGACC
CGTATGGGGC TCAGCGCGTG GAAAATTCCT GACCAAATCG AGTTTCTGGA CCACTGGCCG
CTCACCGCCG TCGGCAAGAT AGACAAAAAA CGCCTGACGG CTCTCGCCGT CGACCGTTAT
CGCCATTCTG CCCAATAA
 
Protein sequence
MNSSFESLIE QYPLPIAEQL RHWAARYASR IAVVDAKGSL TYSALDAQVD ELAAGLSSLG 
LRSGEHVIVQ LPNDNAFVTL LFALLRLGVI PVLAMPSQRA LDIDALIELA QPVAYVIHGE
NHAELARQMA HKHACLRHVL VAGETVSDDF TPLFSLHGER QAWPQPDVSA TALLLLSGGT
TGTPKLIPRR HADYSYNFSA SAELCGISQQ SVYLAVLPVA HNFPLACPGI LGTLACGGKV
VLTDSASCDE VMPLIAQERV THVALVPALA QLWVQAREWE DSDLSSLRVI QAGGARLDPT
LAEQVIATFD CTLQQVFGMA EGLLCFTRLD DPHATILHSQ GRPLSPLDEI RIVDQDENDV
APGETGQLLT RGPYTISGYY RAPAHNAQAF TAQGFYRTGD NVRLDEVGNL HVEGRIKEQI
NRAGEKIAAA EVESALLRLA EVQDCAVVAA PDTLLGERIC AFIIAQQVPT DYQQLRQQLT
RMGLSAWKIP DQIEFLDHWP LTAVGKIDKK RLTALAVDRY RHSAQ