Gene YpAngola_A2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2234 
Symbol 
ID5800704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2337457 
End bp2338626 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content49% 
IMG OID641340134 
Producttetratricopeptide repeat protein 
Protein accessionYP_001606679 
Protein GI162419005 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000215898 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00382092 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGAAA TGCTGTTTCT GCTGTTGCCC GTTGCTGCCG CCTATGGTTG GTATATGGGG 
CGCAGAAGCG CTCAGCAGGA TAAGCAACAG GATGCTAACC GCTTGTCACG TGAATACGTG
GCTGGGGTTA ACTTCCTTCT CTCGAACCAG CAAGATAAAG CAGTCGATCT CTTCCTCGAA
ATGTTGAAAG AAGACAGTTC TACGGTCGAG GCTCATCTTA CTCTGGGTAA TCTGTTCCGC
TCACGCGGCG AAGTTGATCG CGCCATTCGC ATCCATCAAG CCTTGATGGA GAGTGCATCC
CTGACATTTG AACAACGGCT ATTGGCTGTT CAGCAACTCG GTCGAGATTA CATGGCCGCA
GGTTTATATG ATCGGGCAGA AGATATGTTC AACCAATTGG TTGAAGAGCA AGATTTTCGG
CTCGGCGCAT TACAACAATT ACTGATAATT CATCAGGCAA CCAGTGATTG GCATAATGCT
ATTGAAGTTG CGGAAAAATT GGTCAAGATG GGCAAAGATA ATCAGCGTCT GGAAATTGCG
CACTTCTATT GTGAACTTGC GTTGCAAGCA ATGGGCAGTG ACGATCTGGA TAAGGCCATG
GGGTTACTGA AAAAAGCGGC AACGGCGGAT AAACAATGTG CCCGAGTTTC TATTATGCGT
GGCCGTGTGC ATCTCGCTAA AGGTGAGTAT GCCAAAGGCG TTGAAGCGTT GGAGCGAGTA
CTGGAGCAAG ATAAAGAAGT CGTCAGTGAA GCATTACCGA TGCTCAGTGA ATGCTACCAA
CACCTGCAAC AGCCCCAGGC TTGGGCTAAT TTCCTTAAAC GTTGTGTGGA AGATAATACC
GGTGCGGCGG CTGAACTGAT GTTGGCGGAG GTCCTTGAAC AGCAGGAGGG CCATGATGTC
GCGCAAACGT ATATTAACCG CCAATTACAG CGCCACCCAA CGATGCGCGG TTTTTATCGT
CTAATGGATT ACCACTTAGC AGATGCAGAA GAAGGGAGTG CAAAAGAGAG TTTATTGCTA
TTACGGGACA TGGTTGGTGA GCAAATTCGG ACCAAACCAC GTTACCGTTG TCATAAGTGT
GGTTTCACCG CCCACTCACT CTATTGGCAT TGCCCGTCTT GCCGTGCCTG GGCATCAGTT
AAGCCGATCA GAGGATTAGA TGGGCAGTAG
 
Protein sequence
MLEMLFLLLP VAAAYGWYMG RRSAQQDKQQ DANRLSREYV AGVNFLLSNQ QDKAVDLFLE 
MLKEDSSTVE AHLTLGNLFR SRGEVDRAIR IHQALMESAS LTFEQRLLAV QQLGRDYMAA
GLYDRAEDMF NQLVEEQDFR LGALQQLLII HQATSDWHNA IEVAEKLVKM GKDNQRLEIA
HFYCELALQA MGSDDLDKAM GLLKKAATAD KQCARVSIMR GRVHLAKGEY AKGVEALERV
LEQDKEVVSE ALPMLSECYQ HLQQPQAWAN FLKRCVEDNT GAAAELMLAE VLEQQEGHDV
AQTYINRQLQ RHPTMRGFYR LMDYHLADAE EGSAKESLLL LRDMVGEQIR TKPRYRCHKC
GFTAHSLYWH CPSCRAWASV KPIRGLDGQ