Gene YpAngola_A3045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3045 
Symbol 
ID5801518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3217886 
End bp3219589 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content49% 
IMG OID641340882 
Producthypothetical protein 
Protein accessionYP_001607411 
Protein GI162421109 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTA TTCACCGGCT TACCCAATAT GAGCGTTTAT ATCAAAAATT TGGTGACCAC 
CCGGTGGCGA CTACCGTGGC TGACGTGGCT AGTTTACTCT TTTGTAGCGA ACGGCATGCC
CGCACCCTGA TTCAGCAACT ACAGATGAAG AGTTGGCTAA GCTGGCATTC ACAAGTGGGT
AGAGGGAAAC GAGCGCAACT GCAATGCCTG AAAAAACCTG ATGCATTACG GGCTATTTAC
CTGCAACAGT TCCTTGAGCA AGGCGATCAT CAAGCGGCAT TCTCAATAGC ACAATTGGAA
CCCGAGCGCC TACAGACCTT ACTTACCCCC CATATGGGCG GACAATGGCA AGCCGATAGC
CCTATTCTGC GTATCCCCTA CTACCGCGAG CTGGAACCGC TTAATCCCAT GAATGCTTCA
GGGCGGGCAG AACAGCACCT GATTTATACT TTGCATGCTG GGCTGACACG GTTTAATACA
GGTGACCCGT TGCCTAAACC TGATTTGGCT CATCACTGGC AAATCAGCGC AGATGGCTTA
ACCTGGCAGT TCTTCTTACG CAGCCAACTA CGTTGGCATA ATGGCGACCA CATTCATGGT
AAGCAATTAT TGCAAACACT GGAGATTCTG CGCGCAAACC GACGTAGCCA CCCCAGTTTT
GCTAATATTG TTACTATCAC TCTCCCCCAC GCTTTATGCC TACAATTTAC CCTTTCCCAA
CCAGATTATT GGCTAGCACA CCGGCTGGCT GATTTACCCT GTAGGCTTTT TCATCCAGAC
GATCCCTTTT TAGGTGCGGG TCCTTTTAAA TTAGCGACCT TTGATAAACA TTTAGTTCGA
CTAAAGCAGC ACGAATTTTA CCATTTGCAA CATCCCTATC TGGACATTAT CGAGTACTGG
ATCACCCCTA GCCTGACGGT AAATTCAACA AATGGCAGTT GCCAGCATCC GGTTCGCATC
ACCATCGGCC AAGAGGAAGA GTTCCCACTG GCCCGCCCCG TACAGCGCGG CATGAGCCTC
GGATTCTGCT ATCTGGCTAT TAATCGCCAT CGTAGCAACC TCACTCCACA GCAAATAGCC
AAGCTACTGA TGTTAGTCCA AACCTCGGGT ATATTAGAGG CGCTCTCCAT CAGCCGTGAC
GTAATAACGC CCTGCCATGA AATCCTTCCA GGCTGGCCTA TTCCACAGTT TTCGACGGAT
GAAAATCCCT CCCTTCCCGC CTGTTTGGTT CTGACCTATC AACCGCCGAT GGAGCTTGAG
AGTGTCGCTG AGCAACTAAA AATAGTATTA GCCGCTCATG GCTGTACATT AGAGATCCGC
GCCTGCCATG ATAAACAGTG GCAAGATGTT GACAAAATTA AAGAGAGCGA TTTACTGTTG
GCCGATCATT TAGTCGGTGA ATCGCCAGAG GCCACAATGG AGAGCTGGCT ACGGCTGGAC
CCTCTGTGGC GCGGAATTTT ACAGAACGAA CAGTGGAACC AGCAGCAAAA AACGCTGACC
TTCATTCAGC AGATAGAAAG CGCGCCAGAA CGTTTTCGCC AATTACAGGC ACATTACGAT
GACCTGATGT TAGCGGGACT GATTTTGCCG CTGTTTAACT ATGAATATCA GGTCAATGCC
CCATCACGCA TCAATGGGGT TACATTAACG GCATATGGTT GGTTCGATTT CTGTCAAGCC
TGGCTACCGC CAATAACGAA TTAA
 
Protein sequence
MRIIHRLTQY ERLYQKFGDH PVATTVADVA SLLFCSERHA RTLIQQLQMK SWLSWHSQVG 
RGKRAQLQCL KKPDALRAIY LQQFLEQGDH QAAFSIAQLE PERLQTLLTP HMGGQWQADS
PILRIPYYRE LEPLNPMNAS GRAEQHLIYT LHAGLTRFNT GDPLPKPDLA HHWQISADGL
TWQFFLRSQL RWHNGDHIHG KQLLQTLEIL RANRRSHPSF ANIVTITLPH ALCLQFTLSQ
PDYWLAHRLA DLPCRLFHPD DPFLGAGPFK LATFDKHLVR LKQHEFYHLQ HPYLDIIEYW
ITPSLTVNST NGSCQHPVRI TIGQEEEFPL ARPVQRGMSL GFCYLAINRH RSNLTPQQIA
KLLMLVQTSG ILEALSISRD VITPCHEILP GWPIPQFSTD ENPSLPACLV LTYQPPMELE
SVAEQLKIVL AAHGCTLEIR ACHDKQWQDV DKIKESDLLL ADHLVGESPE ATMESWLRLD
PLWRGILQNE QWNQQQKTLT FIQQIESAPE RFRQLQAHYD DLMLAGLILP LFNYEYQVNA
PSRINGVTLT AYGWFDFCQA WLPPITN