Gene YpAngola_A4073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4073 
Symbol 
ID5802552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4336368 
End bp4338557 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content53% 
IMG OID641341854 
Producthypothetical protein 
Protein accessionYP_001608360 
Protein GI162418977 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2982] Uncharacterized protein involved in outer membrane biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0571071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGAA CCGGAAAAGT ATTGGTGGGC GCCAGTGGGT TTATCCTGTT GTCATTGGTG 
GCGGTGGTGA TATTTGTCTC TTCTTTTGAC TGGAATCGCT TGAAGCCAAC GATTAACCAA
AAAGTCTCCG CCGAGTTACA ACGGCCTTTT GCCATTCGTG GCAATTTGAG TGTGGATTGG
TCACGAGAAG GCGAAGGCCC TGGCTGGCGC GGTTGGATCC CCTGGCCGCA TATCCATGCG
GAAGATTTGG TCTTGGGTAA CCCAACGACG TTAATCAGCA CACAAGAGAG CCGTGACGCT
CAATCAGCAC AAGGCACGCC GCTCTCCGAC GCGTTTCCCA CCGGAGAGAT GGTGACGCTT
AAGCGAATCG ACGCCAGCCT TGCTCCGCTG TCACTATTGA GCAAAGAAGT TCGTATCCCC
CGCCTTTGGC TGACGCAGCC AGATATTCAT TTGCAACGCC TGGCGAATGG CAATAACAAC
TGGACGTTTA ATCTGACCAA TACCTCCACG GATAGCGCGT CTTGGTCAGT GGATATCGGC
GATATTATTT TTGATCGCGG TGAGATAACG CTGAAAGATG CCATCCTACA AGCTGATTTA
CTCGCGGTGA TTGATCCGCT GGCTAAAGCG TTGCCGTTCG CACAAGTGAC CGGTGTCCGT
CGCGGGGCCA GTATTAATTC GGTGACCAGT ACTCATTCGG TGAACACGAC TACACCAGTG
AACACGACGA CAGCCACGGC CACGACTAAC CCTGTGACAG AGACGGTAAA AAGCACGACC
CCAGATTACC TCTTCGGCTG GAAAGTAGAC GGTCAGTATC AGGGGCAGCC CTTAGCGGGC
AGTGGGAAAA TAGGGGGCAT GATATCGATG AATGACGCGA ATGTACCGTT CCCTCTGCAA
GCGGATATGC GCTACGGTTC TACTCTGGTG GCCGTCGTCG GGACCTTAAC CGATCCCGGA
AATCTGGCGG GGCTGGATCT ACAATTGGTC TTATCCGGGA CCAGTCTGGA TAACCTCTAT
CCATTGCTCG ATGTCGTACT GCCCGCCACC CCGCCTTATC AGACCGAGGG TCATCTGAGT
GCTCGCCTAA AACAAGCAGG GGGCGCGGTT TATCATTATG AAAATTTTAA TGGGAAGATT
GGTGACAGTG ATATTCACGG TGACCTGACG TATACCGACA GTCAGCCCCG GCCGAAATTA
GCCGGTCAGG TCGATTCGGA AAAATTACGT TTTACCGATT TGGCTCCGCT GATCGGGGCC
GACTCCAATC AGGAAAAAGC CCTACGAGGT GAACGGAATC GGCAGCCGGG CAATAAGGTC
CTGCCGACAG AAACGTTTGA TACCAAAAGT TGGGGAGTGA TGGACGCGGA TGTCACTTAT
ACGGCTAAGC GTATCGAACG GGATAAGTCA TTACCGCTAA GTGATCTGTA CACCCATGTC
GTCTTAAAAG AGGGGATGTT ACTGCTCGAC CCGCTGCGGT TTGGGATGGC GGGAGGGGAT
CTGGCTGCAA CTTTGCGCCT CGATAGTCAC CAAATCCCGA TGAATGGCAA GGTTGATTTG
CATGTCCGGC GCATACAATT AAAAGCCTTG CTGCCACAGG TACAGGCGAT GCGGAGCAGC
CTGGGGCGCT TGAGTGGTGA TGCCTCCTTT ATCGCAGCAG GCAACTCGGT TGCTGGCCTG
TTAGCAACCA GCAACGGTAA CGTGCGCCTG CTACTGAGCC AGGGGCAGAT CAGCCGTAGC
CTGATGGAGT TACTCGGCCT GAATGTGGGT AACTATCTGG TGGCGAAGCT GTTTGGTGAT
GATACGGTGA AAATTAACTG TGCGGTGGCA GATATTACGT TGCGTAATGG GGTGGCGACG
CCGAACGTGT TTGTCTTCGA TACTGAAAAT GCCATTATCA ATATTACCGG TAATGCTAAT
TTTGCTACCG AGCGGCTGAA TTTATCCATC GATCCTGAAA GCAAAGGGCT ACGTATTCTG
ACCCTGCGCT CGCCACTGTA TGTCAAAGGG ACCTTTAAAC GGCCTGATGT GGGGGTGAAA
ACCGGAGCGT TGATTGCCCG AGGTGCCGTT GCGGCAGCAC TGGGGGTGGC ATTAACGCCG
GCAGCGGCAT TACTGGCACT GATCTCCCCC AGTGAAGGTG AAGAGAATCA GTGCGCCCCG
TTACTGCGAA AAATACAGCA AAAGAAATAA
 
Protein sequence
MTRTGKVLVG ASGFILLSLV AVVIFVSSFD WNRLKPTINQ KVSAELQRPF AIRGNLSVDW 
SREGEGPGWR GWIPWPHIHA EDLVLGNPTT LISTQESRDA QSAQGTPLSD AFPTGEMVTL
KRIDASLAPL SLLSKEVRIP RLWLTQPDIH LQRLANGNNN WTFNLTNTST DSASWSVDIG
DIIFDRGEIT LKDAILQADL LAVIDPLAKA LPFAQVTGVR RGASINSVTS THSVNTTTPV
NTTTATATTN PVTETVKSTT PDYLFGWKVD GQYQGQPLAG SGKIGGMISM NDANVPFPLQ
ADMRYGSTLV AVVGTLTDPG NLAGLDLQLV LSGTSLDNLY PLLDVVLPAT PPYQTEGHLS
ARLKQAGGAV YHYENFNGKI GDSDIHGDLT YTDSQPRPKL AGQVDSEKLR FTDLAPLIGA
DSNQEKALRG ERNRQPGNKV LPTETFDTKS WGVMDADVTY TAKRIERDKS LPLSDLYTHV
VLKEGMLLLD PLRFGMAGGD LAATLRLDSH QIPMNGKVDL HVRRIQLKAL LPQVQAMRSS
LGRLSGDASF IAAGNSVAGL LATSNGNVRL LLSQGQISRS LMELLGLNVG NYLVAKLFGD
DTVKINCAVA DITLRNGVAT PNVFVFDTEN AIINITGNAN FATERLNLSI DPESKGLRIL
TLRSPLYVKG TFKRPDVGVK TGALIARGAV AAALGVALTP AAALLALISP SEGEENQCAP
LLRKIQQKK