Gene YpAngola_A0194 details

Gene Information

Locus tag: YpAngola_A0194
Symbol:
ID: 5798658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 206361
End bp: 208541
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 50%
IMG OID: 641338212
Product: TonB-dependent siderophore receptor
Protein accession: YP_001604818
Protein GI: 162419214
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
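
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are invented for the example and are not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 206361
end_bp = 208541
gene_length_bp = 2181
protein_length_aa = 726

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final stop codon is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 2181 0 726
```

Under those assumptions the span matches the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the listed protein length.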


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0233266
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACACA AACACCTCTG GGTATTAAAT CCGTGTCTGC TGGTCATGCT CACACCAGCG 
GCGTGGGCAG AGGATCAACT TGTGGTTTCA GCCAACCGCT CACACCGCAG CGTGGCAGAA
ATGGCGCAAA CAACTTGGGT GATTGAAGGC CAGGAACTTG AACAACAAGT TCAAGGCGGC
CTGGAAATCA AAGACATCTT GGCTCAATTG ATCCCAGGTA TCGACGTCAG CAGCCAAGGC
CGTACCAACT ACGGCATGAA CATGCGTGGC CGCTCAATCA TGGTGATGAT CGACGGCGTG
CGGCTGAACT CTTCACGTAG CGACAGCCGC CAACTCGATT CGATCGATCC GTTTAATATT
GCACACATTG AAGTCATCTC CGGTGCGACG TCACTTTATG GCGGGGGTAG CACGGGGGGC
TTGATCAATA TCGTGACCAA AAAAGGCCAG GAAGGAAAAC AGGTTGAGTT GCAAATTGGC
GGCAAAACCG GCTTTAACAG CCATAACGAC CACGATGAGA ACATCTCGGC CGCCATGAGT
GGGGGGACTG AGCGCGCATT CGGCCGATTC TCTGTTTCTT ATCAACGTTA TGGGGGGGGG
TACGACGGCA AAGGCAATGA GGTTTTGATC GACAACACCC AGACCGGCTT GCAATATTCT
AACCGTCTGG ACGTGATGGG CACGGGAACC CTCAATATCG ACGAAAACCA GCAATTGCAG
CTAACCACTC AGTATTTCAA CAGCGAATCC GATGGCAAAC ACGGCTTGTA TCTGGGGCAA
AACTTCTCAG CAGTAACGGG TACCGGGCAG GCCTCCAACA GCGCAGCGCT AAATTCAGAC
CGTATTCCGG GTACCGAACG CCATCTCATC AACTTGCAAT ACTCCAACAC GGATTTCTGG
GGGCAGGATT TAGTGGCGCA AGTGTATTAC CGTGATGAAT CACTAACCTT TTACCCATTC
CCAACACTGA AAGATGGCAA GGTGAGCACT ATCGGTGCAT CACAGCAAAA AACCGATTTC
TACGGCAGTA AACTGACATT GAACAGTGAA CCTATCGATA GCCTAACACT GACTTACGGT
ATCGATTTGG AACATGAAAG TTTCAATGCC AATCAACAGT TCTTTAATCT GGCGAAGGCA
CAGCAATCCG GCGGCATGAC GTTAGAAAAT GCCTACAACG TTGGCCGTTA CCCAAGTTAT
ACCACCACCA ACCTGGCTCC CTTCTTACAA ACCCGCTATG ACATCAACCC GATTTTCACT
CTGAGCGGCG GTGTGCGTTA CCAGTACACA GAGAACAAGG TTGATGATTT TGTGGGTTAC
GCACAACAAC AAGCGATTGC CAGCGGCTCC GCGACTTCCG CTGATCCCGT ACCTGGTGGG
AAAACGGATT ACAACAACTT CCTGTTCAAT GCCGGTTTAC TGGCGCACCT GACAGAAAGC
CAGCAGACCT GGTTCAACTT TTCACAAGGA TTTGAGATCC CGGATCTGGC GAAATATTAC
GGTTCTGGCT CCTACACGCT GGTTAACGGT CACTATCAGT TGCAAAACAG TGTCAACGTG
AATGACTCCA AACTGGAGGG CATCAAGGTT GATTCCTATG AGCTGGGCTG GCGCTATACC
GGCGATAACC TGCGCACCCA AATGGCGGGT TATTACTCGC TTTCTGATCA GACTATCTCG
ATCAACAAAA CAGACATGAC CATCAATGTG TTACCGGATA AGCGCCGAAT CTATGGGGTA
GAAGGCGCAG TAGATTACTT CTTCGATAAC AGCGAATGGA GTGCCGGTGC AACATTTAAC
CTAATTAAGT CGGAAACTAA GGTTAGCGGT AAGTGGCAGA AACTGACTAT CGATGCTGCC
AGCCCGTCGA AAGCCACCGC CTACATTGGC TGGGCACCGG GTGATTGGAA TCTGCGCGTG
CAATCCCAAC AAACGTTCGA TGTTTCCGAT AGCAAGGGTG ACAAAATAGA CGGTTACAAC
ACCATCGATT TTCTCAGTAG TTACGCCCTA CCCGTGGGAA AACTCAGTTT CAGTATCGAA
AACTTGCTGG ACAAAGAGTA CACCACCGTT TGGGGTCAGC GCGCACCGAT TCTGTATAGC
CCAACCTACG GCTCACCCAA CTTATACAGC TATAAGGGCC GTGGCCGGAC TTTTGGTGTG
AACTACTCAG TGTTGTTCTG A
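
A GC content figure like the one listed in this record can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. The function name `gc_fraction` is hypothetical, and the example runs on only the first 30 bases of the sequence above, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq."""
    # Ignore the spaces and line breaks used to lay out the sequence in blocks of 10.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAAACACA AACACCTCTG GGTATTAAAT"
gc = gc_fraction(first_blocks)
print(f"{gc:.3f}")
```

Passing the full 2181-base sequence to the same function gives the whole-gene fraction, which can then be compared against the rounded percentage in the record.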
 
Protein sequence
MKHKHLWVLN PCLLVMLTPA AWAEDQLVVS ANRSHRSVAE MAQTTWVIEG QELEQQVQGG 
LEIKDILAQL IPGIDVSSQG RTNYGMNMRG RSIMVMIDGV RLNSSRSDSR QLDSIDPFNI
AHIEVISGAT SLYGGGSTGG LINIVTKKGQ EGKQVELQIG GKTGFNSHND HDENISAAMS
GGTERAFGRF SVSYQRYGGG YDGKGNEVLI DNTQTGLQYS NRLDVMGTGT LNIDENQQLQ
LTTQYFNSES DGKHGLYLGQ NFSAVTGTGQ ASNSAALNSD RIPGTERHLI NLQYSNTDFW
GQDLVAQVYY RDESLTFYPF PTLKDGKVST IGASQQKTDF YGSKLTLNSE PIDSLTLTYG
IDLEHESFNA NQQFFNLAKA QQSGGMTLEN AYNVGRYPSY TTTNLAPFLQ TRYDINPIFT
LSGGVRYQYT ENKVDDFVGY AQQQAIASGS ATSADPVPGG KTDYNNFLFN AGLLAHLTES
QQTWFNFSQG FEIPDLAKYY GSGSYTLVNG HYQLQNSVNV NDSKLEGIKV DSYELGWRYT
GDNLRTQMAG YYSLSDQTIS INKTDMTINV LPDKRRIYGV EGAVDYFFDN SEWSAGATFN
LIKSETKVSG KWQKLTIDAA SPSKATAYIG WAPGDWNLRV QSQQTFDVSD SKGDKIDGYN
TIDFLSSYAL PVGKLSFSIE NLLDKEYTTV WGQRAPILYS PTYGSPNLYS YKGRGRTFGV
NYSVLF
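
The protein sequence follows from the gene sequence by codon translation. The sketch below illustrates this on the first and last rows of the gene sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. Table 11 differs in which codons may act as start codons, and that is not modelled here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any particular library.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and newlines used to lay out the sequence.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence; both start on a codon boundary.
first_row = "ATGAAACACA AACACCTCTG GGTATTAAAT CCGTGTCTGC TGGTCATGCT CACACCAGCG"
last_row = "AACTACTCAG TGTTGTTCTG A"

print(translate(first_row))  # MKHKHLWVLNPCLLVMLTPA
print(translate(last_row))   # NYSVLF
```

The two outputs match the first 20 and last 6 residues of the protein sequence above. The last row begins at base 2161, a codon boundary, so it can be translated on its own.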