Gene YpAngola_A3892 details

Gene Information

Locus tag: YpAngola_A3892
Symbol: hasD
ID: 5802370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4129726
End bp: 4131534
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 49%
IMG OID: 641341683
Product: ABC transporter
Protein accession: YP_001608193
Protein GI: 162419728
COG category: [R] General function prediction only
COG ID: [COG4618] ABC-type protease/lipase transport system, ATPase and permease components
TIGRFAM ID: [TIGR01842] type I secretion system ABC transporter, PrtD family
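
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4129726, 4131534)
print(length_bp)                  # 1809
print(protein_length(length_bp))  # 602
```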


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.881504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCTT GCAAAACGCC CGGAGCTTCA TCGGTACCGC CAACAATTCT GTCGGTTCTG 
GCGGGTAATA AAAAAATCCT CTGGGGCATC GGGCTATTTA CTGCCGTAAT AAACTTGCTG
ATGTTGGCAC CGGCCATTTA TATGCTGCAA GTGTATGATC GCGTCCTCGC CTCAGCGAAT
ACCATGACAT TATTGATGCT CACTGTTCTG GTGCTGGGCG TATTTGTTTT TATTGGTTTA
TTAGAATGGG TTCGCAGTGC AGTAGTGATT CGCTTGGGGA CTCAGATTGA TATGCAACTC
AATCAGCCGG TCTTTAACGC GGCCTTTGCC GCCAATCTTA AGGGACATAA CACGCCAGCC
GCGCAAGCGC TAAATGACCT GACAGTGTTA CGCCAGTTTG CCACCGGTAA TGCGTTATTT
GCCTTTTTTG ATGCACCTTG GTTCCCGCTT TATTTGCTGG TAATTTTTTT GCTTCATCCG
TGGCTGGGGA TGCTGGCCGC CGCAGGCGCA GGTATATTGG TTGTTCTCGC TTGGCTCAAT
CAGTGGATCT GTAAAAAACC ATTACACGAT GCCTCAATTA TCACCTCACA CGCGACACAA
CAAGCCAATG CTAATTTACG TAATGCGGAT GTCATTGAAG CGATGGGGAT GCTCAAAGCA
TTACGTGAAC GTTGGTTAAT GCAGCACGCG AATTTTCTCT ACCAACAAAA TCTTGCCAGC
GATAAAAGTA GCCGGGTGAC GGCGGTCGCT AAATCAAGCC GCCAGGCATT GCAATCGATG
ATGTTAGGTC TGGGAGCGTT ACTGGTGATT TATAATGAAA TCACGGCTGG CGTGATGATT
GCCGGGTCAA TTTTAATCGG GCGGGTGCTG GGGCCTATCG ATCAACTTAT TGCAGTCTGG
AAACAATGGA GCCACGCCCG GCTGGCTTAT CAACGTCTCT CCCAATTACT GGCGCAGCAC
CCATCATCAC CCACCGGCAT GGTTTTGCCT GCTCCACAGG GGAAACTGAA CGTTACGCAA
CTTATGGCCT GCAAGCCGGG CACGCACATT CCAGTATTAC ACTCCATCAA CTTTGAACTG
CAACCCGGTG ATGTGCTAGG TATTTTGGGG CCATCAGGCA GTGGTAAATC TACGCTGGCA
AAACTCTTGG TCGCCAGCCA GCCCACATTC AGCGGAACAG TGCGTTTAGA TAGCGCAGAC
CTTTCTCGTT GGGATAAAAC GCAGTTAGGG GAATTTATCG GTTACCTGCC ACAAAATATT
CAGCTATTCC GCGGCACTGT TGCTGAAAAT ATTGCACGTT TTGGCGCTAT TGATACCGCA
AAAGTGGTCG CAGCTGCACA GTTGGCAGAT GTACACGATT TAATTCTGCA TTTGCCGCAA
GGGTATGACA CCCCGTTAGG TGACGACGGC GAGGGGTTAT CCGGTGGTCA ACGTCAACGT
ATCGCCTTAG CTCGGGCAAT GTACGGAATA CCTCGGCTTA TCGTATTGGA TGAACCCAAT
GCCAGTTTGG ATAAAGAAGG GGAGCAAGCC TTATTAGCCA GTATTATCCA GCTAAAACAA
CAAGGTTGCA CCATCGTGAT GATCACCCAT AAACCAGAGC TGTTATCTGG CAGTGACTAT
TTGCTGTTTT TAAAGAATGG GCAAATGGAT CTATTTGATC GTACTCAGGC GGTCTTACAG
AACATTCAGG GTAAGGATAA GCCTGCTGTG CAACCTGAAA CGAAAATACT GAATAGCCGA
AGCGGTTGGA GCAACGGTGT GTCATATGGC ATCGGGCCGG CTCGTACTAC ATCATCACCG
AAGCCATGA
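
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and only the final 9 bases of the sequence are used as input, so the printed value is not expected to match the 49% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence above; paste the full sequence
# here to compute the value for the whole gene.
fragment = clean_sequence("AAGCCATGA")
print(f"{gc_percent(fragment):.1f}% GC over {len(fragment)} bp")
```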
 
Protein sequence
MESCKTPGAS SVPPTILSVL AGNKKILWGI GLFTAVINLL MLAPAIYMLQ VYDRVLASAN 
TMTLLMLTVL VLGVFVFIGL LEWVRSAVVI RLGTQIDMQL NQPVFNAAFA ANLKGHNTPA
AQALNDLTVL RQFATGNALF AFFDAPWFPL YLLVIFLLHP WLGMLAAAGA GILVVLAWLN
QWICKKPLHD ASIITSHATQ QANANLRNAD VIEAMGMLKA LRERWLMQHA NFLYQQNLAS
DKSSRVTAVA KSSRQALQSM MLGLGALLVI YNEITAGVMI AGSILIGRVL GPIDQLIAVW
KQWSHARLAY QRLSQLLAQH PSSPTGMVLP APQGKLNVTQ LMACKPGTHI PVLHSINFEL
QPGDVLGILG PSGSGKSTLA KLLVASQPTF SGTVRLDSAD LSRWDKTQLG EFIGYLPQNI
QLFRGTVAEN IARFGAIDTA KVVAAAQLAD VHDLILHLPQ GYDTPLGDDG EGLSGGQRQR
IALARAMYGI PRLIVLDEPN ASLDKEGEQA LLASIIQLKQ QGCTIVMITH KPELLSGSDY
LLFLKNGQMD LFDRTQAVLQ NIQGKDKPAV QPETKILNSR SGWSNGVSYG IGPARTTSSP
KP
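
The protein sequence corresponds to a translation of the gene sequence with translation table 11. As a rough illustration, the sketch below translates the first 30 bases and the last 9 bases of the gene sequence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code; the table's alternative start codons are not modelled, which does not matter here because this gene starts with ATG. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with
# the amino acids of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGAATCTT GCAAAACGCC CGGAGCTTCA"))  # MESCKTPGAS
print(translate("AAGCCATGA"))                         # KP
```

Both outputs match the first block and the final two residues of the protein sequence listed above.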