Gene YpAngola_A3360 details


Gene Information

Locus tag: YpAngola_A3360
Symbol:
ID: 5801837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3573682
End bp: 3576051
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 50%
IMG OID: 641341181
Product: glycosyl hydrolase family protein
Protein accession: YP_001607703
Protein GI: 162419588
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
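
The length fields can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the usual GenBank convention; this record does not state it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3573682, 3576051)
print(length_bp)                  # 2370
print(protein_length(length_bp))  # 789
```

Both values agree with the Gene Length and Protein Length fields of this record.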


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAT TAAAAAACTG GCTGTTAATC AATGAGTATC CCGATCACCT TGAACTGCGG 
GTCGATGACC GTCATATCTT TTGTCTGTAT GTATTAGAAC CAAACTTATG CAGGGTGCTG
ATTAAACGCG AGGGTGAACT GGCACTGAGC CGCACCTGGA GTATTGCCCC GCAAGGTGAT
GTGCCATGGT CAGGCCGCGA TCGCCTGAGT CTGGAAGGCT TCTCTCTCCC CGGTTATCAA
TTGGAGAAAC ATGAACAGCA ACTGGTCGTC ACGACGGAGT GCCTTCGGGT CACCATCCAT
CAACCACTGC ATCTGACATG GGAATATAAA AACCCGCATG GTGAGTGGCT ACCGCTGGCC
GCAGACCGGC CAACCAGCGC CTATCTACTC AGCCCTCAGG GAGAAGCGAT TGCTCACTAT
CAGCGCCGTT ATCCCAATGA ACAATATTAC GGATTGGGCG AAAAAGCCGG TGATCTGAAC
CGCGCAGGTC GTCGCTTTGA AATGCGTAAT CTGGATGCGA TGGGCTATAA CGCCGCCAGC
ACGGACCCGC TCTACAAACA TATTCCGTTT ACTATCACCC GCCGTGATGA TGTCAGTTTT
GGCCTTTTTT ATGACAATCT GAGTAGCTGT TGGCTGGATT TAGGCAATGA GATCGATAAC
TACCATCTGG CCTATCGCCG TTATCAGGCC GAAGCGGGTG ACCTTGATTA CTATCTGTTC
CTCGGCCCGA AAGTTTTGGA TGTCACCAAG GCGTTTGTTC GCCTGACCGG TAAAACCCAA
TTTGGCCCTA AATGGAGCCT GGGTTACAGC GGTTCAACCA TGCATTACAC CGATGCCCCT
GATGCTCAGG TACAACTGCA AAAGTTCATT GCATTGTGTC AGCAGCATGA CATCCCTTGC
GATTCGTTCC AGCTCTCTTC CGGCTACACC TCCATCAAAA ACAAGCGTTA TGTCTTTAAC
TGGAATTACG ACAAAGTGCC GCAACCGAAA GTCATGAGCC AGACGTTCCT ACAAGCAGGT
ATCAAATTAG CCGCCAATAT AAAACCGTGT TTATTGCAAG ATCACCCACA GTATCAACAG
GCTGCGGAGC GGGGGCTTTT CATTCGTGAC AGCGAAACTG ATTTACCTGA ACGCTCTACA
TTTTGGGATG ATGAAGGCTC CCACCTTGAT TTCACCAATC CAGACACCAT GAACTGGTGG
CAAGAGAACG TCACTAAACA GTTATTGGAG ATGGGGATCG GTTCTACCTG GAATGATAAT
AACGAATATG AAGTGTGGGA TGGTGAAGCC CGCTGTAATG GCTTCGGTAA ATCTATCGCC
ATCAAACATA TCAGACCGGT AATGCCTTTA CTGATGATGC GAGCCTCGAT GGAAGCACAG
AAAGCATTCG CACCAGAAAT GCGCCCATAC TTGATATCCC GTTCAGGCTG TGCAGGGATG
CAGCGCTACG CTCAGACCTG GAGTGGTGAT AACCGTACCT GTTGGCAAAC ACTGCGCTAT
AACATCCGAA TGGGGCTGGG GATGAGCCTG TCAGGGCTGT ATAACCTGGG GCATGATGTC
GGCGGTTTTT CTGGAGATAA ACCGGAACCT GAGCTGTTTG TCCGCTGGGT ACAAAACGGT
GTGATGCATC CACGGTTTAC CATTCATTCA TGGAATGATG ACAATACAGT CAACGAGCCA
TGGATGTATC CAGCCGCCAC GCCGATGATC CGGGATGCGA TGGCGTTACG ATATCGATTA
TTGCCCTATT TTTACACATT GCAATGGCAG GCCAGCCATG ATGATGAGCC GATGTTGCGC
CCTACGTTCC TCGACCACGA TAGCCTCACA TTTAAAGAGA ACGATGACTT TATGCTAGGC
CGTGATCTGT TGGTTGCCAG TGTGGTGGAC GCTGGGCAAC GGCAACGGCA GATTTATTTA
CCTGATAATC AGGTCGGCTG GTATTGCTTC CACAGCGGGC AATGGTATAG CGGTGGGCAG
ACAATCACAT TGGATGCACC ACTAGAGCGA TTACCTCTGC TGGTACGCGC GGGGGCAGCC
CTGCCGCTAT CCCGACGTAT CGCCTTTGTT AATCCTGAGG CTGATTGTCA GCGCGAATTG
GCGCTTTATC CCACTCAAGG TGGCGGGCAA TCAAGCGGGA TGCTATTTGA GGACGATGGT
GAAAGCCACC GCTGGCAACA AGGCCATGCG CTATGGCTAA ATTGGCAGGT CACCACCGAT
AATCACCGTA TTGATATCAC CTTTAGCCGC ACCGGGAACT ATCAACCGGC ATGGCGTGAA
CTGACGATTA ATTTGCCAAC AAATGAAAGT CGCACGTTAT CAATCAACGG TGTCGCAGGT
AATACACTGG ATTTAACCAA ACTGGCCTAA
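
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how a GC content figure like the one in the Gene Information section could be computed from such text. The helper names are hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGAAAACAT TAAAAAACTG"))  # 20.0
```

Passing the full gene sequence text to `gc_content` gives the value for the whole gene.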
 
Protein sequence
MKTLKNWLLI NEYPDHLELR VDDRHIFCLY VLEPNLCRVL IKREGELALS RTWSIAPQGD 
VPWSGRDRLS LEGFSLPGYQ LEKHEQQLVV TTECLRVTIH QPLHLTWEYK NPHGEWLPLA
ADRPTSAYLL SPQGEAIAHY QRRYPNEQYY GLGEKAGDLN RAGRRFEMRN LDAMGYNAAS
TDPLYKHIPF TITRRDDVSF GLFYDNLSSC WLDLGNEIDN YHLAYRRYQA EAGDLDYYLF
LGPKVLDVTK AFVRLTGKTQ FGPKWSLGYS GSTMHYTDAP DAQVQLQKFI ALCQQHDIPC
DSFQLSSGYT SIKNKRYVFN WNYDKVPQPK VMSQTFLQAG IKLAANIKPC LLQDHPQYQQ
AAERGLFIRD SETDLPERST FWDDEGSHLD FTNPDTMNWW QENVTKQLLE MGIGSTWNDN
NEYEVWDGEA RCNGFGKSIA IKHIRPVMPL LMMRASMEAQ KAFAPEMRPY LISRSGCAGM
QRYAQTWSGD NRTCWQTLRY NIRMGLGMSL SGLYNLGHDV GGFSGDKPEP ELFVRWVQNG
VMHPRFTIHS WNDDNTVNEP WMYPAATPMI RDAMALRYRL LPYFYTLQWQ ASHDDEPMLR
PTFLDHDSLT FKENDDFMLG RDLLVASVVD AGQRQRQIYL PDNQVGWYCF HSGQWYSGGQ
TITLDAPLER LPLLVRAGAA LPLSRRIAFV NPEADCQREL ALYPTQGGGQ SSGMLFEDDG
ESHRWQQGHA LWLNWQVTTD NHRIDITFSR TGNYQPAWRE LTINLPTNES RTLSINGVAG
NTLDLTKLA
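
The protein sequence is the translation of the gene sequence under translation table 11. As a sketch of that relationship, the code below builds a codon table from the standard amino acid assignments (table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons) and translates up to the first stop codon. It does not handle alternative start codons such as GTG or TTG, which is sufficient here because this gene starts with ATG. The function name is illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a whitespace-formatted CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 30 bases of the gene sequence above.
print(translate_cds("ATGAAAACAT TAAAAAACTG GCTGTTAATC"))  # MKTLKNWLLI
print(translate_cds("AATACACTGG ATTTAACCAA ACTGGCCTAA"))  # NTLDLTKLA
```

The two outputs match the first ten and last nine residues of the protein sequence listed above.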