Gene YpAngola_A3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3066 
Symbol 
ID5801539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3242445 
End bp3243815 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content48% 
IMG OID641340903 
Productmajor facilitator transporter 
Protein accessionYP_001607432 
Protein GI162418911 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAATGA ACGATAATAA AATGACTCCG CTAGAGCTTC GAGCTACCTG GGGGCTAGGT 
ACGGTATTCT CACTACGCAT GCTGGGCATG TTCATGGTAT TACCGGTTCT GACCACCTAT
GGTATGGCAC TTTCCGGTGC CAGTGAGGCA TTGATAGGTA TCGCAATTGG TATCTATGGT
TTATCTCAGG CCATTTTTCA GATCCCCTTT GGGCTGCTTT CTGATCGTAT CGGTCGTAAA
CCCATGATCA TCGGTGGGCT GCTGGTTTTT GCATTAGGTA GCATTATTGC CGCATTAAGT
GATTCTATTT GGGGCATTAT TCTTGGCCGT GCACTGCAAG GCTCCGGTGC AATCGCCGCC
GCTGTCATGG CGCTATTGTC TGATTTAACC CGTGAGCAAA ACCGCACCAA AGCAATGGCA
TTTATTGGTG TCAGTTTTGG TGTCACCTTT GCTATGGCCA TGGTGCTGGG GCCAATTGTG
ACGCATGCTT TTGGTCTGCA AGCTCTGTTT TGGGGGATTG CAATACTGGC GCTGTTAGGC
ATTGTTATCA CATTAACCGT GGTTCCTTCA GCCAATAGCC ATGTCCTCAA TCGTGAATCG
AGCATGGTTA AAGGCAGCGT GAGTAAGGTG CTCCACAACA GTCGGTTACT TAAACTCAAT
TTCGGCATCA TGTGCTTGCA TATCCTACTG ATGTCGAGCT TTGTTGCCTT ACCTCAAATG
ATGGCTAATG CTGGGTTAGC GCCCGCTCAA CATTGGGTTG TTTATCTGGT AACCATGTTG
GTCTCTTTCG CCGCAGTAGT ACCGTTTATT ATTTATGCCG AAATGAAGCG CCGCATGAAG
CAGGTCTTTA TGGGCTGTGT AGCGGTATTA TTTATCGCCG AGGTCGTATT GTGGTTTGCT
GGCCAAGACC TATGGATAAT TATTGCCGGT GTGCAGTTAT TCTTTATTGC TTTTAATGTG
ATGGAAGCTA TTTTGCCATC GCTGATCAGT AAAGAATCAC CTGCTGGTTA CAAAGGCACT
GCCATGGGGA TTTATTCCAC CAGTCAGTTT ATTGGCGTAG CGATTGGCGG CAGTCTTGGG
GGCTGGATAT TTGGCCTTGA AGGTGCCGAT ATGGTATTTG CGGCTGGAGC TATTATCGCA
CTGGTATGGT TCGCCGTCAG TGTCACCATG CAAGAACCGC CTTATGTTAG CAGCCTGCGC
ATCACCTTGT CAGAGTCGGC TGTAAAAAAT ACAACCTTGG AAGAGCGCCT CAAAGCCCAG
CCAGGTGTTA CCGAGGCGGT CGTGGTCACG GCGGAACGCA GTGCTTATGT CAAAGTCGAT
ATTAAACAGA CTAATCGCAA CCAACTGGAA CAGTTGATCA ATGCGGCTTA A
 
Protein sequence
MAMNDNKMTP LELRATWGLG TVFSLRMLGM FMVLPVLTTY GMALSGASEA LIGIAIGIYG 
LSQAIFQIPF GLLSDRIGRK PMIIGGLLVF ALGSIIAALS DSIWGIILGR ALQGSGAIAA
AVMALLSDLT REQNRTKAMA FIGVSFGVTF AMAMVLGPIV THAFGLQALF WGIAILALLG
IVITLTVVPS ANSHVLNRES SMVKGSVSKV LHNSRLLKLN FGIMCLHILL MSSFVALPQM
MANAGLAPAQ HWVVYLVTML VSFAAVVPFI IYAEMKRRMK QVFMGCVAVL FIAEVVLWFA
GQDLWIIIAG VQLFFIAFNV MEAILPSLIS KESPAGYKGT AMGIYSTSQF IGVAIGGSLG
GWIFGLEGAD MVFAAGAIIA LVWFAVSVTM QEPPYVSSLR ITLSESAVKN TTLEERLKAQ
PGVTEAVVVT AERSAYVKVD IKQTNRNQLE QLINAA