Gene YpAngola_A1071 details


Gene Information

Locus tag: YpAngola_A1071
Symbol:
ID: 5799534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1098319
End bp: 1099437
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 49%
IMG OID: 641339056
Product: ABC transporter ATP-binding protein
Protein accession: YP_001605628
Protein GI: 162418596
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
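
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1098319
END_BP = 1099437


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp)  # 1119
print(length_aa)  # 372
```

Both results agree with the Gene Length and Protein Length fields listed above.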


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.457538
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.825195
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATGACAG GAGTGATGTT ACATAACGTC AATAAAATAT ATCCAAACGG CATGCAGGTT 
CTGCATGATA TTAATTTGGA TATTAAAGAC GGTGAATTCA TGGTGATCGT TGGCCCCTCT
GGCTGCGCAA AATCAACGAT GTTGCGCATG ATTGCTGGCC TTGAAGAAAT ATCCAGCGGT
GAGCTCACTA TCGCGGACAG AAAATGTAAT GACGTGCAGC CTAAAGATCG CGGCATTGCC
ATGGTATTTC AAAACTATGC CCTCTATCCG CATATGACCG TGTACGAAAA TATGGCGTTT
GGCCTGAAGG TGGCGAAGAA GACCAAAGCT GAAATCGCTC AGCGAGTCAA TGAGGCCGCC
GAGTTACTGG AAATCACGGA TCTGCTGCAG CGCAAGCCGA AAGAGATGTC AGGCGGCCAG
TGCCAACGTG TGGCGTTAGG TCGGGCCATG GTCAGAAAAC CTGCGGTATT TTTATTCGAT
GAACCCCTGT CCAACCTAGA TGCCAAGCTG CGCGTCTCCA TGCGCCTGCG TATCAGTAAG
TTGCATCAAC AACTACGCGA ATCGGGTAGA CCGGCCACGA TGATTTACGT CACTCACGAT
CAGGTTGAGG CGATGACCAT GGGTGAACGC GTCTGTATCC TGCGCAAGGG GAAGATCATG
CAGGTTGACC GGCCAATGAA CGTCTATCGC CATCCGCTCA ATAAATTTGT TGCGGAGTTT
GTGGGGTCGC CAGCCATGAA CATGTTTAGC GGTCGCATAG AGGAATACCA AGAGCAATTG
GTTATTCGCT GTGGCGAATA TCGCTTTCCA CTGCCGGCGG ATAAAGTCGC TAAGAGTCGG
TCTTATCTGG GCCGGGAGAT CGATTTTGGG ATCAGGCCTG AGGATATCAG CATCAGTCGC
GCAGACGTAG CCAATGCGCT AACTGCCACG GTTATCAGAG TCGATGCGAT GGGTTCCGAT
GAGTTTATCT ACTTTAACTT GATGGGGCAG CAGGCTATCT GTAAACGGCC TAATCCTGAA
GATGAAATCC AAATCAATGA GACCTGCCAT TTCTCTTTTA ATCTGGATAA GTGCCATCTT
TTCGATAAAG AAAGCGAAAA AAATATTCTC TATATCTAA
 
Protein sequence
MMTGVMLHNV NKIYPNGMQV LHDINLDIKD GEFMVIVGPS GCAKSTMLRM IAGLEEISSG 
ELTIADRKCN DVQPKDRGIA MVFQNYALYP HMTVYENMAF GLKVAKKTKA EIAQRVNEAA
ELLEITDLLQ RKPKEMSGGQ CQRVALGRAM VRKPAVFLFD EPLSNLDAKL RVSMRLRISK
LHQQLRESGR PATMIYVTHD QVEAMTMGER VCILRKGKIM QVDRPMNVYR HPLNKFVAEF
VGSPAMNMFS GRIEEYQEQL VIRCGEYRFP LPADKVAKSR SYLGREIDFG IRPEDISISR
ADVANALTAT VIRVDAMGSD EFIYFNLMGQ QAICKRPNPE DEIQINETCH FSFNLDKCHL
FDKESEKNIL YI
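
The protein sequence can be related back to the gene sequence by translating it with translation table 11, as listed in the record. The following is a minimal sketch, not the database's own pipeline. It builds the codon table from the compact NCBI-style amino acid string (base order T, C, A, G) and assumes that a leading ATG, GTG, or TTG acts as an initiator and is read as M. The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI compact form; base order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these common bacterial start codons are treated as initiators.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record.
first_line = (
    "GTGATGACAG GAGTGATGTT ACATAACGTC "
    "AATAAAATAT ATCCAAACGG CATGCAGGTT"
)
print(translate(first_line))  # MMTGVMLHNVNKIYPNGMQV
```

The leading GTG is read as M because it is the start codon, while the GTG at codon 5 is read as V. This matches the first 20 residues of the protein sequence above.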