Gene YpAngola_A1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1685 
Symbol 
ID5800156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1737067 
End bp1738059 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content44% 
IMG OID641339625 
ProductTRAP transporter solute receptor DctP family protein 
Protein accessionYP_001606181 
Protein GI162418921 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00806024 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0274262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTGC CAACGTCTTT CCCTACATTA TGTTTAACCG CCTCGGCGTT GTTACTGAGC 
CAAGTGCTAT TTAGCCAATC CGCAACTGCC CAACAAATTA AAGCAGCCGA TGTTCATCCT
AAAGATTATC CCAATGTCGT GGCGATGAAA GATATGGGCG AAAAACTAAA AGTCGCGACT
GATGGCCGTC TGGAGATGAA AACATTTCCT GGTGGAGTAT TAGGTGATGA GAAGCAAATG
ATAGAACAGG CGCAAATGGG GGCAATCGAT ATTATTCGTG TATCCATGAC ACCCGTTGCC
TCTATTTTAC CGGAGATTAA TGTCTTTACC CTCCCCTATA TCTTCCGCGA CGAAGATCAT
TTACATAAGG TCCTTGATGG TAAAATTGGT CAGGAAATAG GCGGTAAAAT CACCGATAAC
AAAAATTCAA AATTGGTGTT CTTAGGTTGG ATGGATGGCG GTACCCGTCA CCTGATTACT
AAACAACCAG TGATTAAACC GGAAGATCTC AGCGGGATGA AAATCCGTGT TCAAGGCAGC
CCGATTGCGT TAGCAACGCT AAAATCAATG GGGGCCAATG CCCTGAGTAT GGGCGTCAGT
GAAGTGTTCA GCGGTATGCA AACCGGGGTC ATTGATGGCA CCGAAAATAA CCAGCCTACA
TTTGTAGCCC ATAATTACCT GCCTGTTGTC AAAAATTATA CCCTGAGCGG CCATTTCATT
ATTCCCGAAG TATTCCTGTA TTCGAAAGCC AAGTGGGACA AATTATCTGC CGAAGATCAG
CAAACCATTC TTAAATTAGC AAAAGAAGCA CAAGCTGAGC AACGCGTATT ATGGACAGCC
TATGAGCAAC AAGCTCATGA GAAAATGAAG GCCGGCGGTG TGCAGTATCA TGAGATTGAT
CGTGATTATT ATTATAAAGC CACTCAGCCC GTACGCGATG AGTTCGGCAA AGGTCATGAG
GACTTGATTA AACAGATTGG TGACGTCCAA TAA
 
Protein sequence
MRLPTSFPTL CLTASALLLS QVLFSQSATA QQIKAADVHP KDYPNVVAMK DMGEKLKVAT 
DGRLEMKTFP GGVLGDEKQM IEQAQMGAID IIRVSMTPVA SILPEINVFT LPYIFRDEDH
LHKVLDGKIG QEIGGKITDN KNSKLVFLGW MDGGTRHLIT KQPVIKPEDL SGMKIRVQGS
PIALATLKSM GANALSMGVS EVFSGMQTGV IDGTENNQPT FVAHNYLPVV KNYTLSGHFI
IPEVFLYSKA KWDKLSAEDQ QTILKLAKEA QAEQRVLWTA YEQQAHEKMK AGGVQYHEID
RDYYYKATQP VRDEFGKGHE DLIKQIGDVQ