Gene YpAngola_A2766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2766 
SymbolcysA 
ID5801238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2896315 
End bp2897406 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content52% 
IMG OID641340622 
Productsulfate/thiosulfate transporter subunit 
Protein accessionYP_001607156 
Protein GI162418455 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1118] ABC-type sulfate/molybdate transport systems, ATPase component 
TIGRFAM ID[TIGR00968] sulfate ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0027443 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCATTG AAATTAATAA TATCAGTAAG TATTTTGGCC GGACCAAGGT ACTGAATGAC 
ATTACGCTTG ATATTCCTTC AGGCCAAATG GTGGCTCTGC TCGGCCCTTC TGGTTCGGGT
AAAACCACCT TATTGCGGAT TATTGCCGGG TTGGAAAACC AAAATGCCGG GCGCTTGAGT
TTCCATGGCA CGGATGTCAG CCGGTTACAT GCTCGTGATC GTCGGGTCGG GTTTGTTTTC
CAACACTATG CGTTATTCCG CCACATGACG GTGTTCGATA ATATTGCTTT TGGTTTGACG
GTGTTACCGC GCCGCGAGCG CCCGAATGCG GCGGCCATTA AGCAGAAAGT GGGGCAATTA
CTGGAAATGG TGCAATTAGG GCATCTGGCC GAGCGTTTTC CATCACAGTT GTCTGGTGGT
CAAAAACAGC GGGTTGCTTT AGCGCGTGCA TTGGCGGTCG AACCACAGAT TTTGTTACTG
GATGAACCTT TTGGCGCGCT GGATGCACAG GTGCGTAAAG AGTTGCGCCG TTGGTTACGT
CAGTTGCATG AAGAACTGAA ATTCACCAGT GTCTTTGTCA CCCACGATCA GGAAGAAGCG
ATGGAGGTCG CAGATCGGGT GGTGGTGGTG AGTCAGGGCA ATATTGAGCA GGTGGGGACA
CCGGATGAAG TCTGGCGTGA ACCGGCAACC CGTTTTGTTT TGGAATTTCT GGGTGAAGTT
AACCGCCTGA GCGGGGAGAT CCGCGGTTCG CAGCTGTTTA TTGGCGCACA TCACTGGCCG
CTGGATCTTG CACCGATGCA TCAGGGCAGT GTCGATCTAT TCCTGCGCCC TTGGGAGATG
GAAGTGAGTA CGCAATCAAG TGATCGCTGC CCACTGCCGG TTCAGGTGCT TGAAGTTAGC
CCCCGTGGTC ACTTCTGGCA ATTAACCGTG CAACCCATCG GCTGGCATCA GGATCCGATC
AGTGTGGTGC TCCCAGAGGG GAATATTGAT GCCCCGGTGC GCGGCAACCG TTATTATGTT
GGCGGGTTAA ATGCACGCTT ATATTCTGGC AATCAATTAT TGCAACCCAT TGCTTTAGCC
CAAAGCGCCT GA
 
Protein sequence
MSIEINNISK YFGRTKVLND ITLDIPSGQM VALLGPSGSG KTTLLRIIAG LENQNAGRLS 
FHGTDVSRLH ARDRRVGFVF QHYALFRHMT VFDNIAFGLT VLPRRERPNA AAIKQKVGQL
LEMVQLGHLA ERFPSQLSGG QKQRVALARA LAVEPQILLL DEPFGALDAQ VRKELRRWLR
QLHEELKFTS VFVTHDQEEA MEVADRVVVV SQGNIEQVGT PDEVWREPAT RFVLEFLGEV
NRLSGEIRGS QLFIGAHHWP LDLAPMHQGS VDLFLRPWEM EVSTQSSDRC PLPVQVLEVS
PRGHFWQLTV QPIGWHQDPI SVVLPEGNID APVRGNRYYV GGLNARLYSG NQLLQPIALA
QSA