Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2769 |
Symbol | cysP |
ID | 5801241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 2899104 |
End bp | 2900141 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641340625 |
Product | thiosulfate transporter subunit |
Protein accession | YP_001607159 |
Protein GI | 162420937 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.00135265 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAGA ACTCAGTCAA AAAAAATGCG CGTAACGGAT TGACCCACAT TGCATTATTT AGTGCGCTGA TACTCAGTGG TGTCACTGCC TCAGCCACTG AGTTGCTCAA TAGCTCTTAT GATGTTTCCC GTGAATTATT CACGGCGCTG AATCCGGGTT TTCAGGCACA GTGGGAACAG CAGAACCCCG GTGACAAACT GACTATCAAA CAATCCCATG CGGGTTCTTC CAAGCAGGCA TTGGCTATCT TACAAGGCCT GAAAGCCGAT GTGGTCACCT ACAACCAGAT TACTGATGTG CAAATCCTTC ATGACCGTGG CAATTTGATC CCAGCCGATT GGCAAGCCCG TTTGCCGAAT AACAGCTCAC CGTTTTACTC CACCATGGCG TTCCTGGTCC GCAAAGGCAA TCCAAAAAAT ATCCGCAATT GGGATGATTT GGTCCGCGAA GACGTCAAGC TGGTTTTCCC TAATCCTAAA ACCTCCGGTA ATGGCCGCTA TACCTATCTG GCTGCTTGGG GAGCGACCCA ACTGGCCGAT GGCGGCGATG AGGCGAAAAC CCGCGACTGG ATGAAACGCT TTTTGAAAAA TGTTGAAGTG TTTGATACCG GTGGCCGTGG GGCAACCACG ACATTCGTTG AACGTGGCTT GGGTGATGTA CTGATCAGTT TCGAATCGGA AGTTAACAAT ATTCGCAAAG AGTACGGCAG CGACCAATAC GAAGTTATTG TCCCGCCAGT GGATATTTTG GCTGAGTTCC CGGTTGCCTG GGTGGATAAA AATGTAGAAA AGAATGGCAC TGAAAAAGCC GCCAAAGCTT ATCTCAATTA TCTCTATAGC CCGCCAGCAC AGCAGGTGAT CACTCGCTTC AATTACCGTG TTTACGATAA AGCGGCAATG GAGGCGGCGA AATCTCAATT CCCAGACACC AAATTATTCC GAGTTGAAGA TCAATTCGGT AGCTGGCCAC AGGTGATGAG TACCCATTTT ACGACTGGCG GCGTACTGGA TAAATTGTTA GCAGAAGGGC ATCAGTAA
|
Protein sequence | MKQNSVKKNA RNGLTHIALF SALILSGVTA SATELLNSSY DVSRELFTAL NPGFQAQWEQ QNPGDKLTIK QSHAGSSKQA LAILQGLKAD VVTYNQITDV QILHDRGNLI PADWQARLPN NSSPFYSTMA FLVRKGNPKN IRNWDDLVRE DVKLVFPNPK TSGNGRYTYL AAWGATQLAD GGDEAKTRDW MKRFLKNVEV FDTGGRGATT TFVERGLGDV LISFESEVNN IRKEYGSDQY EVIVPPVDIL AEFPVAWVDK NVEKNGTEKA AKAYLNYLYS PPAQQVITRF NYRVYDKAAM EAAKSQFPDT KLFRVEDQFG SWPQVMSTHF TTGGVLDKLL AEGHQ
|
| |