Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3652 |
Symbol | cysA |
ID | 6968118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3367265 |
End bp | 3368362 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643387446 |
Product | sulfate/thiosulfate transporter subunit |
Protein accession | YP_002271899 |
Protein GI | 209398303 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component |
TIGRFAM ID | [TIGR00968] sulfate ABC transporter, ATP-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.992152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTG AGATTGCCAA TATTAAGAAG TCGTTTGGTC GCACCCAGGT GCTGAACGAT ATCTCACTGG ATATTCCTTC AGGTCAGATG GTCGCGTTGC TGGGGCCGTC CGGTTCCGGG AAAACCACGC TGCTGCGCAT TATCGCTGGG CTGGAGCATC AAACCAGCGG GCATATTCGC TTCCACGGCA CCGACGTGAG CCGCCTGCAC GCACGTGATC GTAAAGTCGG TTTCGTGTTC CAGCATTACG CGCTGTTCCG CCATATGACG GTGTTCGACA ATATCGCTTT TGGCCTGACG GTGCTGCCGC GTCGCGAGCG CCCGAATGCC GCAGCCATCA AAGCGAAAGT GACAAAATTG CTGGAAATGG TCCAGCTTGC CCATCTGGCG GATCGTTATC CGGCGCAGCT TTCCGGCGGC CAGAAACAGC GTGTAGCGCT GGCGCGTGCG CTTGCTGTAG AACCGCAAAT TCTGCTGCTT GATGAACCGT TTGGCGCGCT GGATGCGCAG GTGCGTAAAG AGCTGCGTCG ATGGCTGCGT CAACTCCATG AAGAACTAAA ATTCACCAGC GTTTTTGTGA CCCACGATCA GGAAGAAGCG ACCGAAGTAG CTGATCGTGT AGTTGTGATG AGCCAGGGCA ATATTGAACA GGCTGACGCG CCGGATCAGG TATGGCGCGA ACCGGCGACC CGTTTTGTGC TCGAATTTAT GGGCGAAGTG AACCGCCTGC AGGGAACCAT TCGCGGCGGG CAGTTCCATG TTGGCGCACA TCGCTGGCCG CTGGGCTACA CACCTGCGTA TCAGGGGCCG GTGGATCTCT TCCTGCGCCC GTGGGAAGTG GATATCAGCC GCCGTACCAG CCTCGATTCG CCGCTGCCGG TACAGGTACT GGAAGCCAGC CCGAAAGGTC ACTACACCCA ATTAGTGGTG CAGCCGCTGG GGTGGTACAA CGAACCGCTG ACGGTCGTGA TGCATGGCGA CGATGCCCCG CAGCGTGGCG AGCGTTTATT CGTTGGTCTG CAACATGCGC GGCTGTATAA CGGCGACGAG CGTATCGAAC CCCGAGATGA GGAACTTGCT CTCGCACAAA GCGCCTGA
|
Protein sequence | MSIEIANIKK SFGRTQVLND ISLDIPSGQM VALLGPSGSG KTTLLRIIAG LEHQTSGHIR FHGTDVSRLH ARDRKVGFVF QHYALFRHMT VFDNIAFGLT VLPRRERPNA AAIKAKVTKL LEMVQLAHLA DRYPAQLSGG QKQRVALARA LAVEPQILLL DEPFGALDAQ VRKELRRWLR QLHEELKFTS VFVTHDQEEA TEVADRVVVM SQGNIEQADA PDQVWREPAT RFVLEFMGEV NRLQGTIRGG QFHVGAHRWP LGYTPAYQGP VDLFLRPWEV DISRRTSLDS PLPVQVLEAS PKGHYTQLVV QPLGWYNEPL TVVMHGDDAP QRGERLFVGL QHARLYNGDE RIEPRDEELA LAQSA
|
| |