Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2577 |
Symbol | cysA |
ID | 6143535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2631186 |
End bp | 2632283 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641617448 |
Product | sulfate/thiosulfate transporter subunit |
Protein accession | YP_001744613 |
Protein GI | 170679900 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component |
TIGRFAM ID | [TIGR00968] sulfate ABC transporter, ATP-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTG AGATTGCCAA TATTAAGAAG TCTTTTGGTC GCACCCAGGT GCTGAACGAT ATCTCACTGG ATATTCCTTC TGGTCAGATG GTCGCGTTGC TGGGGCCGTC CGGTTCCGGG AAAACCACGC TGCTGCGGAT TATCGCCGGG CTGGAGCATC AAACCAGCGG GCATATTCGC TTCCACGGCA CCGACGTTAG CCGCCTGCAT GCGCGCGACC GTAAAGTCGG TTTCGTGTTC CAGCATTACG CGTTGTTCCG CCATATGACG GTATTCGACA ATATCGCTTT TGGCCTGACG GTGCTACCGC GTCGCGAGCG CCCGAATGCG GCGGCGATCA AAGCGAAAGT GACAAAATTG CTGGAGATGG TGCAGCTTGC CCATCTGGCA GACCGCTATC CTGCGCAGCT TTCCGGCGGC CAGAAACAGC GTGTGGCGCT GGCGCGCGCG CTGGCTGTAG AACCGCAAAT TCTGCTGCTT GATGAACCGT TTGGCGCGCT GGATGCGCAG GTGCGTAAAG AGTTGCGTCG CTGGCTGCGT CAGTTACATG AAGAGCTGAA ATTCACCAGC GTGTTTGTGA CCCATGACCA GGAAGAAGCG ACCGAAGTAG CCGATCGTGT AGTGGTGATG AGCCAGGGCA ATATCGAGCA GGCTGATGCA CCGGACCAGG TCTGGCGCGA ACCAGCGACC CGTTTTGTAC TCGAATTTAT GGGCGAAGTT AACCGCCTGC AGGGAACCAT TCGCGGCGGG CAGTTCCATG TTGGCGCACA TCGCTGGCCG CTGGGCTATA CACCTGCGTA TCAGGGGCCA GTGGATCTCT TCCTGCGTCC GTGGGAAGTG GATATCAGCC GCCGTACCAG CCTCGATTCG CCGCTGCCGG TACAGGTACT GGAAGCCAGC CCGAAAGGCC ACTACACCCA ATTAGTCGTG CAGCCGCTGG GGTGGTACAA CGAAGCGCTG ACGGTCGTGA TGCACGGCGA CGACGCCCCG CAACGTGGCG AGCGTTTATT CGTTGGTCTG CAACATGCGC GGCTGTATAA CGGCGACGAG CGTATTGAAA CCCGCGATGA GGAACTTGCT CTCGCACAAA GCGCCTGA
|
Protein sequence | MSIEIANIKK SFGRTQVLND ISLDIPSGQM VALLGPSGSG KTTLLRIIAG LEHQTSGHIR FHGTDVSRLH ARDRKVGFVF QHYALFRHMT VFDNIAFGLT VLPRRERPNA AAIKAKVTKL LEMVQLAHLA DRYPAQLSGG QKQRVALARA LAVEPQILLL DEPFGALDAQ VRKELRRWLR QLHEELKFTS VFVTHDQEEA TEVADRVVVM SQGNIEQADA PDQVWREPAT RFVLEFMGEV NRLQGTIRGG QFHVGAHRWP LGYTPAYQGP VDLFLRPWEV DISRRTSLDS PLPVQVLEAS PKGHYTQLVV QPLGWYNEAL TVVMHGDDAP QRGERLFVGL QHARLYNGDE RIETRDEELA LAQSA
|
| |