Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_02283 |
Symbol | cysA |
ID | 8113399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 2411429 |
End bp | 2412526 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644848487 |
Product | hypothetical protein |
Protein accession | YP_003000060 |
Protein GI | 251785756 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component |
TIGRFAM ID | [TIGR00968] sulfate ABC transporter, ATP-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTG AGATTGCCAA TATTAAGAAG TCTTTTGGTC GCACCCAGGT GCTGAACGAT ATCTCACTGG ATATTCCTTC TGGTCAGATG GTCGCGTTGC TGGGGCCGTC CGGTTCCGGG AAAACCACGC TGCTGCGCAT TATCGCCGGG CTGGAGCATC AAACCAGCGG GCATATTCGC TTTCACGGCA CCGACGTGAG CCGCCTGCAC GCACGCGATC GTAAAGTCGG TTTCGTGTTC CAGCATTACG CGCTGTTCCG CCATATGACG GTGTTCGACA ATATCGCTTT TGGCCTGACG GTGCTGCCGC GTCGCGAGCG CCCGAATGCG GCGGCGATCA AAGCGAAAGT GATAAAATTG CTGGAGATGG TGCAGCTTGC GCATCTGGCG GATCGTTATC CGGCGCAGCT TTCCGGCGGG CAGAAACAGC GTGTGGCGCT GGCGCGTGCC CTTGCCGTGG AGCCGCAAAT TCTGCTGCTT GATGAACCGT TTGGCGCGCT GGATGCGCAG GTGCGTAAAG AGCTGCGTCG CTGGCTGCGT CAACTGCATG AAGAGCTGAA ATTCACCAGC GTGTTCGTGA CCCACGACCA GGAAGAAGCG ACCGAAGTAG CCGATCGTGT AGTGGTGATG AGCCAGGGCA ATATTGAACA GGCTGATGCG CCGGATCAGG TATGGCGCGA ACCAGCGACC CGTTTTGTAC TCGAATTTAT GGGCGAAGTG AACCGCCTGC AGGGAACCAT TCGCGGCGGG CAGTTCCATG TTGGCGCACA TCGCTGGCCG CTGGGCTATA CACCTGCGTA TCAGGGGCCG GTGGATCTCT TCCTGCGCCC TTGGGAAGTG GATATCAGCC GCCGTACCAG CCTCGATTCG CCGCTGCCGG TACAGGTACT GGAAGCCAGC CCGAAAGGTC ACTACACCCA ATTAGTGGTG CAGCCGCTGG GGTGGTACAA CGAACCGCTG ACGGTCGTGA TGCATGGCGA CGATGCCCCG CAGCGTGGCG AGCGTTTATT CGTTGGTCTG CAACATGCGC GGCTGTATAA CGGCGACGAG CGTATCGAAA CCCGAGATGA GGAACTTGCT CTCGCACAAA GCGCCTGA
|
Protein sequence | MSIEIANIKK SFGRTQVLND ISLDIPSGQM VALLGPSGSG KTTLLRIIAG LEHQTSGHIR FHGTDVSRLH ARDRKVGFVF QHYALFRHMT VFDNIAFGLT VLPRRERPNA AAIKAKVIKL LEMVQLAHLA DRYPAQLSGG QKQRVALARA LAVEPQILLL DEPFGALDAQ VRKELRRWLR QLHEELKFTS VFVTHDQEEA TEVADRVVVM SQGNIEQADA PDQVWREPAT RFVLEFMGEV NRLQGTIRGG QFHVGAHRWP LGYTPAYQGP VDLFLRPWEV DISRRTSLDS PLPVQVLEAS PKGHYTQLVV QPLGWYNEPL TVVMHGDDAP QRGERLFVGL QHARLYNGDE RIETRDEELA LAQSA
|
| |