Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4637 |
Symbol | |
ID | 6143627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4737710 |
End bp | 4738849 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619453 |
Product | iron-sulfur cluster binding protein |
Protein accession | YP_001746561 |
Protein GI | 170680505 |
COG category | [C] Energy production and conversion |
COG ID | [COG1600] Uncharacterized Fe-S protein |
TIGRFAM ID | [TIGR00276] iron-sulfur cluster binding protein, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000171021 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0426509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGAGC CCCTCGATCT CAATCAGTTA GCGCAAAAAA TTAAACAGTG GGGGCTGGAA CTGGGCTTTC AGCAGGTAGG TATTACCGAT ACCGATCTCA GCGCGTCCGA GCCCAAACTG CAAGCATGGC TGGACAAACA ATACCACGGC GAAATGGACT GGATGGCGCG TCACGGTATG CTGCGTGCTC GTCCTCACGA ATTATTGCCC GGTACGCTGC GCGTGATCAG CGTGCGGATG AATTACCTTC CTGCTAACGC CGCATTTGCC AGCACGCTGA AAAACCCCAA ACTCGGCTAT GTTAGCCGTT ATGCGCTGGG CCGTGACTAT CACAAACTTC TGCGCAACCG ACTCAAAAAG CTGGGCGAGA TGATTCAGCA ACATTGTGTT TCGCTGAATT TTAGACCGTT TGTCGATTCT GCGCCTATTC TTGAGCGCCC GTTAGCTGAA AAAGCTGGGC TCGGCTGGAC AGGTAAGCAC TCACTTATCC TCAATCGCGA GGCCGGCTCG TTCTTCTTTT TAGGCGAATT GCTGGTCGAT ATTCCGCTAC CCGTGGATCA ACCAGTCGAG GAAGGATGCG GTAAATGCGT GGCCTGTATG ACGATTTGCC CGACCGGTGC CATCGTCGAG CCATATACCG TCGATGCTCG CCGCTGTATC TCTTATCTCA CCATCGAACT GGAAGGGGCG ATCCCGGTAG AGTTGCGACC ATTAATGGGA AACCGTATTT ACGGTTGCGA TGACTGCCAG CTTATCTGCC CGTGGAATCG CTATTCGCAA CTCACTACAG AAGACGATTT CAGCCCGCGT AAGCCGCTAC ACGCACCGGA ACTCATTGAG TTATTCGCCT GGAGCGAAGA GAAGTTTTTA AAAGTCACGG AAGGTTCGGC GATTCGCCGT ATAGGTCACC TGCGTTGGCT GCGTAATATC GCCGTAGCCT TAGGCAATGC CCCCTGGGAT GAAACGATTT TGACAGCGCT GGAAAGTCGT AAAGGTGAGC ACCCACTTCT TGATGAGCAC ATAGCGTGGG CGATTGCGCA GCAAATCGAG AGGCGAAATG CGTGCGTGGT CGAAGTGCAA TTACCGAAAA AACAGCGTCT GGTTCGGGTG ATTGAAAAAG GGTTACCGCG TGACGCCTGA
|
Protein sequence | MSEPLDLNQL AQKIKQWGLE LGFQQVGITD TDLSASEPKL QAWLDKQYHG EMDWMARHGM LRARPHELLP GTLRVISVRM NYLPANAAFA STLKNPKLGY VSRYALGRDY HKLLRNRLKK LGEMIQQHCV SLNFRPFVDS APILERPLAE KAGLGWTGKH SLILNREAGS FFFLGELLVD IPLPVDQPVE EGCGKCVACM TICPTGAIVE PYTVDARRCI SYLTIELEGA IPVELRPLMG NRIYGCDDCQ LICPWNRYSQ LTTEDDFSPR KPLHAPELIE LFAWSEEKFL KVTEGSAIRR IGHLRWLRNI AVALGNAPWD ETILTALESR KGEHPLLDEH IAWAIAQQIE RRNACVVEVQ LPKKQRLVRV IEKGLPRDA
|
| |