Gene EcolC_3847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3847 
Symbol 
ID6064404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4203009 
End bp4204148 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content53% 
IMG OID641603259 
Productiron-sulfur cluster binding protein 
Protein accessionYP_001726778 
Protein GI170021824 
COG category[C] Energy production and conversion 
COG ID[COG1600] Uncharacterized Fe-S protein 
TIGRFAM ID[TIGR00276] iron-sulfur cluster binding protein, putative 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000133072 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000262117 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAGAGC CCCTCGATCT CAATCAGTTA GCGCAAAAAA TTAAACAGTG GGGGCTGGAA 
CTGGGCTTTC AGCAGGTAGG TATTACCGAT ACCGATCTCA GCGAGTCCGA GCCCAAACTG
CAAGCATGGC TGGACAAACA ATACCACGGC GAAATGGACT GGATGGCACG TCACGGTATG
CTGCGCGCTC GCCCTCATGA GTTATTGCCC GGTACGCTGC GCGTGATCAG CGTGCGGATG
AATTACCTTC CTGCTAACGC CGCATTTGCC AGCACGCTGA AAAACCCCAA ACTCGGCTAT
GTTAGCCGTT ATGCGCTGGG CCGTGACTAT CACAAACTTC TGCGCAACCG ACTCAAAAAG
CTGGGCGAGA TGATTCAGCA ACATTGTGTT TCGCTGAATT TTAGACCGTT TGTCGATTCT
GCGCCTATTC TCGAGCGCCC GTTAGCTGAA AAAGCTGGGC TCGGCTGGAC AGGTAAGCAC
TCACTTATCC TCAATCGCGA GGCCGGTTCG TTCTTCTTTT TAGGCGAATT GCTGGTCGAT
ATTCCGCTGC CCGTGGATCA ACCAGTCGAG GAAGGATGCG GCAAATGCGT GGCCTGTATG
ACGATTTGCC CGACCGGTGC CATCGTCGAG CCATATACCG TCGATGCTCG CCGCTGTATC
TCTTATCTCA CCATCGAACT TGAAGGGGCG ATCCCGGAAG AGTTGCGACC GTTAATGGGA
AACCGTATTT ACGGTTGCGA TGACTGCCAG CTTATCTGCC CGTGGAATCG CTATTCACAA
CTCACCACAG AAGAGGATTT CAGCCCGCGT AAGCCGCTAC ACGCACCGGA ACTCATTGAG
TTATTCGCCT GGAGCGAAGA GAAGTTTTTA AAAGTCACGG AAGGATCGGC GATTCGTCGT
ATTGGTCACC TGCGTTGGCT GCGTAATATC GCCGTAGCAT TAGGCAATGC CCCTTGGGAT
GAAACGATTT TGACAGCGCT GGAAAGTCGT AAAGGTGAGC ACCCACTTCT CGATGAGCAC
ATAGCGTGGG CGATTGCGCA GCAAATAGAG AGACGAAATG CGTGCATAGT CGAAGTGCAA
TTGCCGAAAA AACAGCGTCT GGTTCGGGTG ATTGAAAAAG GGTTACCGCG TGACGCCTGA
 
Protein sequence
MSEPLDLNQL AQKIKQWGLE LGFQQVGITD TDLSESEPKL QAWLDKQYHG EMDWMARHGM 
LRARPHELLP GTLRVISVRM NYLPANAAFA STLKNPKLGY VSRYALGRDY HKLLRNRLKK
LGEMIQQHCV SLNFRPFVDS APILERPLAE KAGLGWTGKH SLILNREAGS FFFLGELLVD
IPLPVDQPVE EGCGKCVACM TICPTGAIVE PYTVDARRCI SYLTIELEGA IPEELRPLMG
NRIYGCDDCQ LICPWNRYSQ LTTEEDFSPR KPLHAPELIE LFAWSEEKFL KVTEGSAIRR
IGHLRWLRNI AVALGNAPWD ETILTALESR KGEHPLLDEH IAWAIAQQIE RRNACIVEVQ
LPKKQRLVRV IEKGLPRDA