Gene EcDH1_3827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3827 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4120514 
End bp4121653 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content53% 
IMG OID 
Productiron-sulfur cluster binding protein 
Protein accessionACX41429 
Protein GI260451007 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000294328 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAGC CCCTCGATCT CAATCAGTTA GCGCAAAAAA TTAAACAGTG GGGGCTGGAA 
CTGGGCTTTC AGCAGGTAGG TATTACCGAT ACCGATCTCA GCGAGTCCGA GCCCAAACTG
CAAGCATGGC TGGACAAACA ATACCACGGC GAAATGGACT GGATGGCACG TCACGGTATG
CTGCGCGCTC GCCCTCATGA GTTATTGCCC GGTACGCTGC GCGTGATCAG CGTGCGGATG
AATTACCTTC CTGCTAACGC CGCATTTGCC AGCACGCTGA AAAACCCCAA ACTCGGCTAT
GTTAGCCGTT ATGCGCTGGG CCGTGACTAT CACAAACTTC TGCGCAACCG ACTCAAAAAG
CTGGGCGAGA TGATTCAGCA ACATTGTGTT TCGCTGAATT TTAGACCGTT TGTCGATTCT
GCGCCTATTC TCGAGCGCCC GTTAGCTGAA AAAGCTGGGC TCGGCTGGAC AGGTAAGCAC
TCACTTATCC TCAATCGCGA GGCCGGTTCG TTCTTCTTTT TAGGCGAATT GCTGGTCGAT
ATTCCGCTGC CCGTGGATCA ACCAGTCGAG GAAGGATGCG GCAAATGCGT GGCCTGTATG
ACGATTTGCC CGACCGGTGC CATCGTCGAG CCATATACCG TCGATGCTCG CCGCTGTATC
TCTTATCTCA CCATCGAACT TGAAGGGGCG ATCCCGGAAG AGTTGCGACC GTTAATGGGA
AACCGTATTT ACGGTTGCGA TGACTGCCAG CTTATCTGCC CGTGGAATCG CTATTCACAA
CTCACCACAG AAGAGGATTT CAGCCCGCGT AAGCCGCTAC ACGCACCGGA ACTCATTGAG
TTATTCGCCT GGAGCGAAGA GAAGTTTTTA AAAGTCACGG AAGGATCGGC GATTCGTCGT
ATTGGTCACC TGCGTTGGCT GCGTAATATC GCCGTAGCAT TAGGCAATGC CCCTTGGGAT
GAAACGATTT TGACAGCGCT GGAAAGTCGT AAAGGTGAGC ACCCACTTCT CGATGAGCAC
ATAGCGTGGG CGATTGCGCA GCAAATAGAG AGACGAAATG CGTGCATAGT CGAAGTGCAA
TTGCCGAAAA AACAGCGTCT GGTTCGGGTG ATTGAAAAAG GGTTACCGCG TGACGCCTGA
 
Protein sequence
MSEPLDLNQL AQKIKQWGLE LGFQQVGITD TDLSESEPKL QAWLDKQYHG EMDWMARHGM 
LRARPHELLP GTLRVISVRM NYLPANAAFA STLKNPKLGY VSRYALGRDY HKLLRNRLKK
LGEMIQQHCV SLNFRPFVDS APILERPLAE KAGLGWTGKH SLILNREAGS FFFLGELLVD
IPLPVDQPVE EGCGKCVACM TICPTGAIVE PYTVDARRCI SYLTIELEGA IPEELRPLMG
NRIYGCDDCQ LICPWNRYSQ LTTEEDFSPR KPLHAPELIE LFAWSEEKFL KVTEGSAIRR
IGHLRWLRNI AVALGNAPWD ETILTALESR KGEHPLLDEH IAWAIAQQIE RRNACIVEVQ
LPKKQRLVRV IEKGLPRDA