Gene EcDH1_1961 details

Gene Information

Locus tag: EcDH1_1961
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2116687
End bp: 2117958
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: FeS assembly protein SufD
Protein accession: ACX39618
Protein GI: 260449196
COG category:
COG ID:
TIGRFAM ID:
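
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2116687
end_bp = 2117958
gene_length_bp = 1272
protein_length_aa = 423


def span_length(start, end):
    """Feature length, assuming inclusive 1-based start and end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino acid count for a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```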


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.0846147
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGGCT TACCGAACAG CAGTAACGCG CTGCAACAGT GGCATCACTT GTTTGAAGCT 
GAAGGGACAA AACGCTCCCC GCAAGCACAG CAGCATTTAC AACAATTGCT GCGTACCGGA
CTGCCGACAC GTAAACATGA AAACTGGAAA TATACGCCGC TGGAAGGGCT GATCAATAGC
CAGTTTGTCA GCATTGCGGG AGAGATATCC CCACAGCAGC GTGATGCCTT AGCGTTAACG
TTAGACTCCG TGCGGCTGGT GTTTGTCGAT GGGCGTTACG TGCCCGCACT GAGCGATGCA
ACTGAAGGCA GCGGATATGA AGTGAGCATT AACGACGACC GTCAGGGTTT ACCCGACGCT
ATTCAGGCGG AAGTGTTTCT GCATTTGACG GAAAGCCTGG CACAAAGCGT GACGCATATC
GCCGTGAAGC GCGGTCAACG GCCGGCAAAG CCATTGCTGT TAATGCATAT CACCCAGGGC
GTGGCAGGTG AAGAGGTGAA CACTGCCCAT TACCGACATC ATCTGGATCT GGCGGAAGGT
GCCGAAGCAA CGGTGATCGA ACATTTTGTC AGCCTGAATG ATGCTCGTCA TTTTACCGGG
GCACGGTTCA CTATCAACGT CGCAGCGAAT GCCCACTTGC AGCATATCAA GCTGGCGTTT
GAAAACCCGC TCAGTCACCA CTTTGCTCAT AACGATTTGT TGCTGGCTGA GGATGCCACC
GCATTTAGCC ACAGTTTCCT GCTGGGTGGC GCAGTGTTAC GACACAACAC CAGTACGCAA
CTCAATGGCG AAAACAGCAC GCTGCGGATC AATAGCCTGG CGATGCCGGT GAAAAACGAG
GTGTGTGATA CCCGTACCTG GCTGGAACAC AATAAAGGTT TTTGTAACAG CCGACAGTTG
CACAAAACTA TCGTCAGCGA CAAAGGCCGC GCGGTATTTA ACGGTTTGAT CAACGTCGCG
CAGCACGCCA TCAAAACGGA TGGTCAGATG ACCAACAACA ATCTGCTGAT GGGCAAACTG
GCGGAAGTGG ATACGAAACC GCAGCTGGAA ATCTATGCAG ATGATGTGAA ATGCAGCCAC
GGCGCGACGG TGGGGCGTAT TGATGATGAA CAGATATTCT ATCTGCGCTC GCGCGGGATC
AATCAGCAGG ATGCCCAGCA GATGATCATT TACGCCTTCG CTGCCGAACT GACGGAAGCA
CTGCGTGATG AGGGGCTTAA ACAGCAGGTG CTGGCCCGAA TCGGTCAACG GCTGCCAGGA
GGTGCAAGAT GA
 
Protein sequence
MAGLPNSSNA LQQWHHLFEA EGTKRSPQAQ QHLQQLLRTG LPTRKHENWK YTPLEGLINS 
QFVSIAGEIS PQQRDALALT LDSVRLVFVD GRYVPALSDA TEGSGYEVSI NDDRQGLPDA
IQAEVFLHLT ESLAQSVTHI AVKRGQRPAK PLLLMHITQG VAGEEVNTAH YRHHLDLAEG
AEATVIEHFV SLNDARHFTG ARFTINVAAN AHLQHIKLAF ENPLSHHFAH NDLLLAEDAT
AFSHSFLLGG AVLRHNTSTQ LNGENSTLRI NSLAMPVKNE VCDTRTWLEH NKGFCNSRQL
HKTIVSDKGR AVFNGLINVA QHAIKTDGQM TNNNLLMGKL AEVDTKPQLE IYADDVKCSH
GATVGRIDDE QIFYLRSRGI NQQDAQQMII YAFAAELTEA LRDEGLKQQV LARIGQRLPG
GAR
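
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It builds a codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, which does not matter here because the gene begins with ATG). Only the first three and the last two 10-base blocks of the gene sequence are used; the helper names are hypothetical, and the record does not say how the database computed its GC content figure, so `gc_content` is just the usual (G + C) / length definition.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to format sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_blocks = "ATGGCTGGCT TACCGAACAG CAGTAACGCG"  # start of the gene sequence
last_blocks = "GGTGCAAGAT GA"                      # end of the gene sequence

print(translate(first_blocks))  # MAGLPNSSNA
print(translate(last_blocks))   # GAR
```

The two outputs match the first ten and the last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.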