Gene EcDH1_3546 details

Gene Information

Locus tag: EcDH1_3546
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3816675
End bp: 3817961
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: SurA domain protein
Protein accession: ACX41160
Protein GI: 260450738
COG category:
COG ID:
TIGRFAM ID:
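
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; neither is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3816675, 3817961)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1287 428
```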


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.00416172
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAACT GGAAAACGCT GCTTCTCGGT ATCGCCATGA TCGCGAATAC CAGTTTCGCT 
GCCCCCCAGG TAGTCGATAA AGTCGCAGCC GTCGTCAATA ACGGCGTCGT GCTGGAAAGC
GACGTTGATG GATTAATGCA GTCGGTAAAA CTGAACGCTG CTCAGGCAAG GCAGCAACTT
CCTGATGACG CGACGCTGCG CCACCAAATC ATGGAACGTT TGATCATGGA TCAAATCATC
CTGCAGATGG GGCAGAAAAT GGGAGTGAAA ATCTCCGATG AGCAGCTGGA TCAGGCGATT
GCTAACATCG CGAAACAGAA CAACATGACG CTGGATCAGA TGCGCAGCCG TCTGGCTTAC
GATGGACTGA ACTACAACAC CTATCGTAAC CAGATCCGCA AAGAGATGAT TATCTCTGAA
GTGCGTAACA ACGAGGTGCG TCGTCGCATC ACCATCCTGC CGCAGGAAGT CGAATCCCTG
GCGCAGCAGG TGGGTAACCA AAACGACGCC AGCACTGAGC TGAACCTGAG CCACATCCTG
ATCCCGCTGC CGGAAAACCC GACCTCTGAT CAGGTGAACG AAGCGGAAAG CCAGGCGCGC
GCCATTGTCG ATCAGGCGCG TAACGGCGCT GATTTCGGTA AGCTGGCGAT TGCTCATTCT
GCCGACCAGC AGGCGCTGAA CGGCGGCCAG ATGGGCTGGG GCCGTATTCA GGAGTTGCCC
GGGATCTTCG CCCAGGCATT AAGCACCGCG AAGAAAGGCG ACATTGTTGG CCCGATTCGT
TCCGGCGTTG GCTTCCATAT TCTGAAAGTT AACGACCTGC GCGGCGAAAG CAAAAATATC
TCGGTGACCG AAGTTCATGC TCGCCATATT CTGCTGAAAC CGTCGCCGAT CATGACTGAC
GAACAGGCCC GTGTGAAACT GGAACAGATT GCTGCTGATA TCAAGAGTGG TAAAACGACT
TTTGCTGCCG CAGCGAAAGA GTTCTCTCAG GATCCAGGCT CTGCTAACCA GGGCGGCGAT
CTCGGCTGGG CTACACCAGA TATTTTCGAT CCGGCCTTCC GTGACGCCCT GACTCGCCTG
AACAAAGGTC AAATGAGTGC ACCGGTTCAC TCTTCATTCG GCTGGCATTT AATCGAACTG
CTGGATACCC GTAATGTCGA TAAAACCGAC GCTGCGCAGA AAGATCGTGC ATACCGCATG
CTGATGAACC GTAAGTTCTC GGAAGAAGCA GCAAGCTGGA TGCAGGAACA ACGTGCCAGC
GCCTACGTTA AAATCCTGAG CAACTAA
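
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before computing anything from it. The sketch below shows one way to do that and to compute the GC fraction. It is run here only on the first two blocks of the sequence; how the record's own GC content figure was computed or rounded is not stated, so no claim is made about reproducing it. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGAAGAACT GGAAAACGCT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Pasting the full gene sequence into `clean_sequence` should give a string of 1287 bases, matching the listed gene length.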
 
Protein sequence
MKNWKTLLLG IAMIANTSFA APQVVDKVAA VVNNGVVLES DVDGLMQSVK LNAAQARQQL 
PDDATLRHQI MERLIMDQII LQMGQKMGVK ISDEQLDQAI ANIAKQNNMT LDQMRSRLAY
DGLNYNTYRN QIRKEMIISE VRNNEVRRRI TILPQEVESL AQQVGNQNDA STELNLSHIL
IPLPENPTSD QVNEAESQAR AIVDQARNGA DFGKLAIAHS ADQQALNGGQ MGWGRIQELP
GIFAQALSTA KKGDIVGPIR SGVGFHILKV NDLRGESKNI SVTEVHARHI LLKPSPIMTD
EQARVKLEQI AADIKSGKTT FAAAAKEFSQ DPGSANQGGD LGWATPDIFD PAFRDALTRL
NKGQMSAPVH SSFGWHLIEL LDTRNVDKTD AAQKDRAYRM LMNRKFSEEA ASWMQEQRAS
AYVKILSN
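
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; since this gene starts with ATG, no special handling of alternative start codons is included. `translate` is an illustrative helper, not part of any tool referenced by this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycling through TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (each starts in frame).
print(translate("ATGAAGAACT GGAAAACGCT GCTTCTCGGT"))  # MKNWKTLLLG
print(translate("GCCTACGTTA AAATCCTGAG CAACTAA"))     # AYVKILSN
```

Each full line of the gene sequence holds 60 bases (20 codons), so every line starts in frame and can be translated on its own for spot checks like the two above.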