Gene EcDH1_1341 details


Gene Information

Locus tag: EcDH1_1341
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1444030
End bp: 1445298
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: FolC bifunctional protein
Protein accession: ACX39013
Protein GI: 260448591
COG category:
COG ID:
TIGRFAM ID:
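
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record, although both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1444030
end_bp = 1445298

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should contain a whole number of codons.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1269
print(remainder)          # 0
print(protein_length_aa)  # 422
```

Both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields.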


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.420522
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTATCA AACGCACTCC TCAAGCCGCG TCGCCTCTGG CTTCGTGGCT TTCTTATCTG 
GAAAACCTGC ACAGTAAAAC TATCGATCTC GGCCTTGAGC GCGTGAGCCT GGTCGCGGCG
CGTCTTGGCG TCCTGAAACC AGCGCCATTT GTGTTTACCG TTGCGGGTAC GAATGGCAAA
GGCACCACCT GCCGTACGCT GGAGTCGATT CTGATGGCGG CAGGGTACAA AGTGGGCGTC
TACAGTTCGC CTCATCTGGT GCGTTATACC GAGCGCGTAC GTGTGCAGGG CCAGGAATTG
CCGGAATCGG CCCACACCGC CTCTTTTGCG GAGATTGAAT CGGCACGCGG TGATATTTCC
CTGACCTATT TCGAGTACGG TACGCTGTCG GCGTTGTGGC TGTTCAAGCA GGCACAACTT
GACGTGGTGA TTCTGGAAGT AGGGCTGGGC GGTCGTCTGG ACGCAACCAA TATTGTCGAC
GCCGATGTCG CGGTAGTAAC CAGTATTGCG CTGGATCATA CCGACTGGCT GGGTCCAGAT
CGCGAAAGTA TTGGTCGCGA GAAAGCAGGC ATCTTCCGCA GCGAAAAACC GGCAATTGTC
GGTGAGCCGG AAATGCCTTC TACCATTGCT GATGTGGCGC AGGAAAAAGG TGCACTGTTA
CAACGTCGGG GCGTTGAGTG GAACTATTCC GTCACCGATC ATGACTGGGC GTTTAGCGAT
GCTCACGGCA CGCTGGAAAA TCTGCCGTTG CCGCTTGTCC CGCAACCGAA TGCCGCAACA
GCGCTGGCGG CACTGCGTGC CAGCGGGCTG GAAGTCAGTG AAAATGCCAT TCGCGACGGG
ATTGCCAGCG CAATTTTGCC GGGACGTTTC CAGATTGTGA GCGAGTCGCC ACGCGTTATT
TTTGATGTCG CGCATAATCC ACATGCGGCG GAATATCTCA CCGGGCGTAT GAAAGCGCTA
CCGAAAAACG GGCGCGTGCT GGCGGTTATC GGTATGCTAC ATGATAAAGA TATTGCCGGA
ACTCTGGCCT GGTTGAAAAG CGTGGTTGAT GACTGGTATT GTGCGCCACT GGAAGGGCCG
CGCGGTGCCA CGGCAGAACA ACTGCTTGAG CATTTGGGTA ACGGCAAATC ATTTGATAGC
GTTGCGCAGG CATGGGATGC CGCAATGGCG GACGCTAAAG CGGAAGACAC CGTGCTGGTG
TGTGGTTCTT TCCACACGGT CGCACATGTC ATGGAAGTGA TTGACGCGAG GAGAAGCGGT
GGCAAGTAA
 
Protein sequence
MIIKRTPQAA SPLASWLSYL ENLHSKTIDL GLERVSLVAA RLGVLKPAPF VFTVAGTNGK 
GTTCRTLESI LMAAGYKVGV YSSPHLVRYT ERVRVQGQEL PESAHTASFA EIESARGDIS
LTYFEYGTLS ALWLFKQAQL DVVILEVGLG GRLDATNIVD ADVAVVTSIA LDHTDWLGPD
RESIGREKAG IFRSEKPAIV GEPEMPSTIA DVAQEKGALL QRRGVEWNYS VTDHDWAFSD
AHGTLENLPL PLVPQPNAAT ALAALRASGL EVSENAIRDG IASAILPGRF QIVSESPRVI
FDVAHNPHAA EYLTGRMKAL PKNGRVLAVI GMLHDKDIAG TLAWLKSVVD DWYCAPLEGP
RGATAEQLLE HLGNGKSFDS VAQAWDAAMA DAKAEDTVLV CGSFHTVAHV MEVIDARRSG
GK
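
The protein sequence above corresponds to the gene sequence translated with translation table 11. As a minimal sketch of that relationship, the code below builds a codon table from the conventional NCBI base ordering (T, C, A, G) and translates a DNA string codon by codon until the first stop codon. The function name `translate` and the fallback symbol `X` are illustrative choices, not part of the record. The sketch reads the sequence as given and does not handle alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Amino acids for translation table 11, listed in NCBI order:
# first base varies slowest, third base fastest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record)
    is ignored. Codons not in the table are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGATTATCA AACGCACTCC TCAAGCCGCG"))  # MIIKRTPQAA
# Last 9 bases of the gene sequence, ending in the stop codon TAA.
print(translate("GGCAAGTAA"))  # GK
```

The two outputs match the first ten and last two residues of the protein sequence listed above.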