Gene EcDH1_1202 details


Gene Information

Locus tag: EcDH1_1202
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1298237
End bp: 1300216
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: glutamate synthase, small subunit
Protein accession: ACX38876
Protein GI: 260448454
COG category:
COG ID:
TIGRFAM ID:
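
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Number of codons in the CDS minus one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1298237, 1300216)
protein = expected_protein_length_aa(length)
print(length, protein)  # prints: 1980 659
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.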


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGTT TTATTATGGC CAACAGTCAG CAATGTCTGG GTTGTCATGC TTGTGAAATC 
GCCTGTGTCA TGGCTCACAA TGATGAGCAA CATGTCCTGA GCCAACACCA TTTTCATCCC
CGAATTACGG TTATCAAACA TCAACAGCAA CGTAGTGCAG TGACCTGTCA CCATTGTGAA
GATGCGCCCT GCGCCCGTAG CTGCCCTAAT GGCGCAATCA GCCACGTTGA TGACAGCATT
CAGGTCAATC AGCAAAAGTG TATTGGCTGT AAATCCTGCG TGGTGGCCTG TCCTTTTGGT
ACGATGCAAA TCGTCCTGAC ACCCGTCGCG GCAGGAAAAG TAAAAGCCAC GGCGCATAAA
TGCGACCTTT GTGCGGGGCG CGAAAACGGT CCTGCCTGTG TTGAGAATTG CCCGGCGGAC
GCGCTGCAAC TGGTCACTGA CGTCGCACTC TCCGGCATGG CGAAATCCCG CCGCTTGCGC
ACCGCGCGTC AGGAACATCA ACCGTGGCAT GCCAGTACCG CGGCGCAAGA AATGCCGGTA
ATGAGTAAAG TCGAACAAAT GCAGGCAACG CCCGCGCGTG GCGAGCCGGA TAAACTGGCG
ATTGAAGCGC GCAAAACCGG TTTTGATGAA ATTTATCTGC CATTTCGCGC CGACCAGGCA
CAACGGGAAG CCTCGCGCTG CCTTAAGTGC GGCGAGCACA GCGTTTGTGA ATGGACCTGC
CCGCTGCATA ACCATATACC GCAGTGGATT GAACTGGTGA AAGCCGGAAA CATCGACGCC
GCCGTCGAGC TTTCTCACCA GACCAACACC CTGCCGGAAA TTACCGGACG CGTTTGTCCG
CAAGACCGTT TGTGTGAAGG TGCCTGTACT ATTCGCGATG AGCACGGCGC GGTAACTATC
GGCAACATTG AACGCTACAT TTCAGATCAG GCGTTGGCGA AAGGTTGGCG TCCTGACTTA
AGCCATGTCA CCAAAGTGGA CAAGCGGGTG GCGATTATCG GTGCAGGTCC GGCAGGGCTG
GCCTGTGCGG ATGTTCTGAC CCGCAATGGC GTGGGGGTGA CGGTGTACGA TCGCCATCCA
GAAATCGGTG GCTTGCTCAC TTTCGGCATT CCTTCTTTCA AACTGGATAA ATCCCTGCTG
GCACGCCGTC GGGAAATCTT CAGCGCGATG GGGATTCACT TCGAACTCAA TTGTGAAGTG
GGTAAAGATG TCTCTTTGGA TTCGCTTTTG GAACAATACG ACGCGGTCTT CGTTGGCGTA
GGCACTTACC GTTCCATGAA AGCGGGTTTA CCCAATGAAG ATGCGCCGGG CGTTTATGAC
GCGCTGCCGT TCCTCATTGC CAACACTAAA CAGGTGATGG GGCTCGAAGA GCTACCGGAA
GAGCCGTTTA TCAATACCGC CGGACTTAAC GTCGTGGTAC TGGGCGGCGG CGACACCGCG
ATGGACTGTG TGCGTACCGC ACTGCGCCAC GGCGCGAGTA ACGTCACCTG CGCTTATCGT
CGTGATGAAG CTAACATGCC AGGCTCGAAG AAAGAAGTGA AGAACGCCCG CGAAGAGGGG
GCCAACTTCG AATTTAACGT CCAGCCGGTG GCGCTTGAGC TGAATGAACA AGGTCACGTC
TGCGGGATTC GTTTCCTGCG CACGCGTCTT GGAGAGCCGG ATGCCCAGGG GCGTCGGCGT
CCAGTGCCGG TGGAAGGCAG TGAATTTGTC ATGCCAGCCG ACGCGGTGAT TATGGCGTTT
GGCTTCAATC CGCACGGGAT GCCGTGGCTG GAGTCGCACG GTGTAACGGT AGACAAATGG
GGCCGCATCA TCGCGGATGT GGAAAGCCAG TACCGTTACC AGACCACCAA TCCGAAAATC
TTCGCTGGTG GTGACGCCGT GCGTGGTGCG GATCTGGTGG TTACCGCAAT GGCAGAAGGA
CGTCATGCGG CACAGGGGAT TATTGACTGG CTGGGGGTAA AATCAGTCAA ATCTCACTGA
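
The gene sequence is laid out in blocks of 10 bases, so the spaces and line breaks need to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first line of the sequence is used as input, so the result is not expected to match the whole-gene GC content field unless the full sequence is passed in.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAATCGTT TTATTATGGC CAACAGTCAG CAATGTCTGG GTTGTCATGC TTGTGAAATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```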
 
Protein sequence
MNRFIMANSQ QCLGCHACEI ACVMAHNDEQ HVLSQHHFHP RITVIKHQQQ RSAVTCHHCE 
DAPCARSCPN GAISHVDDSI QVNQQKCIGC KSCVVACPFG TMQIVLTPVA AGKVKATAHK
CDLCAGRENG PACVENCPAD ALQLVTDVAL SGMAKSRRLR TARQEHQPWH ASTAAQEMPV
MSKVEQMQAT PARGEPDKLA IEARKTGFDE IYLPFRADQA QREASRCLKC GEHSVCEWTC
PLHNHIPQWI ELVKAGNIDA AVELSHQTNT LPEITGRVCP QDRLCEGACT IRDEHGAVTI
GNIERYISDQ ALAKGWRPDL SHVTKVDKRV AIIGAGPAGL ACADVLTRNG VGVTVYDRHP
EIGGLLTFGI PSFKLDKSLL ARRREIFSAM GIHFELNCEV GKDVSLDSLL EQYDAVFVGV
GTYRSMKAGL PNEDAPGVYD ALPFLIANTK QVMGLEELPE EPFINTAGLN VVVLGGGDTA
MDCVRTALRH GASNVTCAYR RDEANMPGSK KEVKNAREEG ANFEFNVQPV ALELNEQGHV
CGIRFLRTRL GEPDAQGRRR PVPVEGSEFV MPADAVIMAF GFNPHGMPWL ESHGVTVDKW
GRIIADVESQ YRYQTTNPKI FAGGDAVRGA DLVVTAMAEG RHAAQGIIDW LGVKSVKSH
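
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted, so a single lookup table is used here. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAATCGTT TTATTATGGC CAACAGTCAG"))  # prints: MNRFIMANSQ
```

The output corresponds to the first 10-residue block of the protein sequence above.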