Gene EcDH1_0494 details

Gene Information

Locus tag: EcDH1_0494
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 523148
End bp: 524566
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: glutamate synthase, small subunit
Protein accession: ACX38182
Protein GI: 260447760
COG category:
COG ID:
TIGRFAM ID:
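
The gene and protein lengths listed in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 523148
end_bp = 524566

# Inclusive span: both endpoints belong to the gene.
gene_length = end_bp - start_bp + 1

# Number of complete codons, including the terminal stop codon.
codons = gene_length // 3

# Assumption: the stop codon is not translated, so it is excluded.
protein_length = codons - 1

print(gene_length, protein_length)  # 1419 472
```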


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCAGA ATGTTTATCA ATTTATCGAC CTGCAGCGCG TTGATCCGCC AAAGAAACCG 
CTGAAGATCC GCAAAATTGA GTTTGTTGAA ATTTACGAGC CGTTTTCCGA AGGCCAGGCC
AAAGCGCAGG CTGACCGCTG CCTGTCGTGC GGCAACCCAT ACTGCGAGTG GAAATGCCCG
GTACACAACT ACATCCCGAA CTGGCTGAAG CTCGCCAACG AGGGGCGTAT TTTTGAAGCG
GCGGAACTGT CGCACCAGAC CAACACCCTG CCGGAAGTTT GCGGACGAGT CTGCCCGCAA
GACCGTCTGT GCGAAGGTTC CTGCACTCTG AACGATGAGT TTGGCGCGGT GACCATCGGC
AACATTGAGC GCTATATCAA CGATAAAGCG TTCGAGATGG GCTGGCGTCC GGATATGTCT
GGTGTGAAAC AGACCGGTAA AAAAGTGGCG ATTATCGGCG CAGGCCCGGC AGGTCTGGCG
TGTGCGGATG TCCTGACGCG TAACGGCGTA AAAGCCGTTG TCTTCGACCG TCATCCAGAA
ATTGGCGGGC TGCTGACCTT CGGTATTCCG GCCTTCAAGC TGGAAAAAGA GGTAATGACG
CGTCGCCGTG AAATCTTCAC CGGCATGGGT ATTGAATTCA AACTCAATAC CGAAGTGGGC
CGCGACGTAC AGCTGGACGA TCTGCTGAGT GATTACGATG CCGTGTTCCT TGGCGTCGGG
ACTTATCAGT CAATGCGCGG CGGGCTGGAA AACGAAGACG CCGATGGCGT GTACGCAGCG
CTGCCGTTCC TCATCGCCAA CACCAAACAG TTAATGGGCT TTGGTGAAAC CCGCGACGAA
CCGTTCGTCA GCATGGAAGG CAAACGCGTG GTGGTCCTTG GCGGTGGCGA CACTGCGATG
GACTGCGTGC GTACGTCCGT GCGCCAGGGA GCGAAGCACG TTACCTGTGC CTATCGTCGT
GATGAAGAGA ACATGCCGGG TTCCCGCCGC GAAGTGAAAA ACGCGCGGGA AGAAGGCGTA
GAGTTCAAAT TCAACGTCCA GCCGCTGGGT ATTGAAGTGA ACGGTAACGG CAAAGTCAGC
GGCGTAAAAA TGGTGCGTAC CGAAATGGGC GAACCGGACG CCAAAGGCCG TCGCCGCGCG
GAGATCGTTG CAGGTTCCGA ACATATCGTT CCGGCAGATG CGGTGATCAT GGCGTTTGGT
TTCCGTCCAC ACAACATGGA ATGGCTGGCA AAACACAGCG TCGAGCTGGA TTCACAAGGC
CGCATCATCG CCCCGGAAGG CAGCGACAAC GCCTTCCAGA CCAGCAACCC GAAAATCTTT
GCTGGCGGCG ATATCGTCCG TGGTTCCGAT CTGGTGGTGA CCGCTATTGC CGAAGGTCGT
AAGGCGGCAG ACGGTATTAT GAACTGGCTG GAAGTTTAA
 
Protein sequence
MSQNVYQFID LQRVDPPKKP LKIRKIEFVE IYEPFSEGQA KAQADRCLSC GNPYCEWKCP 
VHNYIPNWLK LANEGRIFEA AELSHQTNTL PEVCGRVCPQ DRLCEGSCTL NDEFGAVTIG
NIERYINDKA FEMGWRPDMS GVKQTGKKVA IIGAGPAGLA CADVLTRNGV KAVVFDRHPE
IGGLLTFGIP AFKLEKEVMT RRREIFTGMG IEFKLNTEVG RDVQLDDLLS DYDAVFLGVG
TYQSMRGGLE NEDADGVYAA LPFLIANTKQ LMGFGETRDE PFVSMEGKRV VVLGGGDTAM
DCVRTSVRQG AKHVTCAYRR DEENMPGSRR EVKNAREEGV EFKFNVQPLG IEVNGNGKVS
GVKMVRTEMG EPDAKGRRRA EIVAGSEHIV PADAVIMAFG FRPHNMEWLA KHSVELDSQG
RIIAPEGSDN AFQTSNPKIF AGGDIVRGSD LVVTAIAEGR KAADGIMNWL EV
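
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python and is not the tool that produced this record. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in TCAG order; amino-acid string for the standard code,
# assumed here to match the assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First 30 bases and the final (in-frame) line of the gene sequence.
first_block = "ATGAGTCAGA ATGTTTATCA ATTTATCGAC"
last_block = "AAGGCGGCAG ACGGTATTAT GAACTGGCTG GAAGTTTAA"

print(translate(first_block))  # MSQNVYQFID
print(translate(last_block))   # KAADGIMNWLEV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be compared with the computed values.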