Gene EcHS_A3046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3046 
SymbolaegA2 
ID5594044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3056526 
End bp3058445 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content53% 
IMG OID640922163 
Productputative oxidoreductase Fe-S binding subunit 
Protein accessionYP_001459665 
Protein GI157162347 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
[COG1142] Fe-S-cluster-containing hydrogenase components 2 
TIGRFAM ID[TIGR01318] glutamate synthase small subunit family protein, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGT TTATCGCTGC TGAAGCTGCG GAATGTATAG GCTGTCATGC TTGTGAAATT 
GCCTGTGCGG TGGCACACAA TCAAGAAAAC TGGCCGCTGA GTCACAGTGA CTTTCGACCG
CGTATCCACG TTGTAGGGAA AGGCCAGGCT GCGAATCCGG TGGCCTGCCA TCACTGCAAC
AATGCCCCTT GCGTTACGGC TTGTCCGGTT AATGCTCTGA CTTTCCAGTC CGATAGCGTA
CAACTGGACG AGCAAAAATG TATTGGTTGT AAAAGATGCG CAATCGCTTG CCCCTTTGGC
GTCGTTGAGA TGGTCGATAC GATTGCGCAG AAATGCGACC TTTGTAACCA GCGCAGTTCC
GGCACGCAAG CCTGTATTGA AGTCTGCCCA ACACAGGCGT TACGACTGAT GGACGATAAA
GGGTTACAGC AGATAAAGGT GGCCCGCCAG CGCAAAACGG CAGCAGGAAA AGCGTCATCA
GACGCTCAGC CATCTCGCAG TGCAGCGTTG CTCCCCGTTA ACTCGCGTAA AGGCGCAGAT
AAAATTTCAG CGAGTGAACG GAAAACCCAC TTTGGCGAAA TATATTGCGG GCTGGATCCA
CAACAAGCGA CTTATGAGAG TGACCGCTGT GTTTATTGTG CCGAAAAAGC TAACTGCAAC
TGGCATTGCC CGCTGCATAA CGCTATTCCG GATTACATCC GTCTGGTACA GGAAGGAAAG
ATTATTGAAG CGGCAGAACT TTGCCACCAG ACCAGTTCCT TACCAGAAAT CTGCGGCAGG
GTATGTCCAC AGGACCGTCT TTGTGAAGGC GCATGTACTT TGAAAGATCA CTCTGGCGCA
GTCACTATCG GTAATCTGGA ACGCTACATC ACCGATACCG CGCTGGCGAT GGGCTGGTGT
CCCGATGTCA GCAAAGTTGT TCCCCGTAGC GAAAAAGTGG CGGTGATTGG CGCTGGGCCA
GCAGGGTTAG GGTGTGCTGA TATTCTGGCG CGCGCAGGAG TTCAGGTTGA TGTCTTTGAT
CGCCATCCAG AAATTGGCGG TATGCTGACT TTTGGCATAC CTCCTTTCAA ACTCGATAAA
ACGGTATTAA GCCAGCGGCG AGAGATATTC ACCGCAATGG GAATCGACTT TCATCTTAAC
TGTGAAATTG GCCGCGATAT TACCTTTAGC GATTTAACTT CTGAATATGA TGCAGTTTTC
ATCGGCGTGG GGACTTACGG GATGATGCGA GCTGATCTGC CGCATGAAGA TGCGCCCGGT
GTCATTCAGG CTCTACCGTT CTTGACTGCC CATACCCGCC AGCTCATGGG GTTGCCGGAG
TCTGAAGAGT ATCCGCTGAC GGACGTGGAA GGTAAGCGAG TCGTGGTATT GGGCGGTGGC
GATACGACAA TGGATTGTTT GCGGACTTCC ATCCGCCTCA ATGCCGCCAG CGTGACCTGC
GCGTATCGTC GTGATGAAGT CAGTATGCCG GGCTCGCGCA AAGAGGTGGT CAATGCGCGC
GAGGAAGGTG TCGAGTTTCA ATTCAATGTT CAGCCGCAAT ATATCGCTTG TGACGAAGAT
GGGCGTTTAA CTGCGGTGGG CCTGATTCGT ACCGCCATGG GTGAGCCGGG GCCGGATGGT
CGCCGTCGTC CTCGTCCGGT TGCGGGTTCA GAGTTTGAAT TGCCCGCAGA TGTTCTCATT
ATGGCCTTTG GTTTCCAGGC GCATGCCATG CCGTGGTTGC AGGGCAGCGG AATTAAACTC
GATAAATGGG GCCTGATTCA AACCGGCGAC GTTGGGTATT TACCTACCCA GACGCATCTG
AAAAAAGTCT TTGCTGGTGG GGATGCAGTT CATGGCGCGG ATCTGGTTGT CACTGCCATG
GCCGCAGGAA GGCAGGCGGC GCGCGATATG TTAACTCTGT TTGATACGAA GGCATCGTGA
 
Protein sequence
MNKFIAAEAA ECIGCHACEI ACAVAHNQEN WPLSHSDFRP RIHVVGKGQA ANPVACHHCN 
NAPCVTACPV NALTFQSDSV QLDEQKCIGC KRCAIACPFG VVEMVDTIAQ KCDLCNQRSS
GTQACIEVCP TQALRLMDDK GLQQIKVARQ RKTAAGKASS DAQPSRSAAL LPVNSRKGAD
KISASERKTH FGEIYCGLDP QQATYESDRC VYCAEKANCN WHCPLHNAIP DYIRLVQEGK
IIEAAELCHQ TSSLPEICGR VCPQDRLCEG ACTLKDHSGA VTIGNLERYI TDTALAMGWC
PDVSKVVPRS EKVAVIGAGP AGLGCADILA RAGVQVDVFD RHPEIGGMLT FGIPPFKLDK
TVLSQRREIF TAMGIDFHLN CEIGRDITFS DLTSEYDAVF IGVGTYGMMR ADLPHEDAPG
VIQALPFLTA HTRQLMGLPE SEEYPLTDVE GKRVVVLGGG DTTMDCLRTS IRLNAASVTC
AYRRDEVSMP GSRKEVVNAR EEGVEFQFNV QPQYIACDED GRLTAVGLIR TAMGEPGPDG
RRRPRPVAGS EFELPADVLI MAFGFQAHAM PWLQGSGIKL DKWGLIQTGD VGYLPTQTHL
KKVFAGGDAV HGADLVVTAM AAGRQAARDM LTLFDTKAS