Gene EcolC_0822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0822 
Symbol 
ID6065417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp881812 
End bp883731 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content53% 
IMG OID641600227 
Productputative oxidoreductase Fe-S binding subunit 
Protein accessionYP_001723821 
Protein GI170018867 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
[COG1142] Fe-S-cluster-containing hydrogenase components 2 
TIGRFAM ID[TIGR01318] glutamate synthase small subunit family protein, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGT TTATCGCTGC TGAAGCTGCG GAATGTATAG GCTGTCATGC TTGTGAAATT 
GCCTGTGCGG TGGCACACAA TCAAGAAAAC TGGCCGCTGA GTCACAGTGA CTTTCGACCG
CGTATCCACG TTGTAGGGAA AGGCCAGGCT GCGAATCCGG TGGCCTGCCA TCACTGCAAC
AATGCCCCTT GCGTTACGGC TTGTCCGGTT AATGCTCTGA CTTTCCAGTC CGATAGCGTA
CAACTGGACG AGCAAAAATG TATTGGTTGT AAAAGATGCG CAATCGCTTG CCCCTTTGGC
GTCGTTGAGA TGGTCGATAC GATTGCGCAG AAATGCGACC TTTGTAACCA GCGCAGTTCC
GGCACGCAAG CCTGTATTGA AGTTTGCCCA ACACAGGCGT TACGACTGAT GGACGATAAA
GGGTTACAGC AGATAAAGGT GGCCCGCCAG CGCAAAACGG CAGCAGGAAA TGCGTCATCA
GACGCTCAGC CATCTCGCAG TGCAGCGTTG CTCCCCGTTA ACTCGCGTAA AGGCGCAGAT
AAAATTTCAG CAAGTGAACG GAAAAACCAC TTTGGCGAAA TCTATTGCGG GCTGGATCCA
CAACAAGCGA CTTATGAGAG TGACCGCTGT GTTTATTGTG CCGAAAAAGC TAACTGCAAC
TGGCATTGCC CGCTGCATAA CGCTATTCCG GATTACATCC GTCTGGTACA GGAGGGAAAG
ATTATTGAAG CGGCAGAACT TTGCCACCAG ACCAGTTCCT TACCAGAAAT CTGCGGCAGG
GTATGTCCAC AGGACCGTCT TTGTGAAGGC GCATGTACTT TGAAAGATCA CTCTGGCGCA
GTCACTATCG GTAATCTGGA ACGCTACATC ACCGATACCG CGCTGGCGAT GGGCTGGCGT
CCCGATGTCA GCAAAGTTGT TCCCCGTAGC GAAAAAGTGG CGGTGATTGG CGCTGGGCCA
GCAGGGTTAG GGTGTGCTGA TATTCTGGCG CGCGCAGGAG TTCAGGTTGA TGTCTTTGAT
CGCCATCCAG AAATTGGCGG TATGCTGACT TTTGGCATTC CTCCTTTCAA ACTCGATAAA
ACGGTATTAA GCCAGCGGCG AGAGATATTC ACCGCAATGG GAATCGACTT TCATCTCAAC
TGTGAAATTG GCCGCGATAT TACCTTTAGC GATTTAACTT CTGAATATGA TGCAGTTTTC
ATCGGCGTGG GGACTTACGG GATGATGCGA GCAGATCTGC CGCATGAAGA TGCGCCCGGT
GTCATTCAGG CTCTACCGTT CCTGACTGCC CATACCCGCC AGCTCATGGG ATTGCCGGAG
TCTGAAGAGT ATCCGCTGAC GGACGTGGAA GGTAAGCGAG TCGTGGTATT GGGCGGTGGC
GATACGACAA TGGATTGTTT GCGGACTTCC ATCCGCCTCA ATGCCGCCAG CGTGACCTGC
GCGTATCGTC GTGATGAAGT CAGTATGCCG GGCTCGCGCA AAGAGGTGGT TAATGCGCGC
GAGGAAGGTG TCGAGTTTCA TTTCAATGTT CAACCGCAAT ATATCGCTTG TGACGAAGAT
GGGCGTTTAA CTGCGGTGGG CCTGATTCGT ACCGCCATGG GTGAGCCGGG GCCGGATGGT
CGCCGTCGTC CTCGTCCGGT TGCGGGTTCA GAGTTTGAAT TGCCCGCCGA TGTTCTCATT
ATGGCCTTTG GTTTCCAGGC GCATGCCATG CCGTGGTTGC AGGGCAGCGG AATTAAACTC
GATAAATGGG GCCTGATTCA AACCGGTGAC GTCGGGTATT TACCTACCCA GACGCATCTG
AAAAAAGTCT TTGCTGGTGG GGATGCAGTT CATGGCGCGG ATCTGGTTGT CACTGCAATG
GCCGCAGGAA GGCAGGCGGC GCGCGATATG TTAACTCTGT TTGATACGAA GGCATCGTGA
 
Protein sequence
MNKFIAAEAA ECIGCHACEI ACAVAHNQEN WPLSHSDFRP RIHVVGKGQA ANPVACHHCN 
NAPCVTACPV NALTFQSDSV QLDEQKCIGC KRCAIACPFG VVEMVDTIAQ KCDLCNQRSS
GTQACIEVCP TQALRLMDDK GLQQIKVARQ RKTAAGNASS DAQPSRSAAL LPVNSRKGAD
KISASERKNH FGEIYCGLDP QQATYESDRC VYCAEKANCN WHCPLHNAIP DYIRLVQEGK
IIEAAELCHQ TSSLPEICGR VCPQDRLCEG ACTLKDHSGA VTIGNLERYI TDTALAMGWR
PDVSKVVPRS EKVAVIGAGP AGLGCADILA RAGVQVDVFD RHPEIGGMLT FGIPPFKLDK
TVLSQRREIF TAMGIDFHLN CEIGRDITFS DLTSEYDAVF IGVGTYGMMR ADLPHEDAPG
VIQALPFLTA HTRQLMGLPE SEEYPLTDVE GKRVVVLGGG DTTMDCLRTS IRLNAASVTC
AYRRDEVSMP GSRKEVVNAR EEGVEFHFNV QPQYIACDED GRLTAVGLIR TAMGEPGPDG
RRRPRPVAGS EFELPADVLI MAFGFQAHAM PWLQGSGIKL DKWGLIQTGD VGYLPTQTHL
KKVFAGGDAV HGADLVVTAM AAGRQAARDM LTLFDTKAS