Gene EcolC_0767 details


Gene Information

Locus tag: EcolC_0767
Symbol:
ID: 6064899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 819904
End bp: 820854
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 53%
IMG OID: 641600171
Product: glutathione synthetase
Protein accession: YP_001723766
Protein GI: 170018812
COG category: [H] Coenzyme transport and metabolism; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase)
TIGRFAM ID: [TIGR01380] glutathione synthetase, prokaryotic
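
The start, end, and length fields in this record can be checked against one another with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither is stated in the record. The function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(819904, 820854)
print(length_bp)                  # 951
print(protein_length(length_bp))  # 316
```

Both results agree with the Gene Length and Protein Length fields of the record.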


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0268001
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000369912
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCAAGC TCGGCATCGT GATGGACCCC ATCGCAAACA TCAACATCAA GAAAGATTCC 
AGTTTTGCTA TGTTGCTGGA AGCACAGCGT CGTGGTTACG AACTTCACTA TATGGAGATG
GGCGATCTGT ATCTGATCAA TGGTGAAGCC CGCGCCCATA CCCGCACGCT GAACGTGAAG
CAGAACTACG AAGAGTGGTT TTCGTTCGTC GGTGAACAGG ATCTGCCGCT GGCCGATCTC
GATGTGATCC TGATGCGTAA AGACCCGCCG TTTGATACCG AGTTTATCTA CGCGACCTAT
ATTCTGGAAC GTGCCGAAGA GAAAGGGACG CTGATCGTTA ACAAGCCGCA GAGCCTGCGC
GACTGTAACG AGAAACTGTT TACCGCCTGG TTCTCTGACT TAACGCCAGA AACGCTGGTT
ACGCGCAATA AAGCGCAGCT AAAAGCGTTC TGGGAGAAAC ACAGCGACAT CATTCTTAAG
CCGCTGGACG GTATGGGCGG CGCGTCGATT TTCCGCGTGA AAGAAGGCGA TCCAAACCTC
GGCGTGATTG CCGAAACCCT GACTGAGCAT GGCACTCGCT ACTGCATGGC GCAAAATTAC
CTGCCAGCCA TTAAAGATGG CGACAAACGC GTGCTGGTGG TGGATGGCGA GCCGGTACCG
TACTGCCTGG CGCGTATTCC GCAGGGGGGC GAAACCCGTG GCAATCTGGC TGCCGGTGGT
CGCGGTGAAC CTCGTCCGCT GACGGAAAGT GACTGGAAAA TCGCCCGTCA GATCGGGCCG
ACGCTGAAAG AAAAAGGGCT GATTTTTGTT GGTCTGGATA TCATCGGCGA CCGTCTGACT
GAAATTAACG TCACCAGCCC AACCTGTATT CGTGAGATTG AAGCAGAGTT TCCGGTGTCG
ATCACCGGAA TGTTAATGGA TGCCATCGAA GCACGTTTAC AGCAGCAGTA A
 
Protein sequence
MIKLGIVMDP IANINIKKDS SFAMLLEAQR RGYELHYMEM GDLYLINGEA RAHTRTLNVK 
QNYEEWFSFV GEQDLPLADL DVILMRKDPP FDTEFIYATY ILERAEEKGT LIVNKPQSLR
DCNEKLFTAW FSDLTPETLV TRNKAQLKAF WEKHSDIILK PLDGMGGASI FRVKEGDPNL
GVIAETLTEH GTRYCMAQNY LPAIKDGDKR VLVVDGEPVP YCLARIPQGG ETRGNLAAGG
RGEPRPLTES DWKIARQIGP TLKEKGLIFV GLDIIGDRLT EINVTSPTCI REIEAEFPVS
ITGMLMDAIE ARLQQQ
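
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds a codon table from the NCBI amino-acid string, which is the same for the standard code and for translation table 11 (the two differ only in permitted start codons), strips the spaces used to group the sequence into blocks of ten, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-letter grouping and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 51 bases of the gene sequence listed above.
print(translate("ATGATCAAGC TCGGCATCGT GATGGACCCC"))  # MIKLGIVMDP
print(translate("ATCACCGGAA TGTTAATGGA TGCCATCGAA GCACGTTTAC AGCAGCAGTA A"))  # ITGMLMDAIEARLQQQ
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_percent` on the same input can be compared with the GC content field of the record.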