Gene EcolC_1019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1019 
Symbol 
ID6066930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1107624 
End bp1109180 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content51% 
IMG OID641600432 
Productglutamate--cysteine ligase 
Protein accessionYP_001724015 
Protein GI170019061 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2918] Gamma-glutamylcysteine synthetase 
TIGRFAM ID[TIGR01434] glutamate--cysteine ligase
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.145998 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0132259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCCCGG ACGTATCACA GGCGCTGGCC TGGCTGGAAA AACATCCTCA GGCGTTAAAG 
GGGATACAGC GTGGGCTGGA GCGCGAAACT TTGCGTGTTA ATGCTGATGG CACACTGGCA
ACAACAGGTC ATCCTGAAGC ATTAGGTTCC GCACTGACGC ACAAATGGAT TACTACCGAT
TTTGCGGAAG CATTGCTGGA ATTCATTACA CCAGTGGATG GTGATATTGA ACATATGCTG
ACCTTTATGC GCGATCTGCA TCGTTATACG GCGCGCAATA TGGGCGATGA GCGGATGTGG
CCGTTAAGTA TGCCATGCTA CATCGCAGAA GGTCAGGACA TCGAACTGGC ACAGTACGGC
ACTTCTAACA CCGGACGCTT TAAAACGCTG TATCGTGAAG GGCTGAAAAA TCGCTACGGC
GCGCTGATGC AAACCATTTC CGGCGTGCAC TACAATTTCT CTTTGCCAAT GGCATTCTGG
CAAGCGAAGT GCGGTGATAT CTCGGGCGCT GATGCCAAAG AGAAAATTTC TGCGGGCTAT
TTCCGCGTTA TCCGCAATTA CTATCGTTTC GGTTGGGTCA TTCCTTATCT GTTTGGTGCA
TCTCCGGCGA TTTGTTCTTC TTTCCTGCAA GGAAAACCAA CGTCGCTGCC GTTTGAGAAA
ACCGAGTGCG GTATGTATTA CCTGCCGTAT GCGACCTCTC TTCGTTTGAG CGATCTCGGC
TATACCAATA AATCGCAAAG CAATCTTGGT ATTACCTTCA ACGATCTTTA CGAGTACGTA
GCGGGCCTTA AACAGGCAAT CAAAACGCCA TCGGAAGAGT ACGCGAAGAT TGGTATTGAG
AAAGACGGTA AGAGGCTGCA AATCAACAGC AACGTGTTGC AGATTGAAAA CGAACTGTAC
GCGCCGATTC GTCCAAAACG CGTTACCCGC AGCGGCGAGT CGCCTTCTGA TGCGCTGTTA
CGTGGCGGCA TTGAATATAT TGAAGTGCGT TCGCTGGACA TCAACCCGTT CTCGCCGATT
GGTGTAGATG AACAGCAGGT GCGATTCCTC GACCTGTTTA TGGTCTGGTG TGCGCTGGCT
GATGCACCGG AAATGAGCAG TAGCGAACTT GCCTGTACAC GCGTTAACTG GAACCGGGTG
ATCCTCGAAG GTCGCAAACC GGGTCTGACG CTGGGTATCG GCTGCGAAAC CGCACAGTTC
CCGTTACCGC AGGTGGGTAA AGATCTGTTC CGCGATCTGA AACGCGTCGC GCAAACGCTG
GATAGTATTA ACGGCGGCGA AGCGTATCAG AAAGTGTGTG ATGAACTGGT TGCCTGCTTC
GATAATCCCG ATCTGACTTT CTCTGCCCGT ATCTTAAGGT CTATGATTGA TACTGGTATT
GGCGGAACAG GCAAAGCATT TGCAGAAGCC TACCGTAATC TGCTGCGTGA AGAGCCGCTG
GAAATTCTGC GCGAAGAGGA TTTTGTAGCC GAGCGCGAGG CGTCTGAACG CCGTCAGCAG
GAAATGGAAG CCGCTGATAC CGAACCGTTT GCGGTGTGGC TGGAAAAACA CGCCTGA
 
Protein sequence
MIPDVSQALA WLEKHPQALK GIQRGLERET LRVNADGTLA TTGHPEALGS ALTHKWITTD 
FAEALLEFIT PVDGDIEHML TFMRDLHRYT ARNMGDERMW PLSMPCYIAE GQDIELAQYG
TSNTGRFKTL YREGLKNRYG ALMQTISGVH YNFSLPMAFW QAKCGDISGA DAKEKISAGY
FRVIRNYYRF GWVIPYLFGA SPAICSSFLQ GKPTSLPFEK TECGMYYLPY ATSLRLSDLG
YTNKSQSNLG ITFNDLYEYV AGLKQAIKTP SEEYAKIGIE KDGKRLQINS NVLQIENELY
APIRPKRVTR SGESPSDALL RGGIEYIEVR SLDINPFSPI GVDEQQVRFL DLFMVWCALA
DAPEMSSSEL ACTRVNWNRV ILEGRKPGLT LGIGCETAQF PLPQVGKDLF RDLKRVAQTL
DSINGGEAYQ KVCDELVACF DNPDLTFSAR ILRSMIDTGI GGTGKAFAEA YRNLLREEPL
EILREEDFVA EREASERRQQ EMEAADTEPF AVWLEKHA