Gene EcolC_2955 details


Gene Information

Locus tag: EcolC_2955
Symbol:
ID: 6065660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3222525
End bp: 3226718
Gene Length: 4194 bp
Protein Length: 1397 aa
Translation table: 11
GC content: 59%
IMG OID: 641602366
Product: YD repeat-containing protein
Protein accession: YP_001725908
Protein GI: 170020954
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
            [TIGR01643] YD repeat (two copies)
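
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
# Minimal sketch: cross-check the record's coordinates against its lengths.
# Assumes 1-based inclusive coordinates and one terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Number of residues encoded, excluding the stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(3222525, 3226718)
residues = protein_length_aa(length)
print(length, residues)
```

With the values listed for EcolC_2955, both results agree with the Gene Length and Protein Length fields.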


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGAA AACCGGCGGC GCGTCAGGGC GACATGACGC AGTATGGCGG TAGCATTGTT 
CAGGGTTCAG CCGGGGTACG TATTGGTGCC CCCACCGGCG TGGCCTGTTC GGTGTGCCCC
GGCGGGGTGA CGTCCGGCCA TCCGGTCAAT CCGCTGCTGG GTGCAAAGGT CCTTCCCGGT
GAAACCGACA TCGCCCTGCC CGGCCCGCTG CCGTTCATTC TCTCCCGCAC CTACAGCAGT
TACCGGACAA AAACGCCCGC GCCGGTGGGG AGCCTCGGCC CCGGCTGGAA AATGCCTGCG
GATATCCGCT TACAGCTGCG CGATAACACA CTGATACTCA GTGATAACGG CGGCAGAAGC
CTGTATTTTG AGCACCTGTT TCCCGGTGAG GACGGTTACA GCCGCAGCGA ATCACTCTGG
CTGGTGCGCG GCGGTGTGGC GAAACTGGAT GAAGGTCACC GGCTGGCCGC ACTCTGGCAG
GCGCTGCCGG AAGAGCTTCG CCTGAGTCCG CATCGTTATC TGGCGACAAA CAGTCCGCAG
GGGCCGTGGT GGGTACTCGG CTGGTGTGAG CGGGTGCCGG AAGCGGATGA GGTGCTGCCT
GCGCCGCTGC CGCCGTACCG GGTACTGACC GGGCTGGTGG ACCGCTTCGG GCGCACACAG
ACGTTCCACC ACGAAGCCGC CGGTGAATTC AGCGGCGAAA TCACCGGCGT GACGGATGGT
GCCGGGCGTC ACTTCCGGCT GGTACTGACC ACGCAGGCGC AGCGGGCAGA AGAAGCCCGG
CAGCAGGCCT CTTCCGGCGG GACGGAACCG TCCGCTTTTC CTGATACCCT GCCGGGTTAC
ACCGAATATG GCCGGGACAA CGGCATCCGT CTGTCTGCCG TGTGGCTGAC GCACGACCCG
GAATACCCGG AGAATTTACC TGCCGCGCCG CTGGTGCGCT ATGGCTGGAC GCCACGCGGC
GAACTGGCGG TGGTGTATGA CCGTAGTGGC AAACAGGTGC GCAGCTTTAC TTACGATGAT
AAATACCGGG GCCGGATGGT GGCGCACCGT CACACGGGCC GGCCGGAAAT CCGTTACCGT
TACGACAGCG ACGGGCGGGT GACAGAACAG CTAAACCCGG CAGGCTTAAG CTACACGTAT
CAGTATGAGA AAGACCGCAT CACCATCACC GACAGCCTGG ACCGCCGTGA AGTGCTGCAC
ACGCAGGGCG AAGCCGGGCT GAAGCGGGTG GTGAAAAAGG AACACGCGGA CGGCAGCGTC
ACGCAGAGTC AGTTTGACGC CGTGGGCAGG CTCAGGACAC AGACGGATGC CGCAGGCCGG
ACAACAGAAT ACAGCCCGGA TGTGGTGACG GGCCTCATCA CGCGCATCAC CACGCCGGAT
GGCAGGGCAT CGGCGTTTTA CTATAACCAC CACAGCCAGT TAACGTCAGC CACCGGGCCT
GACGGGCTGG AAATACGCCG GGAATATGAT GAATTGGGCC GTCTGATTCA GGAAACTGCC
CCTGACGGCG ATATCACCCG CTACCGTTAT GATAATCCAC ACAGTGACTT ACCCTGCGCA
ACGGAAGATG CCACCGGCAG CCGGAAAACC ATGACGTGGA GCCGTTACGG TCAGTTGCTG
AGCTTCACCG ACTGTTCCGG TTATGTAACC CGTTATGACC ATGACCGCTT CGGGCAGATG
ACGGCGGTGC ACCGCGAGGA AGGGCTGAGT CAGTACCGCG CATACGACAG CCGTGGACAG
TTAATTGCCG TGAAAGACAC GCAGGGCCAT GAAACGCGGT ATGAATACAA CATCGCCGGT
GACCTGACCG CCGTCATTGC CCCGGACGGC AGCAGAAACG GGACACAGTA CGATGCGTGG
GGAAAGGCCG TCCGTACCAC GCAGGGCGGG CTGACCCGTA GTATGGAATA CGATGCTGCC
GGACGGGTCA TCCGCCTGAC CAGTGAAAAC GGCAGCCACA CCACCTTCCG TTACGATGTA
CTCGACCGGC TGATACAGGA AACCGGCTTT GACGGCCGCA CACAGCGTTA TCACCACGAC
CTGACCGGCA AACTTATCCG CAGCGAGGAT GAGGGGCTGG TCACCCACTG GTACTATGAC
GAAGCAGACC GCCTCACTCA CCGCACCGTG AAGGGTGAAA CCGCAGAGCA GTGGCAGTAT
GACGAACGCG GCTGGCTGAC AGACATCAGC CATATCAGCG AAGGGCACCG GGTGGCGGTG
CACTATGGTT ATGACAGTAA AGGCCGCCTC GCCAGTGAAC ACCTGACGGT GCATCATCCG
CAGACGAATG AACTGCTCTG GCAGCATGAG ACCAGACATG CGTACAACGC ACAGGGACTG
GCGAACCGCT GTATACCGGA CAGCCTGCCC GCCGTGGAAT GGCTGACCTA CGGCAGCGGC
TGGCTGGCAG GCATGAAGCT CGGCGACACA CCGCTGGTGG ATTTCACCCG CGACCGCCTG
CACCGGGAAA CGCTGCGCAG CTTCGGTCGT TATGAACTCA CCACCGCTTA CACCCCTGCC
GGGCAGTTAC AGCGTCAGCA CCTGAACAGC CTGCAGTATG ACCGCGATTA CACCTGGAAC
GACAACGGCG AACTCATCCG CATCAGCAGC CCGCGCCAGA CCCGGAGTTA CAGCTACAGC
ACCACCGGCA GGCTGACCGG CGTTCACACC ACCGCAGCGA ATCTGGATAT CCGCATCCCG
TATGCCACAG ACCCGGCAGG TAACCGCCTG CCCGACCCGG AGCTGCACCC GGACAGCACC
CTCAGCATGT GGCCGGATAA CCGTATCGCC CGTGACGCGC ACTATCTTTA CCGGTATGAC
CGTCACGGCA GGCTGACAGA GAAAACCGAC CTCATCCCGG AAGGGGTTAT CCGCACGGAT
GATGAGCGCA CCCACCAGTA CCATTACGAC AGTCAGCACC GGCTGGTGCA CTACACGCGG
ACACAATATG CAGAGCCGCT GGTCGAAAGT CGCTATCTTT ACGACCCGCT GGGCCGCAGG
GTGGCAAAAC GGGTATGGCG ACGTGAACGG GACCTGACGG GCTGGATGTC GCTGTCACGG
AAACCGGAAG TGACCTGGTA CGGCTGGGAC GGCGACCGGC TGACCACGAT ACAGAACGAC
AGGAGCCGCA TCCAGACGAT TTATCAGCCG GGGAGCTTCA CGCCACTCAT CAGGGTCGAA
ACTGCCACCG GTGAGCTGGC GAAAACGCAG CGCCGCAGCC TGGCGGATGC CCTTCAGCAG
TCCGGCGGCG AAGACGGTGG CAGTGTGGTG TTCCCGCCGG TGCTGGTGCA GATGCTCGAC
CGGCTGGAAA GTGAAATCCT GGCTGACCGG GTGAGTGAGG AAAGCCGCCG CTGGCTGGCA
TCGTGCGGCC TGACCGTGGA GCAGATGCAA AACCAGATGG ACCCGGTGTA CACGCCGGCG
CGAAAAATCC ACCTGTACCA CTGCGACCAT CGCGGCCTGC CGCTGGCGCT TGTCAGCACG
GAAGGGGCAA CAGAATGGTG CGCAGAATAC GATGAATGGG GCAACCTGCT GAATGAAGAG
AACCCGCATC AGCTGCAGCA GCTTATCCGC CTGCCGGGGC AGCAGTATGA TGAGGAGTCC
GGCCTGTATT ACAACCGCCA CCGCTATTAT GACCCGCTGC AGGGGCGATA TATCACTCAG
GATCCGATTG GGCTGAAAGG GGGATGGAAT TTTTATCAGT ATCCGTTGAA TCCGATCTCA
AATATAGATC CATTAGGATT AGAAACACTA AAATGCATTA AGCCACTGCA TTCAATGGGC
GGAACTGGTG AAAGAAGCGG TCCAGATATA TGGGGGAATC CGTTCTATCA TCAATATCTT
TGTGTCCCAG ATGGTAAAGG GGACTATACT TGTGGTGGCC AAGACCAACG GGGAGAATCA
AAAGGAGATG GTCTATGGGG GCCAGGTAAA GCAAGTAATG ATACAAAAGA AGCTGCTGGC
CGTTGTGACC TCGTTGAAAC CGATAATAGT TGTGTGGAGA ACTGTTTAAA AGGGAAGTTT
AAAGAGGTAA GGCCGCGTTA TTCTGTATTG CCTGATATAT TCACACCTAT AAATTTAGGG
CTATTTAAAA ACTGCCAAGA CTGGTCTAAT GATTCTTTAG AAACATGTAA GATGAAGTGC
TCCGGAAATA ACATTGGACG TTTTATTAGA TTTGTATTCA CCGGAGTGAT GTAA
 
Protein sequence
MSGKPAARQG DMTQYGGSIV QGSAGVRIGA PTGVACSVCP GGVTSGHPVN PLLGAKVLPG 
ETDIALPGPL PFILSRTYSS YRTKTPAPVG SLGPGWKMPA DIRLQLRDNT LILSDNGGRS
LYFEHLFPGE DGYSRSESLW LVRGGVAKLD EGHRLAALWQ ALPEELRLSP HRYLATNSPQ
GPWWVLGWCE RVPEADEVLP APLPPYRVLT GLVDRFGRTQ TFHHEAAGEF SGEITGVTDG
AGRHFRLVLT TQAQRAEEAR QQASSGGTEP SAFPDTLPGY TEYGRDNGIR LSAVWLTHDP
EYPENLPAAP LVRYGWTPRG ELAVVYDRSG KQVRSFTYDD KYRGRMVAHR HTGRPEIRYR
YDSDGRVTEQ LNPAGLSYTY QYEKDRITIT DSLDRREVLH TQGEAGLKRV VKKEHADGSV
TQSQFDAVGR LRTQTDAAGR TTEYSPDVVT GLITRITTPD GRASAFYYNH HSQLTSATGP
DGLEIRREYD ELGRLIQETA PDGDITRYRY DNPHSDLPCA TEDATGSRKT MTWSRYGQLL
SFTDCSGYVT RYDHDRFGQM TAVHREEGLS QYRAYDSRGQ LIAVKDTQGH ETRYEYNIAG
DLTAVIAPDG SRNGTQYDAW GKAVRTTQGG LTRSMEYDAA GRVIRLTSEN GSHTTFRYDV
LDRLIQETGF DGRTQRYHHD LTGKLIRSED EGLVTHWYYD EADRLTHRTV KGETAEQWQY
DERGWLTDIS HISEGHRVAV HYGYDSKGRL ASEHLTVHHP QTNELLWQHE TRHAYNAQGL
ANRCIPDSLP AVEWLTYGSG WLAGMKLGDT PLVDFTRDRL HRETLRSFGR YELTTAYTPA
GQLQRQHLNS LQYDRDYTWN DNGELIRISS PRQTRSYSYS TTGRLTGVHT TAANLDIRIP
YATDPAGNRL PDPELHPDST LSMWPDNRIA RDAHYLYRYD RHGRLTEKTD LIPEGVIRTD
DERTHQYHYD SQHRLVHYTR TQYAEPLVES RYLYDPLGRR VAKRVWRRER DLTGWMSLSR
KPEVTWYGWD GDRLTTIQND RSRIQTIYQP GSFTPLIRVE TATGELAKTQ RRSLADALQQ
SGGEDGGSVV FPPVLVQMLD RLESEILADR VSEESRRWLA SCGLTVEQMQ NQMDPVYTPA
RKIHLYHCDH RGLPLALVST EGATEWCAEY DEWGNLLNEE NPHQLQQLIR LPGQQYDEES
GLYYNRHRYY DPLQGRYITQ DPIGLKGGWN FYQYPLNPIS NIDPLGLETL KCIKPLHSMG
GTGERSGPDI WGNPFYHQYL CVPDGKGDYT CGGQDQRGES KGDGLWGPGK ASNDTKEAAG
RCDLVETDNS CVENCLKGKF KEVRPRYSVL PDIFTPINLG LFKNCQDWSN DSLETCKMKC
SGNNIGRFIR FVFTGVM
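
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch, not the tool used to produce this record: it assumes the sequence is already in the coding orientation and in frame, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order. Amino acid assignments for translation
# table 11 match the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record can be
    pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGGAA AACCGGCGGC GCGTCAGGGC"))
```

Applied to the first 30 bases of the gene sequence, the routine yields the first ten residues of the protein sequence (MSGKPAARQG).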