Gene EcHS_A1896 details


Gene Information

Locus tag: EcHS_A1896
Symbol:
ID: 5592333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1909527
End bp: 1911437
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 55%
IMG OID: 640921039
Product: hypothetical protein
Protein accession: YP_001458590
Protein GI: 157161272
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1199] Rad3-related DNA helicases
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
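
The gene and protein lengths reported in this record follow from the Start bp and End bp coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, but both reproduce its values. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1909527, 1911437)
print(length_bp)                  # 1911
print(protein_length(length_bp))  # 636
```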


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGGACG ATTTTGCACC AGACGGTCAG CTGGCGAAAG CGATACCAGG CTTTAAGCCG 
CGAGAACCAC AGCGACAGAT GGCGGTAGCC GTCACCCAGG CGATAGAAAA AGGCCAGCCG
CTGGTGGTGG AAGCAGGAAC CGGTACGGGC AAAACCTACG CTTACCTGGC TCCTGCGCTG
CGGGCGAAAA AGAAAGTCAT TATCTCGACC GGCTCAAAAG CGTTGCAGGA TCAGCTCTAC
AGCCGCGATT TGCCAACAGT CTCAAAGGCA TTGAAATATA CGGGCAACGT GGCGCTGCTG
AAAGGGCGCT CAAACTACCT CTGCCTCGAA CGTCTCGAAC AGCAGGCGCT GGCGGGGGGC
GATCTGCCGG TACAAATCTT AAGCGATGTG ATCCTGCTGC GCTCCTGGTC TAATCAAACA
GTCGATGGTG ATATCAGCAC CTGCGTCAGC GTGGCGGAAG ATTCACAGGC GTGGCCGCTG
GTCACCAGCA CCAACGACAA CTGTCTTGGC AGCGACTGCC CGATGTATAA AGATTGCTTT
GTGGTCAAAG CACGTAAAAA AGCGATGGAC GCCGATGTGG TGGTGGTAAA CCATCATCTC
TTTCTGGCGG ATATGGTGGT TAAAGAGAGT GGATTTGGCG AACTGATCCC GGAAGCGGAC
GTCATGATCT TCGACGAAGC CCACCAGCTA CCGGACATTG CCAGCCAGTA TTTTGGTCAG
TCACTCTCCA GTCGACAACT GCTCGACCTG GCAAAAGACA TCACCATCGC CTACCGCACC
GAATTAAAAG ACACCCAGCA GTTACAAAAG TGCGCTGATC GTCTTGCCCA GAGTGCGCAG
GATTTTCGTC TGCAACTCGG TGAGCCAGGT TATCGCGGTA ACCTGCGTGA GCTGTTAGCT
AATCCGCAAA TTCAGCGGGC ATTTTTACTG CTCGATGACA CCCTGGAACT TTGTTATGAC
GTGGCGAAAC TGTCACTGGG GCGTTCCGCC TTGCTGGATG CGGCATTTGA GCGCGCCACG
TTGTATCGCA CACGGCTGAA GCGGCTAAAA GAGATCAATC AGCCGGGCTA CAGCTACTGG
TACGAATGCA CTTCGCGCCA TTTTACTCTG GCTCTCACGC CGCTCAGCGT GGCGGATAAA
TTCAAAGAGT TAATGGCGCA AAAGCCCGGT AGCTGGATCT TCACCTCAGC AACGCTGTCG
GTGAACGACG ATCTGCATCA TTTCACCTCG CGGCTTGGGA TAGAACAGGC GGAGTCGTTG
CTATTGCCAA GCCCGTTTGA TTACAGCCGC CAGGCGTTAC TCTGTGTGCC GCGCAATCTG
CCGCAAACCA ACCAGCCAGG TTCTGCTCGC CAGTTAGCGG CAATGCTGCG ACCGATCATC
GAAGCTAACA ACGGTCGTTG TTTTATGCTT TGTACCTCGC ACGCCATGAT GCGCGATCTG
GCCGAGCAGT TCCGCGCTAC CATGACGCTT CCCGTTTTGT TGCAGGGGGA AACCAGTAAA
GGCCAACTGT TGCAGCAGTT TGTCAGTGCC GGTAATGCGC TTCTTGTGGC AACCAGTAGT
TTCTGGGAAG GGGTGGACGT GCGTGGCGAT ACATTGTCAT TGGTAATTAT CGACAAATTG
CCGTTTACCT CGCCGGATGA TCCACTGTTA AAAGCGCGCA TGGAAGATTG TCGTTTGCGC
GGTGGCGACC CGTTCGATGA AGTGCAACTA CCAGATGCCG TCATTACTCT CAAACAGGGG
GTAGGGCGAC TGATTCGCGA CGCCGACGAT CGTGGCGTGC TGGTGATTTG TGACAATCGG
CTGGTGATGC GTCCTTACGG CGCGACGTTT CTCGCCAGTC TGCCGCCCGC GCCACGCACC
CGTGACATTG CCCGTGCGGT TCGTTTCCTT GCGATACCAT CCTCCAGGTA A
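
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases in the sequence; the record does not say how its value was computed or rounded. The function name is hypothetical.

```python
def gc_percent(blocked_seq: str) -> float:
    """Percent of G and C bases in a sequence that may contain spaces and newlines."""
    seq = "".join(blocked_seq.split()).upper()  # drop the block separators
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used here only as sample input.
first_line = "GTGACGGACG ATTTTGCACC AGACGGTCAG CTGGCGAAAG CGATACCAGG CTTTAAGCCG"
print(round(gc_percent(first_line), 1))
```

To check the whole gene, pass the full gene sequence text to `gc_percent` instead of the first line.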
 
Protein sequence
MTDDFAPDGQ LAKAIPGFKP REPQRQMAVA VTQAIEKGQP LVVEAGTGTG KTYAYLAPAL 
RAKKKVIIST GSKALQDQLY SRDLPTVSKA LKYTGNVALL KGRSNYLCLE RLEQQALAGG
DLPVQILSDV ILLRSWSNQT VDGDISTCVS VAEDSQAWPL VTSTNDNCLG SDCPMYKDCF
VVKARKKAMD ADVVVVNHHL FLADMVVKES GFGELIPEAD VMIFDEAHQL PDIASQYFGQ
SLSSRQLLDL AKDITIAYRT ELKDTQQLQK CADRLAQSAQ DFRLQLGEPG YRGNLRELLA
NPQIQRAFLL LDDTLELCYD VAKLSLGRSA LLDAAFERAT LYRTRLKRLK EINQPGYSYW
YECTSRHFTL ALTPLSVADK FKELMAQKPG SWIFTSATLS VNDDLHHFTS RLGIEQAESL
LLPSPFDYSR QALLCVPRNL PQTNQPGSAR QLAAMLRPII EANNGRCFML CTSHAMMRDL
AEQFRATMTL PVLLQGETSK GQLLQQFVSA GNALLVATSS FWEGVDVRGD TLSLVIIDKL
PFTSPDDPLL KARMEDCRLR GGDPFDEVQL PDAVITLKQG VGRLIRDADD RGVLVICDNR
LVMRPYGATF LASLPPAPRT RDIARAVRFL AIPSSR
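
The protein sequence starts with M although the gene sequence starts with GTG. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted start codon and is read as methionine at the initiation position. The sketch below translates a blocked nucleotide string under that table. It is an illustration, not the pipeline that produced this record, and `translate_cds` is a hypothetical name.

```python
from itertools import product

# NCBI codon order (T, C, A, G); table 11 shares its amino acid
# assignments with the standard code and differs in its start codons.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(blocked_seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(blocked_seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # any valid start codon initiates with methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGACGGACG ATTTTGCACC AGACGGTCAG CTGGCGAAAG CGATACCAGG CTTTAAGCCG"
print(translate_cds(first_line))  # MTDDFAPDGQLAKAIPGFKP
```

The output matches the first 20 residues of the protein sequence listed above.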