Gene EcSMS35_1380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1380 
Symbol 
ID6145140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1366858 
End bp1368768 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content56% 
IMG OID641616258 
Producthypothetical protein 
Protein accessionYP_001743438 
Protein GI170682824 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1199] Rad3-related DNA helicases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.434814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.350106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGACG ATTTTGCACC AGACGGTCAG CTGGCGAAAG CGATACCAGG CTTTAAGCCG 
CGAGAACCAC AGCGACAGAT GGCGGTAGCC GTCACCCAGG CGATAGAAAA AGGCCAGCCG
CTGGTGGTGG AAGCCGGAAC CGGTACGGGC AAAACCTACG CTTACCTGGC CCCTGCGCTG
CGGGCGAAAA AGAAAGTCAT TATCTCGACC GGTTCAAAAG CGTTGCAGGA TCAGCTCTAC
AGCCGCGATT TGCCAACGGT CTCAAAGGCA TTGAAATACA CGGGCAACCT GGCGTTGCTG
AAAGGGCGCT CAAACTACCT CTGCCTCGAA CGTCTCGAAC AGCAGGCGCT GGCGGGGGGC
GATCTGCCGG TACAAATCTT AAGCGATGTG ATCCTGCTGC GCTCCTGGTC TAATCAAACA
GTCGATGGTG ATATCAGCAC CTGCGTCAGC GTGGCGGAAG ATTCACAGGC GTGGCCGCTG
GTCACCAGCA CCAACGATAA CTGCCTTGGC AGCGACTGCC CGATGTATAA AGATTGCTTT
GTGGTCAAAG CACGCAAAAA AGCGATGGAC GCCGATGTGG TGGTGGTAAA CCATCATCTC
TTTCTGGCGG ATATGGTGGT GAAAGAGAGT GGATTTGGCG AACTGATCCC GGAAGCTGAC
GTCATGATCT TCGACGAAGC CCACCAACTG CCCGACATTG CCAGCCAGTA TTTTGGTCAG
TCACTCTCCA GTCGACAACT GCTCGACCTG GCAAAAGACA TCACCATCGC CTACCGCACC
GAATTAAAAG ACACCCAGCA GTTACAAAAG TGCGCCGACC GCCTTGCCCA GAGCGCGCAG
GATTTTCGTC TGCAACTCGG TGAGCCTGGT TATCGTGGCA ACCTGCGCGA ACTGTTAGCT
AATCCGCAAA TTCAACGGGC GTTTTTACTG CTCGATGACA CCCTGGAACT TTGTTATGAC
GTGGCGAAAC TGTCGCTGGG GCGTTCCGCT TTGCTGGATG CGGCATTTGA GCGCGCCACG
TTGTATCGCA CGCGGCTGAA ACGGCTAAAA GAGATCAATC AGCCGGGCTA CAGCTACTGG
TACGAATGCA CTTCGCGCCA TTTTACTCTG GCACTCACGC CGCTCAGCGT GGCGGATAAA
TTCAAAGAGT TAATGGCGCA AAAACCCGGT AGCTGGATCT TTACCTCAGC AACGCTGTCG
GTGAACGACG ATCTGCATCA TTTCACCTCG CGGCTTGGCA TCGAACAGGC GGAGTCGTTG
CTGTTACCCA GCCCGTTTGA TTACAGCCGC CAGGCGTTAC TCTGTGTGCC GCGCAACCTG
CCGCAAACCA ATCAACCGGG CTCCGCACGG CAACTGGCGG CAATGTTGCG ACCGATCATC
GAAGCTAACA ACGGTCGTTG TTTTATGCTT TGTACCTCGC ACGCCATGAT GCGCGATCTG
GCTGAGCAGT TCCGCGCTAC CATGACGCTT CCCGTTTTGT TGCAGGGGGA AACCAGCAAA
GGGCAACTGT TGCAGCAATT TGTCAGCGCC GGTAACGCGC TTCTTGTGGC AACCAGCAGC
TTCTGGGAAG GGGTGGATGT GCGTGGCGAT ACATTGTCAT TGGTGATTAT CGACAAGTTG
CCGTTTACCT CACCGGATGA TCCACTATTA AAAGCGCGCA TGGAAGATTG CCGTTTGCGT
GGTGGTGACC CGTTCGATGA AGTACAACTA CCGGATGCGG TGATTACTCT CAAGCAGGGA
GTAGGGCGAC TGATTCGCGA CGCCGACGAT CGCGGGGTTT TGGTGATTTG TGACAATCGG
CTGGTGATGC GCCCTTACGG CGCGACGTTT CTCGCCAGTC TGCCGCCCGC GCCGCGCACC
CGTGACATTG CCCGTGCGGT TCGCTTCCTT GCGATACCAT CCTCCAGGTA A
 
Protein sequence
MTDDFAPDGQ LAKAIPGFKP REPQRQMAVA VTQAIEKGQP LVVEAGTGTG KTYAYLAPAL 
RAKKKVIIST GSKALQDQLY SRDLPTVSKA LKYTGNLALL KGRSNYLCLE RLEQQALAGG
DLPVQILSDV ILLRSWSNQT VDGDISTCVS VAEDSQAWPL VTSTNDNCLG SDCPMYKDCF
VVKARKKAMD ADVVVVNHHL FLADMVVKES GFGELIPEAD VMIFDEAHQL PDIASQYFGQ
SLSSRQLLDL AKDITIAYRT ELKDTQQLQK CADRLAQSAQ DFRLQLGEPG YRGNLRELLA
NPQIQRAFLL LDDTLELCYD VAKLSLGRSA LLDAAFERAT LYRTRLKRLK EINQPGYSYW
YECTSRHFTL ALTPLSVADK FKELMAQKPG SWIFTSATLS VNDDLHHFTS RLGIEQAESL
LLPSPFDYSR QALLCVPRNL PQTNQPGSAR QLAAMLRPII EANNGRCFML CTSHAMMRDL
AEQFRATMTL PVLLQGETSK GQLLQQFVSA GNALLVATSS FWEGVDVRGD TLSLVIIDKL
PFTSPDDPLL KARMEDCRLR GGDPFDEVQL PDAVITLKQG VGRLIRDADD RGVLVICDNR
LVMRPYGATF LASLPPAPRT RDIARAVRFL AIPSSR