Gene Information |
Locus tag | EcSMS35_1380 |
Symbol | |
ID | 6145140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1366858 |
End bp | 1368768 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641616258 |
Product | hypothetical protein |
Protein accession | YP_001743438 |
Protein GI | 170682824 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1199] Rad3-related DNA helicases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
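The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both are assumptions about how this record reports its values, not statements taken from the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1366858
end_bp = 1368768
reported_gene_length_bp = 1911
reported_protein_length_aa = 636

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codon_count, leftover_bases = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_length_aa = codon_count - 1

print(span_bp, codon_count, leftover_bases, expected_protein_length_aa)
```

Under these assumptions the span is 1911 bp, which is 637 codons with no leftover bases, giving 636 residues once the stop codon is excluded. This agrees with the reported gene and protein lengths.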
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.434814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.350106 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGACG ATTTTGCACC AGACGGTCAG CTGGCGAAAG CGATACCAGG CTTTAAGCCG CGAGAACCAC AGCGACAGAT GGCGGTAGCC GTCACCCAGG CGATAGAAAA AGGCCAGCCG CTGGTGGTGG AAGCCGGAAC CGGTACGGGC AAAACCTACG CTTACCTGGC CCCTGCGCTG CGGGCGAAAA AGAAAGTCAT TATCTCGACC GGTTCAAAAG CGTTGCAGGA TCAGCTCTAC AGCCGCGATT TGCCAACGGT CTCAAAGGCA TTGAAATACA CGGGCAACCT GGCGTTGCTG AAAGGGCGCT CAAACTACCT CTGCCTCGAA CGTCTCGAAC AGCAGGCGCT GGCGGGGGGC GATCTGCCGG TACAAATCTT AAGCGATGTG ATCCTGCTGC GCTCCTGGTC TAATCAAACA GTCGATGGTG ATATCAGCAC CTGCGTCAGC GTGGCGGAAG ATTCACAGGC GTGGCCGCTG GTCACCAGCA CCAACGATAA CTGCCTTGGC AGCGACTGCC CGATGTATAA AGATTGCTTT GTGGTCAAAG CACGCAAAAA AGCGATGGAC GCCGATGTGG TGGTGGTAAA CCATCATCTC TTTCTGGCGG ATATGGTGGT GAAAGAGAGT GGATTTGGCG AACTGATCCC GGAAGCTGAC GTCATGATCT TCGACGAAGC CCACCAACTG CCCGACATTG CCAGCCAGTA TTTTGGTCAG TCACTCTCCA GTCGACAACT GCTCGACCTG GCAAAAGACA TCACCATCGC CTACCGCACC GAATTAAAAG ACACCCAGCA GTTACAAAAG TGCGCCGACC GCCTTGCCCA GAGCGCGCAG GATTTTCGTC TGCAACTCGG TGAGCCTGGT TATCGTGGCA ACCTGCGCGA ACTGTTAGCT AATCCGCAAA TTCAACGGGC GTTTTTACTG CTCGATGACA CCCTGGAACT TTGTTATGAC GTGGCGAAAC TGTCGCTGGG GCGTTCCGCT TTGCTGGATG CGGCATTTGA GCGCGCCACG TTGTATCGCA CGCGGCTGAA ACGGCTAAAA GAGATCAATC AGCCGGGCTA CAGCTACTGG TACGAATGCA CTTCGCGCCA TTTTACTCTG GCACTCACGC CGCTCAGCGT GGCGGATAAA TTCAAAGAGT TAATGGCGCA AAAACCCGGT AGCTGGATCT TTACCTCAGC AACGCTGTCG GTGAACGACG ATCTGCATCA TTTCACCTCG CGGCTTGGCA TCGAACAGGC GGAGTCGTTG CTGTTACCCA GCCCGTTTGA TTACAGCCGC CAGGCGTTAC TCTGTGTGCC GCGCAACCTG CCGCAAACCA ATCAACCGGG CTCCGCACGG CAACTGGCGG CAATGTTGCG ACCGATCATC GAAGCTAACA ACGGTCGTTG TTTTATGCTT TGTACCTCGC ACGCCATGAT GCGCGATCTG GCTGAGCAGT TCCGCGCTAC CATGACGCTT CCCGTTTTGT TGCAGGGGGA AACCAGCAAA GGGCAACTGT TGCAGCAATT TGTCAGCGCC GGTAACGCGC TTCTTGTGGC AACCAGCAGC TTCTGGGAAG GGGTGGATGT GCGTGGCGAT ACATTGTCAT TGGTGATTAT CGACAAGTTG CCGTTTACCT CACCGGATGA TCCACTATTA AAAGCGCGCA TGGAAGATTG CCGTTTGCGT GGTGGTGACC CGTTCGATGA AGTACAACTA CCGGATGCGG TGATTACTCT CAAGCAGGGA GTAGGGCGAC TGATTCGCGA CGCCGACGAT CGCGGGGTTT TGGTGATTTG TGACAATCGG 
CTGGTGATGC GCCCTTACGG CGCGACGTTT CTCGCCAGTC TGCCGCCCGC GCCGCGCACC CGTGACATTG CCCGTGCGGT TCGCTTCCTT GCGATACCAT CCTCCAGGTA A
|
Protein sequence | MTDDFAPDGQ LAKAIPGFKP REPQRQMAVA VTQAIEKGQP LVVEAGTGTG KTYAYLAPAL RAKKKVIIST GSKALQDQLY SRDLPTVSKA LKYTGNLALL KGRSNYLCLE RLEQQALAGG DLPVQILSDV ILLRSWSNQT VDGDISTCVS VAEDSQAWPL VTSTNDNCLG SDCPMYKDCF VVKARKKAMD ADVVVVNHHL FLADMVVKES GFGELIPEAD VMIFDEAHQL PDIASQYFGQ SLSSRQLLDL AKDITIAYRT ELKDTQQLQK CADRLAQSAQ DFRLQLGEPG YRGNLRELLA NPQIQRAFLL LDDTLELCYD VAKLSLGRSA LLDAAFERAT LYRTRLKRLK EINQPGYSYW YECTSRHFTL ALTPLSVADK FKELMAQKPG SWIFTSATLS VNDDLHHFTS RLGIEQAESL LLPSPFDYSR QALLCVPRNL PQTNQPGSAR QLAAMLRPII EANNGRCFML CTSHAMMRDL AEQFRATMTL PVLLQGETSK GQLLQQFVSA GNALLVATSS FWEGVDVRGD TLSLVIIDKL PFTSPDDPLL KARMEDCRLR GGDPFDEVQL PDAVITLKQG VGRLIRDADD RGVLVICDNR LVMRPYGATF LASLPPAPRT RDIARAVRFL AIPSSR
|
| |
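The relationship between the gene sequence and the protein sequence can be illustrated on the first and last few codons. The sketch below is a minimal hand-rolled translator, not the tool the database used. It builds the codon map from the standard amino-acid assignment string, which translation table 11 shares with the standard code, and it treats an initiating GTG (or ATG/TTG) as methionine. The function name `translate_cds`, the `has_start_codon` parameter, and the short start-codon set are illustrative assumptions. Only the first 30 and last 21 bases of the gene are used here.

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a conservative subset of the initiators allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, has_start_codon: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating GTG/TTG is read as Met rather than Val/Leu.
        if index == 0 and has_start_codon and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


gene_head = "GTGACGGACG ATTTTGCACC AGACGGTCAG"  # first 30 bases of the gene sequence
gene_tail = "GCGATACCAT CCTCCAGGTA A"           # last 21 bases, ending in TAA

print(translate_cds(gene_head))
print(translate_cds(gene_tail, has_start_codon=False))
```

The head fragment translates to `MTDDFAPDGQ` and the tail fragment to `AIPSSR`, matching the first ten and last six residues of the protein sequence listed above.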