Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4574 |
Symbol | |
ID | 6146999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4672087 |
End bp | 4674315 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619390 |
Product | hypothetical protein |
Protein accession | YP_001746502 |
Protein GI | 170683254 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACACAC AGACCCTGTA TGAGTTAAGT CAGGAGGCTG AACGCCTGTT ACAGCTTTCT CGCCAACAGT TGCAGTTACT GGAAAAAATG CCTCTCTCTG TACCCGGAGA CGATGCGCCA CAACTGGCTT TACCCTGGAG TCAGCCTAAT ATCGCCGAAC GTCACGCGAT GCTGAATAAT GAGTTGCGTA AAATTTCCCG ACTGGAAATG GTGCTGGCGA TTGTCGGTAC CATGAAAGCA GGGAAATCAA CCACCATTAA TGCCATTGTT GGTACGGAGG TTCTGCCTAA TCGCAATCGC CCAATGACTG CGCTGCCGAC GCTTATTCGC CATACGCCCG GGCAAAAAGA ACCGGTACTG CATTTTTCAC ATGTCGCGCC AATCGATTGT TTAATTCAAA AATTACAACA GCGCCTGCGT GATTGCGATA TTAAGCATCT GACCGATGTG CTGGAAATAG ATAAAGATAT GCGTGCGCTT ATGCAGCGGA TCGAAAATGG CGTCGCTTTC GAAAAATATT ATCTGGGTGC CCAGCCTATT TTTCATTGTC TGAAAAGTTT GAATGATTTG GTGCGACTGG CGAAGGCGCT GGACGTCGAT TTTCCTTTTT CTGCTTACGC CGCCATTGAG CATATTCCCG TGATTGAAGT GGAGTTTGTC CATCTGGCGG GGCTGGAGAG TTATCCCGGC CAATTGACGT TACTGGATAC CCCCGGGCCA AATGAAGCCG GGCAACCGCA TCTGCAAAAA ATGCTTAACC AGCAGCTGGC ACGCGCCTCG GCGGTACTGG CGGTGCTGGA TTATACGCAA CTGAAATCGA TCTCCGATGA AGAGGTCCGT GAGGCGATTT TGGCGGTGGG GCAATCGGTG CCGTTGTATG TGCTGGTCAA TAAGTTCGAT CAACAGGATC GTAACAGTGA CGACGCCGAC CAGGTGCGGG CACTGATTTC CGGGACGCTG ATGAAAGGCT GTATTACGCC ACAGCAGATA TTTCCAGTGT CGTCGATGTG GGGCTACCTG GCGAATCGGG CGCGTAATGA GTTAGCCAAC AGCGGTAAGT TACCCGCGCC AGAGCAACAA CGCTGGGTGG AAGATTTTGC CCATGCCGCG CTCGGCAGGC GCTGGCGTCA TGCCGACCTG GCGAACCTCG AACATATTCG TCATGCTGCC GATCAGTTGT GGGAAGATTC GCTGTTCGCC AAGCCAATTC AGGCGTTGCT TCATGCCGCT TACGCTAACG CCTCGTTGTA TGCTCTGCGA TCTGCCGCGC ATAAACTGTT GAATTACGCG CAGCAGGCGC GGGAATACCT GGATTTTCGT GCGCACGGGT TAAACGTCGC TTGTGAACAA TTGCGGCAAA ATATCCACCA GATCGAAGAA AGTTTGCAGC TATTTCAACT CAATCAGGCT CAGGTGAGCG GCGAGATTAA ACATGAAATC GAGCTGGCCC TGACCTCCGC CAACCTCTTT CTGCGTCAAC AGCAAGATGC GGTGAATGCC CAGTTAGCCG CGTTGTTTCA GGATGATTCG GGGTCATTAA GCGAGATTCG TACCTGCTGT GAGACACTGT TACAGACGGC GCAGAACACC ATCAGTCGCG ACTTTACGCT GCGTTTTGCC GAGCTTGAAT CCACCCTTTG CCGGGTGTTA ACCGATGTTA TTCGGCCCAT TGAGCAACAA GTCAAAATGG AATTGAGCGA GTCAGGGTTT CGTCCTGGGT TTCATTTTCC TGTTTTTCAC AGCGCAGTTC CCCACTTCAA CACTCGCCAG CTGTTCAGTG AAGTCATTTC GCGCCAGGAC GCAATGGACG AGCAGAGCAC GCGTTTAGGC GTTGTGCGTG AGACTTTTTC GCGCTGGTTG AATCAGCCCG ACTGGGGACG GGGAAATGAG AAATCCCCGA CAGAGACGGT TGATTACAGT GTGTTGCAAC GAGCATTAAG CGCAGAAGTC GATCTTTATT GCCAACAAAT GGCTAAAGTT CTGGCAGAGC AGGTCGATGA ATCTGTTACG GCAGGCATGA ATACTTTTTT CGCTGAGTTC GCTTCATGTT TGACGGAATT ACAGACGCGT TTACGCGAAA GCCTGGCTCT GCGTCAACAA AATGAATCGG TGGTCAGGCT GATGCAGCAG CAATTGCAGC AGGCTGTGAT GACTCACAGC TGGATTTACA CCGACGCTCA GCTGTTACGC GATGATATTC AAACACTTTT CACGGCAGAA CGATATTGA
|
Protein sequence | MYTQTLYELS QEAERLLQLS RQQLQLLEKM PLSVPGDDAP QLALPWSQPN IAERHAMLNN ELRKISRLEM VLAIVGTMKA GKSTTINAIV GTEVLPNRNR PMTALPTLIR HTPGQKEPVL HFSHVAPIDC LIQKLQQRLR DCDIKHLTDV LEIDKDMRAL MQRIENGVAF EKYYLGAQPI FHCLKSLNDL VRLAKALDVD FPFSAYAAIE HIPVIEVEFV HLAGLESYPG QLTLLDTPGP NEAGQPHLQK MLNQQLARAS AVLAVLDYTQ LKSISDEEVR EAILAVGQSV PLYVLVNKFD QQDRNSDDAD QVRALISGTL MKGCITPQQI FPVSSMWGYL ANRARNELAN SGKLPAPEQQ RWVEDFAHAA LGRRWRHADL ANLEHIRHAA DQLWEDSLFA KPIQALLHAA YANASLYALR SAAHKLLNYA QQAREYLDFR AHGLNVACEQ LRQNIHQIEE SLQLFQLNQA QVSGEIKHEI ELALTSANLF LRQQQDAVNA QLAALFQDDS GSLSEIRTCC ETLLQTAQNT ISRDFTLRFA ELESTLCRVL TDVIRPIEQQ VKMELSESGF RPGFHFPVFH SAVPHFNTRQ LFSEVISRQD AMDEQSTRLG VVRETFSRWL NQPDWGRGNE KSPTETVDYS VLQRALSAEV DLYCQQMAKV LAEQVDESVT AGMNTFFAEF ASCLTELQTR LRESLALRQQ NESVVRLMQQ QLQQAVMTHS WIYTDAQLLR DDIQTLFTAE RY
|
| |