Gene EcSMS35_4574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4574 
Symbol 
ID6146999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4672087 
End bp4674315 
Gene Length2229 bp 
Protein Length742 aa 
Translation table11 
GC content51% 
IMG OID641619390 
Producthypothetical protein 
Protein accessionYP_001746502 
Protein GI170683254 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACACAC AGACCCTGTA TGAGTTAAGT CAGGAGGCTG AACGCCTGTT ACAGCTTTCT 
CGCCAACAGT TGCAGTTACT GGAAAAAATG CCTCTCTCTG TACCCGGAGA CGATGCGCCA
CAACTGGCTT TACCCTGGAG TCAGCCTAAT ATCGCCGAAC GTCACGCGAT GCTGAATAAT
GAGTTGCGTA AAATTTCCCG ACTGGAAATG GTGCTGGCGA TTGTCGGTAC CATGAAAGCA
GGGAAATCAA CCACCATTAA TGCCATTGTT GGTACGGAGG TTCTGCCTAA TCGCAATCGC
CCAATGACTG CGCTGCCGAC GCTTATTCGC CATACGCCCG GGCAAAAAGA ACCGGTACTG
CATTTTTCAC ATGTCGCGCC AATCGATTGT TTAATTCAAA AATTACAACA GCGCCTGCGT
GATTGCGATA TTAAGCATCT GACCGATGTG CTGGAAATAG ATAAAGATAT GCGTGCGCTT
ATGCAGCGGA TCGAAAATGG CGTCGCTTTC GAAAAATATT ATCTGGGTGC CCAGCCTATT
TTTCATTGTC TGAAAAGTTT GAATGATTTG GTGCGACTGG CGAAGGCGCT GGACGTCGAT
TTTCCTTTTT CTGCTTACGC CGCCATTGAG CATATTCCCG TGATTGAAGT GGAGTTTGTC
CATCTGGCGG GGCTGGAGAG TTATCCCGGC CAATTGACGT TACTGGATAC CCCCGGGCCA
AATGAAGCCG GGCAACCGCA TCTGCAAAAA ATGCTTAACC AGCAGCTGGC ACGCGCCTCG
GCGGTACTGG CGGTGCTGGA TTATACGCAA CTGAAATCGA TCTCCGATGA AGAGGTCCGT
GAGGCGATTT TGGCGGTGGG GCAATCGGTG CCGTTGTATG TGCTGGTCAA TAAGTTCGAT
CAACAGGATC GTAACAGTGA CGACGCCGAC CAGGTGCGGG CACTGATTTC CGGGACGCTG
ATGAAAGGCT GTATTACGCC ACAGCAGATA TTTCCAGTGT CGTCGATGTG GGGCTACCTG
GCGAATCGGG CGCGTAATGA GTTAGCCAAC AGCGGTAAGT TACCCGCGCC AGAGCAACAA
CGCTGGGTGG AAGATTTTGC CCATGCCGCG CTCGGCAGGC GCTGGCGTCA TGCCGACCTG
GCGAACCTCG AACATATTCG TCATGCTGCC GATCAGTTGT GGGAAGATTC GCTGTTCGCC
AAGCCAATTC AGGCGTTGCT TCATGCCGCT TACGCTAACG CCTCGTTGTA TGCTCTGCGA
TCTGCCGCGC ATAAACTGTT GAATTACGCG CAGCAGGCGC GGGAATACCT GGATTTTCGT
GCGCACGGGT TAAACGTCGC TTGTGAACAA TTGCGGCAAA ATATCCACCA GATCGAAGAA
AGTTTGCAGC TATTTCAACT CAATCAGGCT CAGGTGAGCG GCGAGATTAA ACATGAAATC
GAGCTGGCCC TGACCTCCGC CAACCTCTTT CTGCGTCAAC AGCAAGATGC GGTGAATGCC
CAGTTAGCCG CGTTGTTTCA GGATGATTCG GGGTCATTAA GCGAGATTCG TACCTGCTGT
GAGACACTGT TACAGACGGC GCAGAACACC ATCAGTCGCG ACTTTACGCT GCGTTTTGCC
GAGCTTGAAT CCACCCTTTG CCGGGTGTTA ACCGATGTTA TTCGGCCCAT TGAGCAACAA
GTCAAAATGG AATTGAGCGA GTCAGGGTTT CGTCCTGGGT TTCATTTTCC TGTTTTTCAC
AGCGCAGTTC CCCACTTCAA CACTCGCCAG CTGTTCAGTG AAGTCATTTC GCGCCAGGAC
GCAATGGACG AGCAGAGCAC GCGTTTAGGC GTTGTGCGTG AGACTTTTTC GCGCTGGTTG
AATCAGCCCG ACTGGGGACG GGGAAATGAG AAATCCCCGA CAGAGACGGT TGATTACAGT
GTGTTGCAAC GAGCATTAAG CGCAGAAGTC GATCTTTATT GCCAACAAAT GGCTAAAGTT
CTGGCAGAGC AGGTCGATGA ATCTGTTACG GCAGGCATGA ATACTTTTTT CGCTGAGTTC
GCTTCATGTT TGACGGAATT ACAGACGCGT TTACGCGAAA GCCTGGCTCT GCGTCAACAA
AATGAATCGG TGGTCAGGCT GATGCAGCAG CAATTGCAGC AGGCTGTGAT GACTCACAGC
TGGATTTACA CCGACGCTCA GCTGTTACGC GATGATATTC AAACACTTTT CACGGCAGAA
CGATATTGA
 
Protein sequence
MYTQTLYELS QEAERLLQLS RQQLQLLEKM PLSVPGDDAP QLALPWSQPN IAERHAMLNN 
ELRKISRLEM VLAIVGTMKA GKSTTINAIV GTEVLPNRNR PMTALPTLIR HTPGQKEPVL
HFSHVAPIDC LIQKLQQRLR DCDIKHLTDV LEIDKDMRAL MQRIENGVAF EKYYLGAQPI
FHCLKSLNDL VRLAKALDVD FPFSAYAAIE HIPVIEVEFV HLAGLESYPG QLTLLDTPGP
NEAGQPHLQK MLNQQLARAS AVLAVLDYTQ LKSISDEEVR EAILAVGQSV PLYVLVNKFD
QQDRNSDDAD QVRALISGTL MKGCITPQQI FPVSSMWGYL ANRARNELAN SGKLPAPEQQ
RWVEDFAHAA LGRRWRHADL ANLEHIRHAA DQLWEDSLFA KPIQALLHAA YANASLYALR
SAAHKLLNYA QQAREYLDFR AHGLNVACEQ LRQNIHQIEE SLQLFQLNQA QVSGEIKHEI
ELALTSANLF LRQQQDAVNA QLAALFQDDS GSLSEIRTCC ETLLQTAQNT ISRDFTLRFA
ELESTLCRVL TDVIRPIEQQ VKMELSESGF RPGFHFPVFH SAVPHFNTRQ LFSEVISRQD
AMDEQSTRLG VVRETFSRWL NQPDWGRGNE KSPTETVDYS VLQRALSAEV DLYCQQMAKV
LAEQVDESVT AGMNTFFAEF ASCLTELQTR LRESLALRQQ NESVVRLMQQ QLQQAVMTHS
WIYTDAQLLR DDIQTLFTAE RY