Gene EcSMS35_2403 details


Gene Information

Locus tag: EcSMS35_2403
Symbol:
ID: 6143052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2451811
End bp: 2453013
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 55%
IMG OID: 641617276
Product: competence damage-inducible protein A
Protein accession: YP_001744448
Protein GI: 170680019
COG category: [R] General function prediction only
COG ID: [COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
            [TIGR00200] competence/damage-inducible protein CinA N-terminal domain
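
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not state this, but its numbers are consistent with it) and that the CDS ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record; everything else is illustrative.
START_BP = 2451811
END_BP = 2453013


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1203
print(protein_length(length_bp))  # 400
```

Both results agree with the Gene Length and Protein Length fields in the record.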


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAAG TGGAAATGTT ATCCACCGGG GATGAAGTGT TACACGGGCA AATCGTTGAC 
ACTAACGCTG CCTGGCTGGC CGATTTTTTC TTTCATCAGG GGTTGCCATT ATCTCGCCGC
AATACGGTGG GGGATAACCT TGATGACTTA GTCACCATTC TTCGCGAACG GAGTCAGCAC
GCCGATGTGC TGATCGTTAA CGGCGGGCTG GGGCCGACCA GTGATGATTT AAGCGCACTC
GCCGCTGCGA CAGCAAAAGG TGAAGGCCTG GTGTTGCATG AAGCCTGGCT CAAAGAGATG
GAACGCTATT TCCACGAACG TGGACGAGTA ATGGCACCGA GCAACCGTAA ACAAGCGGAG
CTACCTGCCA GTGCTGAATT TATCAATAAC CCGGTAGGCA CCGCCTGTGG TTTTGCCGTG
CAGCTCAATC GTTGCCTGAT GTTCTTTACC CCCGGCGTAC CGTCAGAATT TAAGGTGATG
GTCGAGCACG AAATCCTGCC GCGCCTGCGC GAGCGTTTTT CTTTACCGCA GCCGCCGGTT
TGTCTGCGTT TAACTACCTT TGGTCGTTCG GAAAGCGATC TGGCGCAAAG CCTGGACTCC
CTGCAACTGC CGCCGGGCGT AACAATGGGC TATCGCTCCT CAATGCCGAT TATCGAACTG
AAACTCACCG GACCGGCAAG CGAGGAACAG GCGATGGAAA AACTGTGGCT GGACGTTAAA
CGTGTTGCCG GACAGAGCGT GATTTTCGAA GGCACCGAAG GACTGCCCGC GCAGATCAGT
CGCGAATTGC AGAGCCGCCA GTTCAGCCTG ACGTTGAGCG AGCAATTCAC CGGTGGTTTA
TTGGCTTTGC AACTTTCTCG CGCAGGTGCT CCATTGCTGG CGTGTGAAGT GGTTCCTTCA
CAGGAAGAAA CCCTGGCGCA AACTGCGCAC TGGATTACAG AACGGCGGGC CAACCATTTT
GCCGGGCTGG CACTGGCTGT TTCGGGTTTC GAGAACGAGC ATCTCAACTT TGCGCTAGCC
ACGCCAGACG GCACTTTCGC TCTGCGTGTG CGTTTCAGCA CTACGCGCTA CAGCCAGGCT
ATCCGTCAGG AAGTGTGCGC AATGATGGCA CTGAATATGC TGCGCCGTTG GTTAAACGAC
CAGGACATCG CCAGTGAGCA TGGCTGGATT GAGGTTGTTG AGTCCATGAC CTTATCTGTC
TGA
 
Protein sequence
MLKVEMLSTG DEVLHGQIVD TNAAWLADFF FHQGLPLSRR NTVGDNLDDL VTILRERSQH 
ADVLIVNGGL GPTSDDLSAL AAATAKGEGL VLHEAWLKEM ERYFHERGRV MAPSNRKQAE
LPASAEFINN PVGTACGFAV QLNRCLMFFT PGVPSEFKVM VEHEILPRLR ERFSLPQPPV
CLRLTTFGRS ESDLAQSLDS LQLPPGVTMG YRSSMPIIEL KLTGPASEEQ AMEKLWLDVK
RVAGQSVIFE GTEGLPAQIS RELQSRQFSL TLSEQFTGGL LALQLSRAGA PLLACEVVPS
QEETLAQTAH WITERRANHF AGLALAVSGF ENEHLNFALA TPDGTFALRV RFSTTRYSQA
IRQEVCAMMA LNMLRRWLND QDIASEHGWI EVVESMTLSV
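
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch of how one might strip the block formatting, compute GC content, and translate the gene. It assumes the standard codon assignments, which translation table 11 shares with table 1, and it does not handle alternative start codons (not needed here, since this gene starts with ATG). The helper names are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
gene_start = clean_sequence("ATGTTAAAAG TGGAAATGTT")
print(translate(gene_start))  # MLKVEM
```

The printed residues match the first six residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and then `gc_content` or `translate` would allow the GC content and Protein sequence fields to be checked the same way.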