Gene EcSMS35_2651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2651 
Symbol 
ID6146975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2710990 
End bp2713233 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content47% 
IMG OID641617522 
Productputative cytochrome C-type biogenesis protein 
Protein accessionYP_001744687 
Protein GI170682075 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGA ATGCAACTTA TATAAAAATA CGTGATAAAT GGTGGGGGCT TCCGCTGTTC 
CTGCCTTCTT TAATCTTGCC CATTTTCGCC CACATTAATA CTTTCGCGCA TATTTCTTCC
GGTGAGGTTT TTCTCTTTTA TCTGCCTCTG GCTCTGATGA TCAGCATGAT GATGTTTTTC
AGCTGGGCGG CATTGCCAGG GATCGCCTTA GGGATTTTTG TCCGCAAATA TGCAGAGCTG
GGTTTTTACG AAACGCTCTC ATTAACGGCT AATTTTATTA TCATTATCAT TCTCTGTTGG
GGCGGTTACA GGGTCTTTAC TCCCCGGCGT AACAACGTTT CACATGGTGA TACCCGTTTA
ATTTCCCAGC GTATATTCTG GCAGATTGTG TTTCCTGCAA CGCTGTTTCT GATACTTTTC
CAGTTTGCTG CGTTTGTAGG ATTACTGGCG AGCAGAGAAA ATCTGGTCGG CGTCATGCCT
TTTAACCTCG GGACCTTAAT CAATTATCAG GCCTTACTGG TGGGCAATCT GATTGGTGTC
CCGCTGTGCT ACTTCATCAT TCGTGTAGTG CGAAATCCGT TTTATTTACG CAGTTATTAT
TCGCAATTAA AACAGCAGGT TGATGTCAAA GTCACCAAAA AAGAGTTCGC ACTCTGGCTA
CTGGCATTAG GTGCTTTACT GCTGCTGTTA TGCATGCCGT TAAATGAAAA AAGCACAATT
TTTAGCACCA ACTATACCTT GTCATTATTG CTGCCCCTGA TGATGTGGGG AGCGATGCGC
TATGGTTATA AGCTGATTTC GCTGCTCTGG GCGGTCGTGT TGATGATCAG CATCCACAGC
TATCAAAATT ACATTCCCAT TTATCCTGGC TATACCACGC AACTGACCAT AACCTCCTCC
AGTTATCTGG TATTCTCTTT TATTGTCAAT TATATGGCTG TACTGGCAAC CCGTCAGCGA
GCGGTAGTCA GACGCATTCA GCGGCTTGCG TATGTGGACC CGGTGGTTCA TCTGCCAAAT
GTTCGCGCCC TGAATCGCGC GTTGCGTGAT GCCCCCTGGT CTGCGCTTTG TTATTTACGC
ATCCCTGGCA TGGAAATGCT GGTTAAGAAC TATGGCATCA TGCTACGGAT TCAATACAAG
CAAAAACTTT CTCACTGGCT GTCACCCTTG CTGGAACCGG GTGAAGATGT TTATCAGCTT
TCGGGTAACG ATCTCGCGCT GCGACTGAAT ACAGAATCGC ACCAGGAGCG CATTACCGAA
CTGGATAGCC ATCTCAAGCA ATTTCGTTTC TTTTGGGATG GAATGCCGAT GCAACCGCAG
ATTGGCGTCA GTTACTGCTA TGTGCGCTCG CCAGTGAATC ATATCTACCT GCTGCTGGGA
GAGCTAAATA CGGTCGCCGA GCTTTCCATC GTGACCAACG CCCCGGAAAA TATGCAGCGT
CGCGGGGCAA TGTATTTGCA ACGCGAATTG AAAGATAAAG TCGCGATGAT GAATCGGCTA
CAGCAGGCGC TGGAACACAA CCATTTTTTC CTGATGGCCC AGCCGATTAC CGGTATGCGT
GGTGATGTCT ACCATGAAAT TCTTCTGCGC ATGAAAGGTG AGAATGATGA ACTGATCAGC
CCCGACAGCT TCTTGCCGGT CGCGCACGAA TTTGGTTTAT CGTCGAGTAT CGACATGTGG
GTCATTGAGC ATACGCTGCA ATTTATGGCT GAAAACAGAG CGAAGATGCC CGCTCACCGT
TTTGCGATTA ATCTGTCGCC AACCTCGGTA TGTCAGGCGC GTTTTCCCGT TGAAGTCAGT
CAGCTGCTGG CTAAATATCA AATTGAAGCG TGGCAACTTA TTTTTGAAGT CACCGAAAGT
AATGCTCTGA CCAATGTTAA GCAGGCGCAA ATCACCTTGC AGCATCTTCA GGAATTAGGC
TGCCAGATTG CGATTGATGA TTTCGGCACC GGCTATGCCA GCTATGCGCG GCTTAAAAAT
GTGAATGCCG ATTTGCTTAA AATTGACGGC AGTTTTATCC GCAATATTGT GTCAAATAGT
CTGGATTATC AGATAGTGGC GTCGATTTGC CACCTGGCGC GAATGAAGAA AATGCGGGTA
GTGGCAGAGT ACGTTGAAAA CGAAGAGATC CGCGAGGCGG TGTTCTCTTT GGGGATCGAT
TATATGCAGG GTTATCTTAT TGGTAAGCCG CAACCGTTAA TTGATACGCT GAATGAAATC
GAACCCATTC GCGAAAGTGC CTGA
 
Protein sequence
MKLNATYIKI RDKWWGLPLF LPSLILPIFA HINTFAHISS GEVFLFYLPL ALMISMMMFF 
SWAALPGIAL GIFVRKYAEL GFYETLSLTA NFIIIIILCW GGYRVFTPRR NNVSHGDTRL
ISQRIFWQIV FPATLFLILF QFAAFVGLLA SRENLVGVMP FNLGTLINYQ ALLVGNLIGV
PLCYFIIRVV RNPFYLRSYY SQLKQQVDVK VTKKEFALWL LALGALLLLL CMPLNEKSTI
FSTNYTLSLL LPLMMWGAMR YGYKLISLLW AVVLMISIHS YQNYIPIYPG YTTQLTITSS
SYLVFSFIVN YMAVLATRQR AVVRRIQRLA YVDPVVHLPN VRALNRALRD APWSALCYLR
IPGMEMLVKN YGIMLRIQYK QKLSHWLSPL LEPGEDVYQL SGNDLALRLN TESHQERITE
LDSHLKQFRF FWDGMPMQPQ IGVSYCYVRS PVNHIYLLLG ELNTVAELSI VTNAPENMQR
RGAMYLQREL KDKVAMMNRL QQALEHNHFF LMAQPITGMR GDVYHEILLR MKGENDELIS
PDSFLPVAHE FGLSSSIDMW VIEHTLQFMA ENRAKMPAHR FAINLSPTSV CQARFPVEVS
QLLAKYQIEA WQLIFEVTES NALTNVKQAQ ITLQHLQELG CQIAIDDFGT GYASYARLKN
VNADLLKIDG SFIRNIVSNS LDYQIVASIC HLARMKKMRV VAEYVENEEI REAVFSLGID
YMQGYLIGKP QPLIDTLNEI EPIRESA