Gene EcSMS35_4605 details

Gene Information

Locus tag: EcSMS35_4605
Symbol: dipZ
ID: 6143776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4708312
End bp: 4710009
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 54%
IMG OID: 641619421
Product: thiol:disulfide interchange protein precursor
Protein accession: YP_001746532
Protein GI: 170683820
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4232] Thiol:disulfide interchange protein
TIGRFAM ID:
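The coordinates and lengths listed above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4708312, 4710009)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1698 bp) and Protein Length (565 aa).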


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0011627
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.838171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA 
TTATTCGACG CGCCGGGACG TTCACAATTT GTCCCCGCGG ATCAAGCCTT TGCTTTTGAT
TTTCAGCAAA AGCAACATGA CCTTAATCTG ACCTGGCAGA TCAAAGACGG TTACTACCTC
TACCGTAAAC AGATCCGCAT TACGCCGGAA CACGCGAAAA TTGCCGACGT GCAGCTGCCA
CAAGGCGTCT GGCATGAAGA TGAGTTTTAC GGCAAAAGCG AGATTTACCG CGATCGGCTG
ACGCTTCCCG TAACCATCAA CCAGGCGAGT GCGGGAGCGA CGTTAACTGT CACCTACCAG
GGCTGTGCTG ATGCCGGTTT CTGTTATCCG CCAGAAACCA AAACCGTTCC GTTAAGCGAA
GTGGTCGCCA ACAACGCAGC GTCACAGCCT GTGTCTGTTC CGCAGCAAGA GCAGCACACC
GCGCAATTGC CCTTTTCCGC GCTCTGGGCG TTGTTGATCG GTATTGGTAT CGCCTTTACG
CCATGCGTGC TGCCAATGTA CCCTCTGATT TCTGGTATCG TGCTGGGCGG TAAACAGCGG
CTCTCCACTG CCAGAGCATT GTTGCTGACC TTTATTTATG TGCAGGGGAT GGCACTGACT
TACACGGCGC TGGGTCTGGT AGTTGCCGCC GCAGGGTTAC AGTTCCAGGC GGCGCTACAG
CACCCATACG TGCTCATTGG CCTCGCTATC GTCTTTACCT TGCTGGCGAT GTCAATGTTT
GGCTTGTTTA CCCTGCAACT CCCCTCTTCG CTGCAAACGC GCCTCACGCT GATGAGCAAT
CGCCAACAGG GCGGCTCACC TGGCGGTGTG TTTGTTATGG GAGCGATTGC CGGACTGATC
TGTTCACCAT GCACCACCGC ACCGCTTAGC GCGATTCTGT TGTATATCGC CCAAAGCGGG
AACATGTGGC TGGGGGGCGG CACGCTTTAT CTCTATGCGT TGGGCATGGG CCTGCCGCTG
ATGCTAATAA CCGTCTTTGG TAACCGCCTG CTGCCGAAAA GCGGCCCGTG GATGGAACAA
GTCAAAACCG CGTTTGGTTT TGTGATCCTC GCACTGCCGG TCTTCCTGCT GGAGCGAGTG
ATTGGTGATA TATGGGGATT ACGCTTGTGG TCGGCGCTTG GTGTCGCATT CTTTGGCTGG
GCCTTTATCA CCAGCCTACA GGCCAAACGC GGCTGGATGC GCGTGGTGCA AATAATCCTG
CTGGCAGCGG CATTGGTTAG CGTGCGCCCA CTTCAGGATT GGGCATTTGG TGCGACGCAT
ACCGTGCAAA CTCAGACGCA TCTCAACTTT ACACAAATCA AAACGGTAGA TGAGTTAAAT
CAGGCGCTCG TTGAAGCCAA AGGCAAACCA GTGATGTTAG ATCTCTATGC CGACTGGTGC
GTCGCCTGTA AAGAGTTTGA GAAATACACC TTCAGCGACC CGCAGGTGCA AAAAGCGTTA
GCAGACACGG TCTTACTTCA GGCTAACGTC ACTGCCAACG ACGCCCAAGA TGTGGCGCTG
TTAAAGCATC TTAATGTCCT TGGCCTACCG ACAATTCTCT TTTTTGACGG ACAAGGCCAG
GAGCATCCAC AAGAACGCGT CACGGGCTTT ATGGATGCTG AAACCTTCAG CGCACATTTG
CGCGATCGCC AACCGTGA
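The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be cleaned before any computation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation; the helper names are hypothetical, and the example input is only the first line of the sequence, so its GC fraction is not expected to match the 54% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, as displayed.
first_line = "ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA"
fragment = clean_sequence(first_line)
print(len(fragment), gc_fraction(fragment))
```

Passing the full sequence, with all lines concatenated, through the same two functions gives the length and GC content of the whole gene.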
 
Protein sequence
MAQRIFTLIL LLCSTSVFAG LFDAPGRSQF VPADQAFAFD FQQKQHDLNL TWQIKDGYYL 
YRKQIRITPE HAKIADVQLP QGVWHEDEFY GKSEIYRDRL TLPVTINQAS AGATLTVTYQ
GCADAGFCYP PETKTVPLSE VVANNAASQP VSVPQQEQHT AQLPFSALWA LLIGIGIAFT
PCVLPMYPLI SGIVLGGKQR LSTARALLLT FIYVQGMALT YTALGLVVAA AGLQFQAALQ
HPYVLIGLAI VFTLLAMSMF GLFTLQLPSS LQTRLTLMSN RQQGGSPGGV FVMGAIAGLI
CSPCTTAPLS AILLYIAQSG NMWLGGGTLY LYALGMGLPL MLITVFGNRL LPKSGPWMEQ
VKTAFGFVIL ALPVFLLERV IGDIWGLRLW SALGVAFFGW AFITSLQAKR GWMRVVQIIL
LAAALVSVRP LQDWAFGATH TVQTQTHLNF TQIKTVDELN QALVEAKGKP VMLDLYADWC
VACKEFEKYT FSDPQVQKAL ADTVLLQANV TANDAQDVAL LKHLNVLGLP TILFFDGQGQ
EHPQERVTGF MDAETFSAHL RDRQP
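The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand; it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which do not matter here because this gene starts with ATG). The function and variable names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, as displayed in the record.
first_line = "ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA"
last_line = "CGCGATCGCC AACCGTGA"
print(translate(first_line) == "MAQRIFTLILLLCSTSVFAG")
print(translate(last_line) == "RDRQP")
```

The two comparisons use the first 20 and last 5 residues of the protein sequence listed above; translating the full gene sequence in the same way allows a check of all 565 residues.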