Gene EcHS_A4377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4377 
SymboldipZ 
ID5591884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4387368 
End bp4389065 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content54% 
IMG OID640923475 
Productthiol:disulfide interchange protein precursor 
Protein accessionYP_001460919 
Protein GI157163601 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.0111987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA 
TTATTCGACG CGCCGGGACG TTCACAATTT GTCCCCGCGG ATCAAGCCTT TACTTTTGAT
TTTCAGCAAA ACCAACATGA CCTTAATCTG ACCTGGCAGA TCAAAGACGG TTACTACCTC
TACCGTAAAC AGATCCGCAT TACGCCGGAA CACGCGAAAA TTGCCGACGT GCAGCTGCCG
CAAGGCGTCT GGCATGAAGA TGAGTTTTAC GGCAAAAGCG AGATTTACCG CGATCGGCTG
ACGCTTCCCG TAACCATCAA CCAGGCGAGT GCGGGAGCGA CGTTAACTGT CACCTACCAG
GGCTGTGCTG ATGCCGGTTT CTGTTATCCG CCAGAAACCA AAACCGTTCC GTTAAGCGAA
GTGGTCGCCA ACAACGCAGC GTCACAGCCT GTGTCTGTTT CGCAGCAAGA GCAGCACACC
GCGCAATTGC CCTTTTCCGC GCTCTGGGCG TTGTTGATCG GTATTGGTAT CGCCTTTACG
CCATGCGTGC TGCCAATGTA CCCACTGATT TCTGGCATCG TGCTGGGTGG TAAACAGCGG
CTCTCCACTG CCAGAGCATT GTTGCTGACC TTTATTTATG TGCAGGGGAT GGCACTGACT
TACACGGCGC TGGGTCTGGT AGTTGCCGCC GCAGGGTTAC AGTTCCAGGC GGCGCTACAG
CACCCATACG TGCTCATTGG CCTCGCCATC GTCTTTACCT TGCTGGCGAT GTCAATGTTT
GGCTTGTTTA CTCTACAACT CCCCTCTTCG CTGCAAACAC GTCTCACGTT GATGAGCAAT
CGCCAACAGG GCGGCTCACC TGGCGGTGTG TTTGTTATGG GGGCGATTGC CGGACTGATC
TGTTCACCAT GCACCACCGC ACCGCTTAGC GCGATTCTGC TGTATATCGC CCAAAGCGGG
AACATGTGGC TGGGCGGCGG CACGCTTTAT CTCTATGCGT TGGGCATGGG CCTGCCGCTG
ATGCTAATTA CCGTCTTTGG TAACCGCTTG CTGCCGAAAA GCGGCCCGTG GATGGAACAA
GTCAAAACCG CGTTTGGTTT TGTGATCCTC GCACTGCCGG TCTTCCTGCT GGAGCGAGTG
ATTGGTGATG TATGGGGATT ACGCTTGTGG TCGGCGCTGG GTGTCGCATT CTTTGGCTGG
GCCTTTATCA CCAGCCTACA GGCTAAACGC GGCTGGATGC GCGTGGTGCA AATAATCCTG
CTGGCAGCGG CATTGGTTAG CGTGCGCCCA CTTCAGGATT GGGCATTTGG TGCGACGCAT
ACCGCGCAAA CTCAGACGCA TCTCAACTTT ACACAAATCA AAACGGTAGA TGAGTTAAAT
CAGGCGCTCG TTGAAGCCAA AGGCAAACCG GTGATGTTAG ATCTTTATGC CGACTGGTGC
GTCGCCTGTA AAGAGTTTGA GAAATACACC TTCAGCGACC CGCAGGTGCA AAAAGCGTTA
GCAGACACGG TCTTACTTCA GGCCAACGTC ACGGCCAACG ACGCACAAGA TGTGGCGCTG
TTAAAGCATC TTAATGTCCT TGGCCTACCG ACAATTCTCT TTTTTGACGG ACAAGGCCAG
GAGCATCCAC AAGCACGCGT CACGGGCTTT ATGGATGCTG AAACCTTCAG CGCACATTTG
CGCGATCGCC AACCGTGA
 
Protein sequence
MAQRIFTLIL LLCSTSVFAG LFDAPGRSQF VPADQAFTFD FQQNQHDLNL TWQIKDGYYL 
YRKQIRITPE HAKIADVQLP QGVWHEDEFY GKSEIYRDRL TLPVTINQAS AGATLTVTYQ
GCADAGFCYP PETKTVPLSE VVANNAASQP VSVSQQEQHT AQLPFSALWA LLIGIGIAFT
PCVLPMYPLI SGIVLGGKQR LSTARALLLT FIYVQGMALT YTALGLVVAA AGLQFQAALQ
HPYVLIGLAI VFTLLAMSMF GLFTLQLPSS LQTRLTLMSN RQQGGSPGGV FVMGAIAGLI
CSPCTTAPLS AILLYIAQSG NMWLGGGTLY LYALGMGLPL MLITVFGNRL LPKSGPWMEQ
VKTAFGFVIL ALPVFLLERV IGDVWGLRLW SALGVAFFGW AFITSLQAKR GWMRVVQIIL
LAAALVSVRP LQDWAFGATH TAQTQTHLNF TQIKTVDELN QALVEAKGKP VMLDLYADWC
VACKEFEKYT FSDPQVQKAL ADTVLLQANV TANDAQDVAL LKHLNVLGLP TILFFDGQGQ
EHPQARVTGF MDAETFSAHL RDRQP