Gene EcolC_3876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3876 
SymboldipZ 
ID6066324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4232032 
End bp4233729 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content54% 
IMG OID641603291 
Productthiol:disulfide interchange protein precursor 
Protein accessionYP_001726807 
Protein GI170021853 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAAC GCATCTTTAC GCTGATCCTG CTACTTTGCA GCACTTCCGT TTTTGCCGGA 
TTATTCGACG CGCCGGGACG TTCACAATTT GTCCCCGCGG ATCAAGCCTT TGCTTTTGAT
TTTCAGCAAA ACCAACATGA CCTTAATCTG ACCTGGCAGA TCAAAGACGG TTACTACCTC
TACCGTAAAC AGATCCGCAT TACGCCGGAA CACGCGAAAA TTGCCGACGT GCAGCTGCCG
CAAGGCGTCT GGCATGAAGA TGAGTTTTAC GGCAAAAGCG AGATTTACCG CGATCGGCTG
ACGCTTCCCG TCACCATCAA CCAGGCGAGT GCGGGAGCGA CGTTAACTGT CACCTACCAG
GGCTGTGCTG ATGCCGGTTT CTGTTATCCG CCAGAAACCA AAACCGTTCC GTTAAGCGAA
GTGGTCGCCA ACAACGCAGC GCCACAGCCT GTGTCTGTTC CGCAGCAAGA GCAGCCCACC
GCGCAATTGC CCTTTTCCGC GCTCTGGGCG TTGTTGATCG GTATTGGTAT CGCCTTTACG
CCATGCGTGC TGCCAATGTA CCCACTGATT TCTGGCATCG TGCTGGGTGG TAAACAGCGG
CTCTCCACTG CCAGAGCATT GTTGCTGACC TTTATTTATG TGCAGGGGAT GGCGCTGACC
TACACGGCGC TGGGTCTGGT GGTTGCCGCC GCAGGGTTAC AGTTCCAGGC GGCGCTACAG
CACCCATACG TGCTCATTGG CCTCGCCATC GTCTTTACCT TGCTGGCGAT GTCAATGTTT
GGCTTGTTTA CCCTGCAACT CCCCTCTTCG CTGCAAACAC GTCTCACGTT GATGAGCAAT
CGCCAACAGG GCGGCTCACC TGGCGGTGTG TTTGTTATGG GGGCGATTGC CGGACTGATC
TGTTCACCAT GCACCACCGC ACCGCTTAGC GCGATTCTGC TGTATATCGC CCAAAGCGGG
AACATGTGGC TGGGCGGCGG CACGCTTTAT CTCTATGCGT TGGGCATGGG CCTGCCGCTG
ATGCTAATTA CCGTCTTTGG TAACCGCTTG CTGCCGAAAA GCGGCCCGTG GATGGAACAA
GTCAAAACCG CGTTTGGTTT TGTGATCCTC GCACTGCCGG TCTTCCTGCT GGAGCGAGTG
ATTGGTGATG TATGGGGATT ACGCTTGTGG TCGGCGCTGG GTGTCGCATT CTTTGGCTGG
GCCTTTATCA CCAGCCTACA GGCTAAACGC GGCTGGATGC GTATTGTGCA AATTATTCTG
CTGGCAGCGG CATTGGTTAG CGTGCGCCCA CTTCAGGATT GGGCATTTGG TGCGACGCAT
ACCGCGCAAA CTCAGACGCA TCTCAACTTT ACACAAATCA AAACGGTAGA TGAGTTAAAT
CAGGCGCTCG TTGAAGCCAA AGGCAAACCG GTGATGTTAG ATCTTTATGC CGACTGGTGC
GTCGCCTGTA AAGAGTTTGA GAAATACACC TTCAGCGACC CGCAGGTGCA AAAAGCGTTA
GCAGACACGG TCTTACTTCA GGCCAACGTC ACGGCCAACG ACGCACAAGA TGTGGCGCTG
TTAAAGCATC TTAATGTCCT TGGCCTACCG ACAATTCTCT TTTTTGACGG ACAAGGCCAG
GAGCATCCAC AAGCACGCGT CACGGGCTTT ATGGATGCTG AAACCTTCAG CGCACATTTG
CGCGATCGCC AACCGTGA
 
Protein sequence
MAQRIFTLIL LLCSTSVFAG LFDAPGRSQF VPADQAFAFD FQQNQHDLNL TWQIKDGYYL 
YRKQIRITPE HAKIADVQLP QGVWHEDEFY GKSEIYRDRL TLPVTINQAS AGATLTVTYQ
GCADAGFCYP PETKTVPLSE VVANNAAPQP VSVPQQEQPT AQLPFSALWA LLIGIGIAFT
PCVLPMYPLI SGIVLGGKQR LSTARALLLT FIYVQGMALT YTALGLVVAA AGLQFQAALQ
HPYVLIGLAI VFTLLAMSMF GLFTLQLPSS LQTRLTLMSN RQQGGSPGGV FVMGAIAGLI
CSPCTTAPLS AILLYIAQSG NMWLGGGTLY LYALGMGLPL MLITVFGNRL LPKSGPWMEQ
VKTAFGFVIL ALPVFLLERV IGDVWGLRLW SALGVAFFGW AFITSLQAKR GWMRIVQIIL
LAAALVSVRP LQDWAFGATH TAQTQTHLNF TQIKTVDELN QALVEAKGKP VMLDLYADWC
VACKEFEKYT FSDPQVQKAL ADTVLLQANV TANDAQDVAL LKHLNVLGLP TILFFDGQGQ
EHPQARVTGF MDAETFSAHL RDRQP