Gene EcHS_A3919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3919 
Symbol 
ID5592653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3912247 
End bp3913893 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content54% 
IMG OID640923027 
Productputative inner membrane protein translocase component YidC 
Protein accessionYP_001460504 
Protein GI157163186 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0706] Preprotein translocase subunit YidC 
TIGRFAM ID[TIGR03592] membrane protein insertase, YidC/Oxa1 family, C-terminal domain
[TIGR03593] membrane protein insertase, YidC/Oxa1 family, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.00313921 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTCGC AACGCAATCT TTTAGTCATC GCTTTGCTGT TCGTGTCTTT CATGATCTGG 
CAAGCCTGGG AGCAGGATAA AAACCCGCAA CCTCAGGCCC AACAGACCAC GCAGACAACG
ACCACCGCAG CGGGTAGCGC CGCCGACCAG GGCGTACCGG CCAGTGGCCA GGGGAAACTG
ATCTCGGTTA AGACCGACGT GCTTGATCTG ACCATCAACA CCCGTGGTGG TGATGTTGAG
CAAGCTCTGC TGCCTGCTTA CCCGAAAGAG CTGAACTCTA CCCAGCCGTT CCAGCTGCTG
GAAACTTCAC CGCAGTTTAT TTATCAGGCA CAGAGCGGTC TGACCGGTCG TGATGGCCCG
GATAACCCGG CTAACGGCCC GCGTCCGCTG TATAACGTTG AAAAAGACGC TTATGTGCTG
GCTGAAGGTC AAAACGAACT GCAGGTGCCG ATGACGTATA CCGACGCGGC AGGCAACACG
TTTACCAAAA CGTTTGTCCT GAAACGTGGT GATTACGCTG TCAACGTCAA CTACAACGTG
CAGAACGCTG GCGAGAAACC GCTGGAAATC TCCACCTTTG GTCAGTTGAA GCAATCCATC
ACTCTGCCAC CGCATCTCGA TACCGGAAGC AGCAACTTCG CACTGCACAC CTTCCGCGGC
GCGGCGTACT CCACGCCTGA CGAGAAGTAC GAGAAATACA AGTTCGATAC CATTGCCGAT
AACGAAAACC TGAACATCTC TTCGAAAGGT GGTTGGGTGG CAATGCTGCA ACAGTATTTC
GCGACGGCGT GGATCCCGCA TAACGACGGT ACCAACAACT TCTATACCGC TAATCTGGGT
AACGGCATCG CCGCTATCGG CTATAAATCT CAGCCGGTAC TGGTTCAGCC TGGTCAGACT
GGCGCGATGA ACAGCACCCT GTGGGTTGGC CCGGAAATCC AGGACAAAAT GGCAGCTGTT
GCTCCGCACC TGGATCTGAC CGTTGATTAC GGTTGGTTGT GGTTCATCTC TCAGCCGCTG
TTCAAACTGC TGAAATGGAT CCATAGCTTT GTGGGTAACT GGGGCTTCTC CATTATCATC
ATCACCTTTA TCGTTCGTGG CATCATGTAC CCGCTGACCA AAGCGCAGTA CACCTCCATG
GCGAAGATGC GTATGCTGCA GCCGAAGATT CAGGCAATGC GTGAGCGTCT GGGCGATGAC
AAACAGCGTA TCAGCCAGGA AATGATGGCG CTGTACAAAG CTGAGAAGGT TAACCCGCTG
GGCGGCTGCT TCCCGCTGCT GATCCAGATG CCAATCTTCC TGGCGTTGTA CTACATGCTG
ATGGGTTCCG TTGAACTGCG TCAGGCACCG TTTGCACTGT GGATCCACGA CCTGTCGGCA
CAGGACCCGT ACTACATCCT GCCGATCCTG ATGGGCGTAA CGATGTTCTT CATTCAGAAG
ATGTCGCCGA CCACTGTGAC CGACCCGATG CAGCAGAAGA TCATGACCTT TATGCCGGTC
ATCTTCACCG TGTTCTTCCT GTGGTTCCCG TCAGGTCTGG TGCTGTACTA TATCGTCAGC
AACCTGGTAA CCATTATTCA GCAGCAGCTG ATTTACCGTG GTCTGGAAAA ACGTGGCCTG
CATAGCCGCG AGAAGAAAAA ATCCTGA
 
Protein sequence
MDSQRNLLVI ALLFVSFMIW QAWEQDKNPQ PQAQQTTQTT TTAAGSAADQ GVPASGQGKL 
ISVKTDVLDL TINTRGGDVE QALLPAYPKE LNSTQPFQLL ETSPQFIYQA QSGLTGRDGP
DNPANGPRPL YNVEKDAYVL AEGQNELQVP MTYTDAAGNT FTKTFVLKRG DYAVNVNYNV
QNAGEKPLEI STFGQLKQSI TLPPHLDTGS SNFALHTFRG AAYSTPDEKY EKYKFDTIAD
NENLNISSKG GWVAMLQQYF ATAWIPHNDG TNNFYTANLG NGIAAIGYKS QPVLVQPGQT
GAMNSTLWVG PEIQDKMAAV APHLDLTVDY GWLWFISQPL FKLLKWIHSF VGNWGFSIII
ITFIVRGIMY PLTKAQYTSM AKMRMLQPKI QAMRERLGDD KQRISQEMMA LYKAEKVNPL
GGCFPLLIQM PIFLALYYML MGSVELRQAP FALWIHDLSA QDPYYILPIL MGVTMFFIQK
MSPTTVTDPM QQKIMTFMPV IFTVFFLWFP SGLVLYYIVS NLVTIIQQQL IYRGLEKRGL
HSREKKKS