Gene ECD_04201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04201 
SymbolyjiJ 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4470776 
End bp4471954 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content54% 
IMG OID 
Productpredicted inner membrane protein 
Protein accessionACT45982 
Protein GI253980312 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000208654 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCGT CCACGCATCC CGTAGAACGC TTTTCTTTCA GCACCGCGCT TTTCGGGATG 
CTGGTTCTGA CCTTAGGTAT GGGTTTAGGC CGCTTCCTTT ATACGCCTAT GTTGCCCGTC
ATGATGGCGG AAGGATCGTT TTCATTTAGC CAGCTCTCGT GGATTGCCAG CGGCAACTAT
GCGGGGTATC TGGCTGGCAG TCTGCTATTT TCTTTTGGCG CATTTCACCA GCCATCGCGC
CTGCGCCCAT TTCTGTTGGC TTCTGCCCTG GCGAGCGGGT TGTTGATCCT CGCAATGGCA
TGGTTGCCGC CATTTATTCT GGTGTTATTG ATTCGCGTCC TGGCGGGTGT CGCCAGCGCC
GGTATGCTGA TTTTTGGTTC GACGCTGATT ATGCAGCACA CGCGCCATCC TTTTGTGCTG
GCAGCTTTGT TTTCTGGCGT TGGCATTGGC ATCGCACTGG GCAATGAATA TGTTCTGGCA
GGCCTGCATT TTGACCTCTC CTCGCAAACG TTATGGCAAG GCGCCGGGGC GCTTTCTGGC
ATGATGCTGA TTGCACTTAC GCTTTTAATG CCCTCGAAAA AACACGCCAT CACACCAATG
CCATTGGCAA AAACGGAGCA ACAGATAATG AGCTGGTGGT TACTGGCTAT TCTGTACGGC
CTGGCGGGAT TTGGTTATAT CATCGTCGCG ACTTACCTGC CGCTCATGGC GAAAGATGCA
GGCTCACCAT TGTTAACCGC CCATCTCTGG ACGTTAGTCG GCTTATCAAT TGTGCCTGGT
TGCTTTGGCT GGCTATGGGC AGCAAAACGT TGGGGGGCTT TGCCCTGCCT GACGGCGAAT
TTGCTGGTGC AGGCTATCTG TGTGCTGCTT ACTCTCGCCA GCGACTCGCC TCTCTTGCTT
ATCATCAGCA GTCTTGGTTT TGGCGGTACC TTTATGGGCA CGACATCTCT GGTGATGACT
ATCGCCCGCC AACTGAGCGT ACCGGGAAAT CTCAATCTTT TAGGCTTTGT GACGCTCATT
TACGGCATTG GGCAAATCCT CGGCCCGGCG CTGACCAGTA TGCTCAGCAA TGGAACATCG
GCGCTTGCCA GCGCCACTCT CTGCGGCGCG GCGGCGTTGT TTATTGCTGC GCTTATCAGC
ACAGTGCAAT TATTTAAGCT ACAAGTGGTC ACTTCATGA
 
Protein sequence
MPSSTHPVER FSFSTALFGM LVLTLGMGLG RFLYTPMLPV MMAEGSFSFS QLSWIASGNY 
AGYLAGSLLF SFGAFHQPSR LRPFLLASAL ASGLLILAMA WLPPFILVLL IRVLAGVASA
GMLIFGSTLI MQHTRHPFVL AALFSGVGIG IALGNEYVLA GLHFDLSSQT LWQGAGALSG
MMLIALTLLM PSKKHAITPM PLAKTEQQIM SWWLLAILYG LAGFGYIIVA TYLPLMAKDA
GSPLLTAHLW TLVGLSIVPG CFGWLWAAKR WGALPCLTAN LLVQAICVLL TLASDSPLLL
IISSLGFGGT FMGTTSLVMT IARQLSVPGN LNLLGFVTLI YGIGQILGPA LTSMLSNGTS
ALASATLCGA AALFIAALIS TVQLFKLQVV TS