Gene ECD_01112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_01112 
SymbolycfU 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp1178020 
End bp1179219 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content54% 
IMG OID 
Productouter membrane-specific lipoprotein transporter subunit 
Protein accessionACT43007 
Protein GI253977337 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.822017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACCAAC CTGTCGCTCT ATTTATTGGC CTGCGTTACA TGCGTGGGCG TGCAGCGGAT 
CGCTTCGGTC GTTTCGTCTC CTGGCTTTCT ACCATCGGCA TTACCCTCGG GGTGATGGCG
CTGGTCACAG TATTGTCAGT GATGAACGGC TTTGAGCGCG AGCTGCAAAA CAACATCCTT
GGCCTGATGC CACAGGCAAT TCTCTCTTCT GAGCATGGCT CTCTTAACCC GCAGCAACTC
CCGGAAACGG CAGTCAAACT GGACGGCGTT AATCGCGTCG CACCTATTAC TACCGGTGAT
GTGGTACTGC AAAGCGCGCG CAGCGTGGCG GTCGGGGTGA TGCTGGGTAT CGATCCGGCG
CAAAAAGATC CACTAACGCC GTATCTGGTC AATGTGAAAC AAACTGACCT CGAGCCGGGG
AAATATAATG TCATCCTCGG TGAACAGCTT GCCTCACAGC TAGGCGTTAA TCGCGGTGAT
CAAATCCGCG TGATGGTGCC ATCTGCCAGC CAGTTCACGC CGATGGGGCG TATTCCAAGT
CAGCGCCTGT TCAATGTGAT TGGCACTTTC GCCGCCAACA GTGAAGTCGA TGGCTATGAA
ATGCTGGTGA ATATTGAGGA TGCCTCACGC CTGATGCGTT ATCCGGCAGG CAATATTACC
GGCTGGCGTT TGTGGCTGGA TGAGCCGCTG AAAGTTGACT CTTTAAGTCA GCAAAAACTG
CCTGAAGGCA GCAAATGGCA GGACTGGCGT GACCGTAAAG GCGAGCTGTT CCAGGCCGTA
CGCATGGAAA AAAATATGAT GGGCTTACTG CTGAGCCTGA TTGTCGCCGT TGCGGCGTTT
AACATTATTA CCTCACTAGG GCTGATGGTA ATGGAGAAGC AGGGCGAAGT AGCGATCCTG
CAAACGCAAG GCTTAACTCC GCGACAAATC ATGATGGTCT TTATGGTGCA AGGGGCCAGC
GCCGGGATTA TCGGTGCGAT CCTCGGTGCG GCGCTTGGCG CACTGCTTGC CAGCCAGTTA
AATAATCTGA TGCCGATAAT CGGCGTCCTG CTTGATGGCG CGGCGCTGCC GGTGGCTATC
GAACCTTTAC AGGTCATTGT TATTGCGCTG GTGGCGATGG CTATCGCGCT GCTGTCTACG
CTTTACCCTT CATGGCGCGC TGCCGCCACA CAACCCGCTG AGGCTTTACG TTATGAATAA
 
Protein sequence
MYQPVALFIG LRYMRGRAAD RFGRFVSWLS TIGITLGVMA LVTVLSVMNG FERELQNNIL 
GLMPQAILSS EHGSLNPQQL PETAVKLDGV NRVAPITTGD VVLQSARSVA VGVMLGIDPA
QKDPLTPYLV NVKQTDLEPG KYNVILGEQL ASQLGVNRGD QIRVMVPSAS QFTPMGRIPS
QRLFNVIGTF AANSEVDGYE MLVNIEDASR LMRYPAGNIT GWRLWLDEPL KVDSLSQQKL
PEGSKWQDWR DRKGELFQAV RMEKNMMGLL LSLIVAVAAF NIITSLGLMV MEKQGEVAIL
QTQGLTPRQI MMVFMVQGAS AGIIGAILGA ALGALLASQL NNLMPIIGVL LDGAALPVAI
EPLQVIVIAL VAMAIALLST LYPSWRAAAT QPAEALRYE