Gene B21_01094 details

Gene Information

Locus tag: B21_01094
Symbol: plsX
ID: 8115235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1149597
End bp: 1150667
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 52%
IMG OID: 644847351
Product: hypothetical protein
Protein accession: YP_002998924
Protein GI: 251784620
COG category: [I] Lipid transport and metabolism
COG ID: [COG0416] Fatty acid/phospholipid biosynthesis enzyme
TIGRFAM ID: [TIGR00182] fatty acid/phospholipid synthesis protein PlsX
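
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions, although they agree with the numbers in this record. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive at both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1149597, 1150667)
print(gene_len, protein_len)  # 1071 356
```

The result matches the Gene Length (1071 bp) and Protein Length (356 aa) fields.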


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000594515
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACACGTC TAACCCTGGC GTTAGATGTC ATGGGAGGGG ATTTTGGCCC TTCCGTGACA 
GTGCCTGCAG CATTGCAGGC ACTGAATTCT AATTCGCAAC TCACTCTTCT TTTAGTCGGC
AATCCCGACG CCATCACGCC ATTACTTGCT AAAGCTGACT TTGAACAACG TTCGCGTCTG
CAGATTATTC CTGCGCAGTC AGTTATCGCC AGTGATGCCC GGCCTTCGCA AGCTATCCGC
GCCAGTCGTG GGAGTTCAAT GCGCGTGGCC CTGGAGCTGG TGAAAGAAGG TCGAGCGCAA
GCCTGTGTCA GTGCCGGTAA TACCGGGGCA CTGATGGGGC TGGCAAAATT ATTACTCAAG
CCCCTGGAGG GGATTGAGCG TCCGGCGCTG GTGACGGTAT TACCACATCA GCAAAAGGGC
AAAACGGTGG TCCTTGACTT AGGGGCCAAC GTCGATTGTG ACAGTACAAT GTTGGTGCAA
TTTGCCATTA TGGGCTCAGT TCTGGCTGAA GAGGTGGTGG AAATTCCCAA TCCTCGCGTG
GCGTTGCTCA ATATTGGTGA AGAAGAAGTA AAGGGTCTCG ACAGTATTCG GGATGCCTCA
GCGGTGCTTA AAACAATCCC TTCTATCAAT TATATCGGCT ATCTTGAAGC CAATGAGTTG
TTAACTGGCA AGACAGATGT GCTGGTTTGT GACGGCTTTA CAGGAAATGT CACATTAAAG
ACGATGGAAG GTGTTGTCAG GATGTTCCTT TCTCTGCTGA AATCTCAGGG TGAAGGGAAA
AAACGGTCGT GGTGGCTACT GTTATTAAAG CGTTGGCTAC AAAAGAGCCT GACGAGGCGA
TTCAGTCACC TCAACCCCGA CCAGTATAAC GGCGCCTGTC TGTTAGGATT GCGCGGCACG
GTGATAAAAA GTCATGGTGC AGCCAATCAG CGAGCTTTTG CGGTCGCGAT TGAACAGGCA
GTGCAGGCGG TGCAGCGACA AGTTCCTCAG CGAATTGCCG CTCGCCTGGA ATCTGTATAC
CCAGCTGGTT TTGAGCTGCT GGACGGTGGC AAAAGCGGAA CTCTGCGGTA G
 
Protein sequence
MTRLTLALDV MGGDFGPSVT VPAALQALNS NSQLTLLLVG NPDAITPLLA KADFEQRSRL 
QIIPAQSVIA SDARPSQAIR ASRGSSMRVA LELVKEGRAQ ACVSAGNTGA LMGLAKLLLK
PLEGIERPAL VTVLPHQQKG KTVVLDLGAN VDCDSTMLVQ FAIMGSVLAE EVVEIPNPRV
ALLNIGEEEV KGLDSIRDAS AVLKTIPSIN YIGYLEANEL LTGKTDVLVC DGFTGNVTLK
TMEGVVRMFL SLLKSQGEGK KRSWWLLLLK RWLQKSLTRR FSHLNPDQYN GACLLGLRGT
VIKSHGAANQ RAFAVAIEQA VQAVQRQVPQ RIAARLESVY PAGFELLDGG KSGTLR
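
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which certain alternative start codons are read as methionine at the first position. The following is a minimal sketch of that translation using only the Python standard library. The alternative start codon set below is assumed to match the published table 11, and `translate_cds` is a hypothetical helper, not part of any database tool.

```python
import itertools

# Codon table in T, C, A, G order. Table 11 is assumed to assign the same
# amino acids as the standard code and to differ only in its start codons.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

# Assumed start codons for table 11; each is read as M at position 1 only.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "TTGACACGTC TAACCCTGGC GTTAGATGTC ATGGGAGGGG ATTTTGGCCC TTCCGTGACA"
print(translate_cds(first_line))  # MTRLTLALDVMGGDFGPSVT
```

The output matches the first 20 residues of the protein sequence above. The same function can be given the full gene sequence to compare against the whole protein.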