Gene B21_01050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01050 
SymbolymdC 
ID8116466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1108330 
End bp1109751 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content51% 
IMG OID644847310 
Producthypothetical protein 
Protein accessionYP_002998883 
Protein GI251784579 
COG category[I] Lipid transport and metabolism 
COG ID[COG1502] Phosphatidylserine/phosphatidylglycerophosphate/cardiolipin synthases and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.795141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCCCGGC TGGCGAGCGC GGTGCTGCCA CTGTGTTCGC AACATCCCGG TCAGTGTGGC 
CTTTTTCCTC TGGAGAAAAG TCTGGATGCG TTTGCCGCCC GGTATCGTCT GGCCGAAATG
GCAGAGCATA CGCTCGATGT TCAGTATTAC ATCTGGCAGG ACGATATGTC GGGTCGGTTA
CTGTTTTCCG CCCTGTTAGC CGCAGCAAAG CGTGGCGTTC GCGTCCGTTT GTTGCTGGAC
GACAACAATA CGCCCGGACT TGACGACATT TTACGCTTGC TTGACAGTCA TCCACGCATT
GAAGTCCGGC TTTTTAATCC TTTCTCGTTT CGCTTGCTGC GTCCGCTTGG TTATATCACC
GACTTTTCCC GTCTTAATCG CCGTATGCAC AATAAAAGTT TCACTGTCGA TGGCGTGGTG
ACCCTGGTGG GAGGACGAAA TATTGGTGAT GCCTATTTTG GAGCAGGGGA GGAGCCACTT
TTTTCGGATT TAGATGTCAT GGCAATAGGA CCCGTGGTAG AGGACGTTGC CGATGATTTC
GCCCGCTACT GGTATTGCAA ATCGGTTTCA CCCTTACAGC AGGTGCTGGA TGTCCCGGAG
GGTGAAATGG CGGATCGCAT CGAGTTACCC GCCTCCTGGC ATAACGATGC CATGACGCAT
CGTTATTTAC GCAAAATGGA ATCCAGTCCA TTTATAAATC ATCTGGTTGA TGGAACATTG
CCGCTTATCT GGGCGAAGAC ACGTTTATTA AGTGATGATC CGGCGAAAGG GGAGGGCAAG
GCAAAACGGC ATTCACTGTT ACCGCAGCGC CTGTTCGATA TCATGGGCTC ACCCAGTGAA
CGCATCGATA TTATCTCTTC CTATTTTGTA CCGACACGCG CAGGTGTGGC GCAACTCTTA
CGGATGGTGA GAAAAGGGGT AAAGATTGCG ATCCTAACCA ATTCTCTTGC CGCTAACGAT
GTTGCTGTCG TCCATGCCGG ATACGCGCGC TGGCGCAAAA AATTGCTCCG CTATGGCGTG
GAATTATATG AACTCAAGCC GACGCGTGAA CAAAGTAGTA CGTTACACGA TCGCGGCATA
ACCGGTAATT CCGGAGCCAG CCTGCATGCT AAAACCTTTA GCATCGATGG TAAAACGGTG
TTTATCGGCT CTTTCAATTT CGATCCGCGT TCAACATTGC TCAATACTGA AATGGGCTTC
GTGATAGAGA GCGAAACGCT GGCACAGTTA ATTGATAAAC GCTTTATTCA GAGCCAGTAT
GATGCGGCCT GGCAGCTCCG TCTGGACAGG TGGGGACGGA TCAACTGGGT TGATCGTCAT
GCAAAGAAAG AGATTATTCT CAAAAAAGAA CCCGCCACCA GTTTCTGGAA GCGGGTTATG
GTCAGACTGG CGTCGATATT GCCCGTGGAA TGGTTATTGT AA
 
Protein sequence
MPRLASAVLP LCSQHPGQCG LFPLEKSLDA FAARYRLAEM AEHTLDVQYY IWQDDMSGRL 
LFSALLAAAK RGVRVRLLLD DNNTPGLDDI LRLLDSHPRI EVRLFNPFSF RLLRPLGYIT
DFSRLNRRMH NKSFTVDGVV TLVGGRNIGD AYFGAGEEPL FSDLDVMAIG PVVEDVADDF
ARYWYCKSVS PLQQVLDVPE GEMADRIELP ASWHNDAMTH RYLRKMESSP FINHLVDGTL
PLIWAKTRLL SDDPAKGEGK AKRHSLLPQR LFDIMGSPSE RIDIISSYFV PTRAGVAQLL
RMVRKGVKIA ILTNSLAAND VAVVHAGYAR WRKKLLRYGV ELYELKPTRE QSSTLHDRGI
TGNSGASLHA KTFSIDGKTV FIGSFNFDPR STLLNTEMGF VIESETLAQL IDKRFIQSQY
DAAWQLRLDR WGRINWVDRH AKKEIILKKE PATSFWKRVM VRLASILPVE WLL