Gene B21_02748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02748 
SymbolyggW 
ID8116411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2927330 
End bp2928466 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID644848939 
Producthypothetical protein 
Protein accessionYP_003000512 
Protein GI251786208 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAAAT TACCGCCGCT GAGTCTCTAC ATTCACATCC CGTGGTGCGT GCAGAAATGC 
CCGTACTGCG ATTTCAACTC TCACGCGTTG AAAGGAGAAG TGCCGCACGA CGATTATGTT
CAGCATCTGC TTAACGATCT GGACAACGAT GTGGCTTACG CTCAGGGCCG TGAAGTGCAG
ACAATCTTTA TTGGCGGTGG TACGCCGAGC CTGCTTTCCG GCCCGGCGAT GCAAACGCTG
CTGGACGGCG TGCGTGCGCG TTTGCCGCTG ACAGCGGATG CAGAAATTAC TATGGAAGCG
AACCCTGGTA CGGTAGAAGC CGATCGCTTT GTCGATTATC AGCGTGCTGG CGTGAACCGC
ATCTCTATTG GCGTACAGAG TTTTAGCGAA GAAAAGCTGA AACGACTTGG GCGCATTCAT
GGCCCGCAAG AAGCGAAACG AGCTGCGAAT CTGGCAAGCG GGCTGGGGCT GCGTAGTTTT
AACCTTGATT TGATGCATGG GCTGCCGGAT CAATCACTGG AAGAGGCGCT TGGCGATCTG
CGCCAGGCCA TTGAACTGAA TCCGCCGCAT CTTTCCTGGT ATCAACTGAC CATCGAACCT
AATACGCTGT TTGGCTCGCG CCCTCCTGTA CTGCCGGACG ATGACGCGCT GTGGGATATT
TTCGAACAGG GGCATCAGTT ATTAACCGCA GCGGGTTATC AGCAATATGA AACTTCCGCT
TACGCCAAAC CCGGTTATCA GTGCCAGCAC AATCTCAACT ACTGGCGCTT TGGTGACTAC
ATTGGTATTG GCTGCGGCGC GCATGGCAAA GTGACCTTCC CGGATGGGCG CATTCTGCGT
ACCACCAAAA CGCGTCATCC GCGTGGTTTT ATGCAGGGGC GGTATCTGGA AAGCCAGCGT
GATGTCGAAG CCGCAGATAA GCCGTTTGAG TTCTTTATGA ATCGCTTCCG TTTGCTGGAA
GCCGCGCCGC GCGTGGAGTT TAGCCAGTAT ACTGGCCTTT CAGAAGAGGT TATTCGCCCT
CAGTTAGACG AGGCTATTGC TCAGGGTTAT CTCACAGAAT GTGCGGATTA CTGGCAGATA
ACGGAACATG GGAAGTTGTT TTTAAATTCG CTGCTGGAGC TTTTTCTGGC TGAGTAA
 
Protein sequence
MVKLPPLSLY IHIPWCVQKC PYCDFNSHAL KGEVPHDDYV QHLLNDLDND VAYAQGREVQ 
TIFIGGGTPS LLSGPAMQTL LDGVRARLPL TADAEITMEA NPGTVEADRF VDYQRAGVNR
ISIGVQSFSE EKLKRLGRIH GPQEAKRAAN LASGLGLRSF NLDLMHGLPD QSLEEALGDL
RQAIELNPPH LSWYQLTIEP NTLFGSRPPV LPDDDALWDI FEQGHQLLTA AGYQQYETSA
YAKPGYQCQH NLNYWRFGDY IGIGCGAHGK VTFPDGRILR TTKTRHPRGF MQGRYLESQR
DVEAADKPFE FFMNRFRLLE AAPRVEFSQY TGLSEEVIRP QLDEAIAQGY LTECADYWQI
TEHGKLFLNS LLELFLAE