Gene SbBS512_E4595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4595 
SymbolnrfE 
ID6268938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4291451 
End bp4293133 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content57% 
IMG OID641728371 
Productheme lyase subunit NrfE 
Protein accessionYP_001882769 
Protein GI187731032 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTGTTAA GTCTCGGGGT CAACGTGTTG ACCCCGTTGA CGGCCTTTGC GGGAGTGCGG 
TTGCGCTGGC CTGCCATGAT GCGACTCACT TGCATCGGCA TTCTGGCGCA GTTCGCGCTC
CTGCTGCTCG CCTTTGGCGT ACTGACGTAT TGTTTTCTCA TCAGCGATTT CTCGGTCATT
TATGTCGCCC AACATAGCTA CAGCCTGCTG TCGTGGGAAC TCAAACTGGC GGCGGTGTGG
GGCGGTCATG AAGGTTCGCT GCTGCTTTGG GTGCTGCTGC TTTCCGCCTG GAGCGCGCTG
TTTGCCTGGC ATTATCGGCA GCAAACCGAT CCGCTATTTC CGCTGACGCT AGCCGTTTTA
TCTCTCATGC TCGCCGCACT GCTACTGTTT GTGGTGCTGT GGTCCGATCC CTTCGTGCGG
ATATTTCCAC CAGCAATCGA AGGCCGCGAT CTCAATCCGA TGCTGCAACA TCCCGGTCTT
ATCTTTCATC CACCGCTGCT TTACCTTGGC TATGGCGGTT TGATGGTAGC GGCGAGCGTG
GCGCTGGCGA GTCTACTGCG CGGCGAGTTT GATGCGGCCA GCGCCCGAAT TTGCTGGCGC
TGGGCGTTAC CTGGCTGGAG TGCATTAACG GCGGGGATCA TCCTCGGTTC CTGGTGGGCC
TACTGCGAAC TCGGCTGGGG CGGCTGGTGG TTCTGGGATC CGGTGGAAAA TGCCTCTTTA
TTACCCTGGC TTTCTGCCAC TGCGCTGCTG CACAGTTTAT CCCTGACACG CCAGCGGGGG
ATTTTTCGCC ACTGGTCGCT GTTACTGGCG ATAGTTACTC TGATGCTGTC GCTGCTGGGC
ACCTTAATTG TCCGTTCTGG CATTCTGGTT TCGGTTCATG CGTTCGCGCT GGATAACGTC
CGCGCCGTGC CGTTGTTCAG CCTGTTTGCA CTGATTAGCC TTGCGTCTCT GGCTCTGTAT
GGCTGGCGAG CGCGGGACGG TGGCCCGGCG GTGCGTTTTT CGGGGTTATC GCGGGAAATG
TTAATCCTCG CTACGCTGTT GCTGTTTTGC GCAGTGCTAC TGATCGTGCT GGTGGGAACG
CTTTATCCGA TGATTTACGG TCTGCTGGGC TGGGGACGCC TCTCCGTTGG CGCGCCGTAT
TTTAACCGCG CGACGTTACC GTTTGGTCTG TTGATGCTGG TGGTGATTGT GCTGGCGACG
TTTGTCTCTG GCAAACGCGT GCAGCTTCCG GCGCTGGTAG CTCATGCGGG CGTGCTGTTA
TTTGCCGCTG GGATCGTGGT TTCCAGCGTC AGCCGTCAGG AGATCAGCCT GAATTTACAG
CCGGGTCAGC AGGTGACGCT GGCAGGATAC ACCTTCCGTT TTGAGCGCCT CGATCTGCAA
GCCAAAGGCA ATTACACCAG CGAAAAAGCG ATAGTGGCAC TGTTTGACCA TCAGCAACGC
ATTGGTGAAC TGATGCCGGA GCGGCGTTTT TACGAAGCAC GTCGTCAGCA AATGATGGAA
CCGTCAATTC GCTGGAACGG CATCCATGAC TGGTATGCGG TCATGGGTGA AAAAACCGGA
GCGGATCGTT ACGCTTTTCG CTTGTATGTA CAAAGCGGTG TGCGCTGGAT CTGGGGGGGA
GGATTGTTGA TGATTGCGGG CGCATTGTTA AGCGGATGGC GGGGGAGGAA GCGCGATGAA
TAA
 
Protein sequence
MLLSLGVNVL TPLTAFAGVR LRWPAMMRLT CIGILAQFAL LLLAFGVLTY CFLISDFSVI 
YVAQHSYSLL SWELKLAAVW GGHEGSLLLW VLLLSAWSAL FAWHYRQQTD PLFPLTLAVL
SLMLAALLLF VVLWSDPFVR IFPPAIEGRD LNPMLQHPGL IFHPPLLYLG YGGLMVAASV
ALASLLRGEF DAASARICWR WALPGWSALT AGIILGSWWA YCELGWGGWW FWDPVENASL
LPWLSATALL HSLSLTRQRG IFRHWSLLLA IVTLMLSLLG TLIVRSGILV SVHAFALDNV
RAVPLFSLFA LISLASLALY GWRARDGGPA VRFSGLSREM LILATLLLFC AVLLIVLVGT
LYPMIYGLLG WGRLSVGAPY FNRATLPFGL LMLVVIVLAT FVSGKRVQLP ALVAHAGVLL
FAAGIVVSSV SRQEISLNLQ PGQQVTLAGY TFRFERLDLQ AKGNYTSEKA IVALFDHQQR
IGELMPERRF YEARRQQMME PSIRWNGIHD WYAVMGEKTG ADRYAFRLYV QSGVRWIWGG
GLLMIAGALL SGWRGRKRDE