Gene BCZK4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK4039 
Symbol 
ID3025659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp4147637 
End bp4148758 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content38% 
IMG OID637548253 
ProductNIF3-like protein 
Protein accessionYP_085618 
Protein GI52141211 
COG category[S] Function unknown 
COG ID[COG3323] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00486] dinuclear metal center protein, YbgI/SA1388 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0149626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TTCCAAATGG CCATGAAATT ATTTCTTTAT TTGAAAGTAT GTATCCGAAG 
CATTTGGCGA TGGAAGGAGA TAAGATTGGC CTGCAGATTG GAGCGCTTAA TAAACCCGTG
CAGCACGTAT TAATTGCGTT AGATGTAACG GAAGAAGTTG TGGATGAAGC AATTCAATTA
GGAGCGAATG TCATTATTGC GCATCATCCT TTAATTTTTA ACCCGCTAAA AGCGATTCAT
ACAGATAAGG CGTATGGGAA AATTATTGAA AAGTGTATTA AAAATGATAT TGCAATCTAT
GCAGCACATA CAAATGTGGA TATTGCTAAG GGCGGGGTAA ATGATTTACT TGCTGAGGCG
TTAGGATTGC AAAATACAGA AGTTTTGGCA CCGACATATG CAGAAGAAAT GAAAAAAATT
GTTGTGTTTG TGCCTGAAAC TCATGCAGAA GAAGTAAGAA AAGCATTAGG AGACGCAGGC
GCTGGTCATA TCGGCAATTA TAGCCACTGT ACGTTTAGTA GCGAGGGTAC AGGCACGTTT
ATACCTCAAG AGGGAACAAA TCCTTATATC GGGGAAACTG GGCAGTTAGA ACGCGTGGAA
GAAGTGCGAA TCGAAACGAT TATTCCAGCT TCATTCCAGC GAAAAGTAAT TAAAGCAATG
GTAACGGCAC ATCCATATGA AGAAGTAGCA TATGATGTGT ATCCACTTGA TAACAAAGGT
GAAACATTAG GGCTTGGAAA AATAGGATAT TTACAAGAAG AAATGACACT TGGACAATTT
GCGGAACATG TAAAGAAGTC ATTAGATGTA AAGGGTGCGC GAGTTGTTGG GAAATTAGAT
GATAAAGTGC GCAAAGTAGC TGTACTTGGT GGCGATGGTA ACAAATACAT CAATCAAGCT
AAATTTAAAG GAGCAGATGT ATATGTAACG GGGGACATGT ATTATCATGT TGCTCATGAT
GCGATGATGC TCGGTTTAAA TATAGTTGAC CCAGGACATA ACGTTGAAAA GGTAATGAAG
CAAGGTGTAC AAAAGCAATT ACAAGAAAAA GTGGATGCAA AGAAACTTAA TGTAAACATT
CATGCTTCGC AGTTACATAC AGATCCATTT ACATTTGTAT AA
 
Protein sequence
MSKIPNGHEI ISLFESMYPK HLAMEGDKIG LQIGALNKPV QHVLIALDVT EEVVDEAIQL 
GANVIIAHHP LIFNPLKAIH TDKAYGKIIE KCIKNDIAIY AAHTNVDIAK GGVNDLLAEA
LGLQNTEVLA PTYAEEMKKI VVFVPETHAE EVRKALGDAG AGHIGNYSHC TFSSEGTGTF
IPQEGTNPYI GETGQLERVE EVRIETIIPA SFQRKVIKAM VTAHPYEEVA YDVYPLDNKG
ETLGLGKIGY LQEEMTLGQF AEHVKKSLDV KGARVVGKLD DKVRKVAVLG GDGNKYINQA
KFKGADVYVT GDMYYHVAHD AMMLGLNIVD PGHNVEKVMK QGVQKQLQEK VDAKKLNVNI
HASQLHTDPF TFV