Gene Arth_0668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0668 
Symbol 
ID4446849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp713010 
End bp714170 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID639688468 
Productglobin 
Protein accessionYP_830167 
Protein GI116669234 
COG category[C] Energy production and conversion 
COG ID[COG1017] Hemoglobin-like flavoprotein
[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.513137 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCTCGG ACAAGTCCTT CCCCGTCATC GAGGCCACCC TCCCCCTGGT CGGTTCCCGG 
ATCGGTGAAA TCACCCCCAA GTTTTACGCC CGCCTCTTCG CAGCACACCC GGAACTACTG
GACGGGCTCT TCAGCCGCTC CAACCAGCGC AACGGCAACC AGCAGCAGGC GCTGGCCGGA
AGCATCGCCG CTTTCGCCAC CCACCTGGTG AACAACCCCG GCACCCTGCC CGAGACCGTG
CTTGCGCGCA TCGCCCACCG CCACGCCTCC CTCGGCATCA CCGAACCGCA GTACCAAGTG
GTCTACGAGC ACCTCTTCGC CGCCATCGCC GAGGACCTCG CCGAGGTCAT TACCCCGGAA
ATCGCCGAAG CCTGGACCGA GGTCTACTGG CTCATGGCGG ATGCGCTGAT CAAGCTCGAA
AAGGGCCTCT ACGCCGCACA GGCCAACGGC GTGATGTGGA GCCCGTGGCG GGTCGCCGCC
AAGACTGCTG CCGGCACCGG CTCCATGACG TTCACCCTGG AACCTGCCGA CGACACCCCC
ATCACCCCCG CCCTTGCGGG ACAGTACGTG AGCGTCAAGG TCCAGCTCCC GGACGGACTG
CGCCAGGTCC GCCAGTACTC GCTGTCCGGC GAGGCCGGCA CGAGCCGGAC GTTCACCACC
AAGAAGGACG ACGGCGGCGA AGTCTCCCCC GTCCTGCACA ACAACGTCCA GGTGGGCGAC
ATCCTCGAGA TCTCCAACCC CTACGGTGAA ATCACCCTCA AGGAAGGCGA CGGGCCCGTC
GTCCTGGCCT CCGCCGGCAT TGGCTGCACA CCCACCGCCT CCATCCTGCG CTCCCTCGCT
GACTCCGGCT CGGACCGCCA GGTCCTGGTC CTGCACGCGG AAAGCGACCT GGACAGCTGG
GCCCTGCGCA GCCAGATGAC GGACGACGTC GAACGCCTAG ACGGCGCCGA CCTGCAGCTC
TGGCTCGAGC GGCCGGTCGC CGGAACCAAG GAGGGCTTCA TGTCGCTGCG CGAAGTCGAC
CTGCCGGCCA ACGCCTCGCT GTACCTGTGC GGTCCGCTGC CGTTCATGAA GCACATCCGC
AACGAGGCCA TCAACGCCGG GATCCCCGCC ACGAAGATCC ACTACGAAGT CTTCGGCCCG
GACATCTGGC TGGCTTCCTA A
 
Protein sequence
MLSDKSFPVI EATLPLVGSR IGEITPKFYA RLFAAHPELL DGLFSRSNQR NGNQQQALAG 
SIAAFATHLV NNPGTLPETV LARIAHRHAS LGITEPQYQV VYEHLFAAIA EDLAEVITPE
IAEAWTEVYW LMADALIKLE KGLYAAQANG VMWSPWRVAA KTAAGTGSMT FTLEPADDTP
ITPALAGQYV SVKVQLPDGL RQVRQYSLSG EAGTSRTFTT KKDDGGEVSP VLHNNVQVGD
ILEISNPYGE ITLKEGDGPV VLASAGIGCT PTASILRSLA DSGSDRQVLV LHAESDLDSW
ALRSQMTDDV ERLDGADLQL WLERPVAGTK EGFMSLREVD LPANASLYLC GPLPFMKHIR
NEAINAGIPA TKIHYEVFGP DIWLAS