Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0668 |
Symbol | |
ID | 4446849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 713010 |
End bp | 714170 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639688468 |
Product | globin |
Protein accession | YP_830167 |
Protein GI | 116669234 |
COG category | [C] Energy production and conversion |
COG ID | [COG1017] Hemoglobin-like flavoprotein [COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.513137 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCTCGG ACAAGTCCTT CCCCGTCATC GAGGCCACCC TCCCCCTGGT CGGTTCCCGG ATCGGTGAAA TCACCCCCAA GTTTTACGCC CGCCTCTTCG CAGCACACCC GGAACTACTG GACGGGCTCT TCAGCCGCTC CAACCAGCGC AACGGCAACC AGCAGCAGGC GCTGGCCGGA AGCATCGCCG CTTTCGCCAC CCACCTGGTG AACAACCCCG GCACCCTGCC CGAGACCGTG CTTGCGCGCA TCGCCCACCG CCACGCCTCC CTCGGCATCA CCGAACCGCA GTACCAAGTG GTCTACGAGC ACCTCTTCGC CGCCATCGCC GAGGACCTCG CCGAGGTCAT TACCCCGGAA ATCGCCGAAG CCTGGACCGA GGTCTACTGG CTCATGGCGG ATGCGCTGAT CAAGCTCGAA AAGGGCCTCT ACGCCGCACA GGCCAACGGC GTGATGTGGA GCCCGTGGCG GGTCGCCGCC AAGACTGCTG CCGGCACCGG CTCCATGACG TTCACCCTGG AACCTGCCGA CGACACCCCC ATCACCCCCG CCCTTGCGGG ACAGTACGTG AGCGTCAAGG TCCAGCTCCC GGACGGACTG CGCCAGGTCC GCCAGTACTC GCTGTCCGGC GAGGCCGGCA CGAGCCGGAC GTTCACCACC AAGAAGGACG ACGGCGGCGA AGTCTCCCCC GTCCTGCACA ACAACGTCCA GGTGGGCGAC ATCCTCGAGA TCTCCAACCC CTACGGTGAA ATCACCCTCA AGGAAGGCGA CGGGCCCGTC GTCCTGGCCT CCGCCGGCAT TGGCTGCACA CCCACCGCCT CCATCCTGCG CTCCCTCGCT GACTCCGGCT CGGACCGCCA GGTCCTGGTC CTGCACGCGG AAAGCGACCT GGACAGCTGG GCCCTGCGCA GCCAGATGAC GGACGACGTC GAACGCCTAG ACGGCGCCGA CCTGCAGCTC TGGCTCGAGC GGCCGGTCGC CGGAACCAAG GAGGGCTTCA TGTCGCTGCG CGAAGTCGAC CTGCCGGCCA ACGCCTCGCT GTACCTGTGC GGTCCGCTGC CGTTCATGAA GCACATCCGC AACGAGGCCA TCAACGCCGG GATCCCCGCC ACGAAGATCC ACTACGAAGT CTTCGGCCCG GACATCTGGC TGGCTTCCTA A
|
Protein sequence | MLSDKSFPVI EATLPLVGSR IGEITPKFYA RLFAAHPELL DGLFSRSNQR NGNQQQALAG SIAAFATHLV NNPGTLPETV LARIAHRHAS LGITEPQYQV VYEHLFAAIA EDLAEVITPE IAEAWTEVYW LMADALIKLE KGLYAAQANG VMWSPWRVAA KTAAGTGSMT FTLEPADDTP ITPALAGQYV SVKVQLPDGL RQVRQYSLSG EAGTSRTFTT KKDDGGEVSP VLHNNVQVGD ILEISNPYGE ITLKEGDGPV VLASAGIGCT PTASILRSLA DSGSDRQVLV LHAESDLDSW ALRSQMTDDV ERLDGADLQL WLERPVAGTK EGFMSLREVD LPANASLYLC GPLPFMKHIR NEAINAGIPA TKIHYEVFGP DIWLAS
|
| |