Gene Information |
Locus tag | Arth_0239 |
Symbol | |
ID | 4447295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 252325 |
End bp | 253851 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639688035 |
Product | HNH nuclease |
Protein accession | YP_829740 |
Protein GI | 116668807 |
COG category | |
COG ID | |
TIGRFAM ID | |
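The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 252325
end_bp = 253851

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon,
# which does not contribute a residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1527 508
```

Under these assumptions the result agrees with the listed Gene Length (1527 bp) and Protein Length (508 aa).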
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGGGCTT CCCTTGCTGC GTTGGCTGCG TGTGCCGGCT GCGGGGCCGG CCGCGGGGCC GGGAACGGCG CCGGGAACCT GGCCTCGTCT GGTGCTGATC CTTTGCGGGA CGAGGCGGAC GCGTGCCTGG ACGGCCTGGC CGGAGTGGGT GGCCTGGAGG CCATGCTGGC CGCGCTGAAG GTGCACTTCG CCGCCGGGTA CGTCCGCGCT GCCGCGGCCT TGACGGCCCC GGCCGTTTCC CCGCAGGAGC GCACCGCCCG AGCCATGGCG GTCACGGCGG AGGTGGCCTG TGTCCTGACG GTGAGTGAAA GATCGGCAGC GGCGTTACTG GCGGACTCAG TCATCCTGAC CACCCGGTTG CCGCTGACGC TGTCCGCGCT GCGGTCGGGG GGCATTTCGT GGCAGCATGC GCGGGTGATG TGTGATGAGA CGTCCGGTCT GGACCCGGTC GCGGCTGCTG CCCTGGAGGG GCATTTCCTG GACCCCGAAG CTCCCTTTGC TGCGCGGGGC TGCCCGGCCG GGGAGCTGGT GCCGGGCAGG TTCCGGGCCA AGGCGCGCAG CTGGCGTGAG CGGCACCATC CGGTCAGCCT CGAAGCACGC CACCGCAAGG CCGTGACGGA CCGCCGGCTG GACTATGCCC CGGACCGGGA CGGCATGGCC TGGCTCTCGG CCTATGTGAG TGCGGATGTG GCAGCGGGGG TGTGGGCCCG TGCCACGGAC GCGGCGCGTG CCCTGCAGGG CCCGGCCGAA TCCCGGACCC TGACCCAGCT CCGCGCGGAC GTCGCCGCTG ACTGGCTCAT CGCAGGCATT GCCGAAGGTG TTCCGTCGCC GAAGGCGCAG GTGTTGGTGA CCGTCCCGGT GCTGTCGCTG CTGGGCGCGG GGACGGAACC GGCCACCCTG GACGGGTATG GTCCTGTTCC GCCGTCCATG GCCCGCCGGC TTCTCGGCGA GGGCGCCGGA TCGTTCCTGC GTGTGCTGAC CGACCCGCGC AGCGGGGCGC CGCTGGAGAT CGGACGCAGC AGCTACCCGG TTCCGAAAGC GATGCGCCAA TGGCTGCGGC TTCGGGACGG CCGGTGCCCG TTTCCCGGCT GCAACAACCA CTCCCTGGAT AACGAAGCGG ACCACCTGCT GGCCTGGTCC GCAGGTGGCG GCACCGACAT CACCAACCTG GGCCAGCCAT GCCCGAAGCA CCACCGGCTC AAACACACCA CAGCATGGAC GCCGGTCGAC GCCACCCGGG ACCAGCCACC GCGCTGGATC TCACCCGCAG GACGCTCCTA CCCCAGCGAA CAACAGGACT GGGAACCACC ACACTGGCCC GACCTACCCG CCGGCGCGGA GCCCGCGGGC GAAAACCCCG CGGTGGAGGA CGACGACCCG GGATGGGTAC CGGAATGGGA ACCACCACAC TGGCCGGACC TACCCGCCGG CGCGGAGGGA AGCACCGGCG CCACCGGTCC CGGAGAGACC GGGCCACCAT TACCCGCCGA CCCCTTCCCC GACTGGGCCC TATTCATCGC GGCCTAG
Protein sequence | MGASLAALAA CAGCGAGRGA GNGAGNLASS GADPLRDEAD ACLDGLAGVG GLEAMLAALK VHFAAGYVRA AAALTAPAVS PQERTARAMA VTAEVACVLT VSERSAAALL ADSVILTTRL PLTLSALRSG GISWQHARVM CDETSGLDPV AAAALEGHFL DPEAPFAARG CPAGELVPGR FRAKARSWRE RHHPVSLEAR HRKAVTDRRL DYAPDRDGMA WLSAYVSADV AAGVWARATD AARALQGPAE SRTLTQLRAD VAADWLIAGI AEGVPSPKAQ VLVTVPVLSL LGAGTEPATL DGYGPVPPSM ARRLLGEGAG SFLRVLTDPR SGAPLEIGRS SYPVPKAMRQ WLRLRDGRCP FPGCNNHSLD NEADHLLAWS AGGGTDITNL GQPCPKHHRL KHTTAWTPVD ATRDQPPRWI SPAGRSYPSE QQDWEPPHWP DLPAGAEPAG ENPAVEDDDP GWVPEWEPPH WPDLPAGAEG STGATGPGET GPPLPADPFP DWALFIAA
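As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 bases of the gene sequence shown above. It assumes the sequence is listed in coding orientation (it begins with ATG even though the record reports the minus strand) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record or of any library.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments (it differs in which codons may act as starts).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base groups of the gene sequence in this record.
prefix = "ATGGGGGCTT CCCTTGCTGC GTTGGCTGCG"
print(translate(prefix))  # MGASLAALAA
```

The output matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to the same function should give the complete protein, provided the assumptions above hold.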