Gene Arth_0239 details

Gene Information

Locus tag: Arth_0239
Symbol:
ID: 4447295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 252325
End bp: 253851
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 72%
IMG OID: 639688035
Product: HNH nuclease
Protein accession: YP_829740
Protein GI: 116668807
COG category:
COG ID:
TIGRFAM ID:
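
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(252325, 253851)
print(length, protein_length_aa(length))  # 1527 508
```

Under these assumptions the computed values agree with the listed gene length (1527 bp) and protein length (508 aa).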


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGCTT CCCTTGCTGC GTTGGCTGCG TGTGCCGGCT GCGGGGCCGG CCGCGGGGCC 
GGGAACGGCG CCGGGAACCT GGCCTCGTCT GGTGCTGATC CTTTGCGGGA CGAGGCGGAC
GCGTGCCTGG ACGGCCTGGC CGGAGTGGGT GGCCTGGAGG CCATGCTGGC CGCGCTGAAG
GTGCACTTCG CCGCCGGGTA CGTCCGCGCT GCCGCGGCCT TGACGGCCCC GGCCGTTTCC
CCGCAGGAGC GCACCGCCCG AGCCATGGCG GTCACGGCGG AGGTGGCCTG TGTCCTGACG
GTGAGTGAAA GATCGGCAGC GGCGTTACTG GCGGACTCAG TCATCCTGAC CACCCGGTTG
CCGCTGACGC TGTCCGCGCT GCGGTCGGGG GGCATTTCGT GGCAGCATGC GCGGGTGATG
TGTGATGAGA CGTCCGGTCT GGACCCGGTC GCGGCTGCTG CCCTGGAGGG GCATTTCCTG
GACCCCGAAG CTCCCTTTGC TGCGCGGGGC TGCCCGGCCG GGGAGCTGGT GCCGGGCAGG
TTCCGGGCCA AGGCGCGCAG CTGGCGTGAG CGGCACCATC CGGTCAGCCT CGAAGCACGC
CACCGCAAGG CCGTGACGGA CCGCCGGCTG GACTATGCCC CGGACCGGGA CGGCATGGCC
TGGCTCTCGG CCTATGTGAG TGCGGATGTG GCAGCGGGGG TGTGGGCCCG TGCCACGGAC
GCGGCGCGTG CCCTGCAGGG CCCGGCCGAA TCCCGGACCC TGACCCAGCT CCGCGCGGAC
GTCGCCGCTG ACTGGCTCAT CGCAGGCATT GCCGAAGGTG TTCCGTCGCC GAAGGCGCAG
GTGTTGGTGA CCGTCCCGGT GCTGTCGCTG CTGGGCGCGG GGACGGAACC GGCCACCCTG
GACGGGTATG GTCCTGTTCC GCCGTCCATG GCCCGCCGGC TTCTCGGCGA GGGCGCCGGA
TCGTTCCTGC GTGTGCTGAC CGACCCGCGC AGCGGGGCGC CGCTGGAGAT CGGACGCAGC
AGCTACCCGG TTCCGAAAGC GATGCGCCAA TGGCTGCGGC TTCGGGACGG CCGGTGCCCG
TTTCCCGGCT GCAACAACCA CTCCCTGGAT AACGAAGCGG ACCACCTGCT GGCCTGGTCC
GCAGGTGGCG GCACCGACAT CACCAACCTG GGCCAGCCAT GCCCGAAGCA CCACCGGCTC
AAACACACCA CAGCATGGAC GCCGGTCGAC GCCACCCGGG ACCAGCCACC GCGCTGGATC
TCACCCGCAG GACGCTCCTA CCCCAGCGAA CAACAGGACT GGGAACCACC ACACTGGCCC
GACCTACCCG CCGGCGCGGA GCCCGCGGGC GAAAACCCCG CGGTGGAGGA CGACGACCCG
GGATGGGTAC CGGAATGGGA ACCACCACAC TGGCCGGACC TACCCGCCGG CGCGGAGGGA
AGCACCGGCG CCACCGGTCC CGGAGAGACC GGGCCACCAT TACCCGCCGA CCCCTTCCCC
GACTGGGCCC TATTCATCGC GGCCTAG
 
Protein sequence
MGASLAALAA CAGCGAGRGA GNGAGNLASS GADPLRDEAD ACLDGLAGVG GLEAMLAALK 
VHFAAGYVRA AAALTAPAVS PQERTARAMA VTAEVACVLT VSERSAAALL ADSVILTTRL
PLTLSALRSG GISWQHARVM CDETSGLDPV AAAALEGHFL DPEAPFAARG CPAGELVPGR
FRAKARSWRE RHHPVSLEAR HRKAVTDRRL DYAPDRDGMA WLSAYVSADV AAGVWARATD
AARALQGPAE SRTLTQLRAD VAADWLIAGI AEGVPSPKAQ VLVTVPVLSL LGAGTEPATL
DGYGPVPPSM ARRLLGEGAG SFLRVLTDPR SGAPLEIGRS SYPVPKAMRQ WLRLRDGRCP
FPGCNNHSLD NEADHLLAWS AGGGTDITNL GQPCPKHHRL KHTTAWTPVD ATRDQPPRWI
SPAGRSYPSE QQDWEPPHWP DLPAGAEPAG ENPAVEDDDP GWVPEWEPPH WPDLPAGAEG
STGATGPGET GPPLPADPFP DWALFIAA
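
The sequences above are printed in space-separated groups of 10 characters, 60 per line. A minimal sketch of how such a block might be cleaned up and its GC fraction computed is shown below. The helper names are hypothetical, and this is not necessarily how the listed GC content was derived. Only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Strip spaces and line breaks from a grouped sequence block; uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence block, copied from the record above.
first_line = "ATGGGGGCTT CCCTTGCTGC GTTGGCTGCG TGTGCCGGCT GCGGGGCCGG CCGCGGGGCC"
seq = clean_sequence(first_line)
print(len(seq))                        # 60
print(f"GC fraction: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence block through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field.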