Gene Arth_3225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3225 
Symbol 
ID4444006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3631739 
End bp3633424 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content69% 
IMG OID639691049 
ProductHNH endonuclease 
Protein accessionYP_832701 
Protein GI116671768 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.978165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGCCG TTGGAGAATT CGCCCCGAAC CAGGCCTTGC GCCTAGGGCG GATTTCCGAC 
GCGCCTTTCG AACCGGCCGC GCCGAAGCGG ACGCTGAGGG CCGTGCCGCC GGCTGAGCTC
ACCGTCGGCC GGCCAAAGCC GTTAGCGTTC GACGGCGGCG GGTCCCGTGG TCTGGCCAGG
TGCCTTGCGC TGCTGGAGGC CATCAGTTCG ACGGCGGTTG CCGACGCTGC GGGGATGCGC
TTCCCGGAGG CTGCTGACTT CGCTGCCAGG GTGGAGGACA TCTCCCGCGC CGCTGAATAT
TTGCAGGTGG TTGCTGCCGG GGCTGTGGAC CGGTCCCGGC GGGAGGCTAT TTCTGCTGCT
GTGCGGGCTG GTGCGGGCCG TGGCGCGGCG GTTGGCTGGA CCACCGGGTG GGGCAACGAG
ACGGCTGTTC ACGGGCCGAT CGGCTGGGCG GCCGGAACGG AGGCCCAGCC CGCGGACGCC
GGAAGTGACG CGGCTGAAGG GATACCTGCG GCAACGCCGA AGACGAACGA TCCGGATCCG
GCCGATGACG GGTGCCGGAA CACTACCGGG TTCCTCCGTA TGAGGTTGCG GATCGGTGCC
GGCGAAGCCC GCCGCCGGCT CGCCCTCGCC GAAGCCGTGT TGCCCCGGGC CGGGATCACC
GGCCACCCCC AAGAACCGGA ACGGCCCGAA CTGGCAGCGG CCGTCGCATC CGGGTCCGTG
GGCTCCCGGT CCGCGACCAT CATCACCCTC GCCCTGGACC GGGTCCGCCA CCACGCCACC
GAAGACACCA CGGCCCGGAT GGAACACGCC CTGACCCGCA CCGCGGCAGA GCACGACACC
GATTTCGTGA CCCGCATCGC CCGGCAGTGG ACAGAGGCGA TCGACCAGGA CGGCAGCGAA
CCCTCCGAGG AGGAACTCCG CCACCGGCAG GGCGTGTTCA TCCGGAAACC CCGCCGCGGA
CTACAGCACG TGGAATTCTT CGCCACCCCG GACCAGTACG AACCCCTCCT GACCGTCATG
AACACCGCCA CCAACCCCCG CACCCAACCA GAAGCGGCTG AAGGCAACGA CGGAACCATT
AGCGGCGAGG GCGGCCTGGA CCGGCGGACC CGGCCCCAGC AACTCCTGGA CGGCCTCGTC
AGCGCCGCCA AAACCGCCCT CGCCACCGGA TCACTTCCGG CTGCAGGCGG TTTGCGTCCC
CAGGTCATGG TCACCATCGA CTACCGCGAC CTCCTCGACA CACTCGAACA GGGCACCCCA
GGCAGGGGCA CCGGCTCATT CACGTTCACC GGGCCCGTCA CCGCCGCCAC GGTCCGGAAG
ATCGCCTGCG ACGCCGACAT CATCCCCGTC CTGCTCGGCG GCCAGGGCCG CGTCCTGGAC
ATCGGCCGCA CCACGAGGAT CTTCCCGCCC CACATCCGCA AAGCCCTCAC CGCCCGCGAC
CAGGGCTGCG CCTTCCCGGG CTGCACCATC CCCGCCCCGT GGTGCGAAGC CCACCACACC
ACCTACTGGT CACACGGCGG AACCACCAGC ACCGACAACG GCACACTGCT CTGCTCCCAT
CACCACCACC TGATCCACAA AGAGCAGTGG CACATCCAGG TCAAAACCGG GATCCCCTGG
TTCATCCCGC CACCCCACAT CGACCCACAC CAACAACCAC GAAGGAACAG CTACTTCAGA
TGCTGA
 
Protein sequence
MEAVGEFAPN QALRLGRISD APFEPAAPKR TLRAVPPAEL TVGRPKPLAF DGGGSRGLAR 
CLALLEAISS TAVADAAGMR FPEAADFAAR VEDISRAAEY LQVVAAGAVD RSRREAISAA
VRAGAGRGAA VGWTTGWGNE TAVHGPIGWA AGTEAQPADA GSDAAEGIPA ATPKTNDPDP
ADDGCRNTTG FLRMRLRIGA GEARRRLALA EAVLPRAGIT GHPQEPERPE LAAAVASGSV
GSRSATIITL ALDRVRHHAT EDTTARMEHA LTRTAAEHDT DFVTRIARQW TEAIDQDGSE
PSEEELRHRQ GVFIRKPRRG LQHVEFFATP DQYEPLLTVM NTATNPRTQP EAAEGNDGTI
SGEGGLDRRT RPQQLLDGLV SAAKTALATG SLPAAGGLRP QVMVTIDYRD LLDTLEQGTP
GRGTGSFTFT GPVTAATVRK IACDADIIPV LLGGQGRVLD IGRTTRIFPP HIRKALTARD
QGCAFPGCTI PAPWCEAHHT TYWSHGGTTS TDNGTLLCSH HHHLIHKEQW HIQVKTGIPW
FIPPPHIDPH QQPRRNSYFR C