Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3225 |
Symbol | |
ID | 4444006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3631739 |
End bp | 3633424 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639691049 |
Product | HNH endonuclease |
Protein accession | YP_832701 |
Protein GI | 116671768 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.978165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCCG TTGGAGAATT CGCCCCGAAC CAGGCCTTGC GCCTAGGGCG GATTTCCGAC GCGCCTTTCG AACCGGCCGC GCCGAAGCGG ACGCTGAGGG CCGTGCCGCC GGCTGAGCTC ACCGTCGGCC GGCCAAAGCC GTTAGCGTTC GACGGCGGCG GGTCCCGTGG TCTGGCCAGG TGCCTTGCGC TGCTGGAGGC CATCAGTTCG ACGGCGGTTG CCGACGCTGC GGGGATGCGC TTCCCGGAGG CTGCTGACTT CGCTGCCAGG GTGGAGGACA TCTCCCGCGC CGCTGAATAT TTGCAGGTGG TTGCTGCCGG GGCTGTGGAC CGGTCCCGGC GGGAGGCTAT TTCTGCTGCT GTGCGGGCTG GTGCGGGCCG TGGCGCGGCG GTTGGCTGGA CCACCGGGTG GGGCAACGAG ACGGCTGTTC ACGGGCCGAT CGGCTGGGCG GCCGGAACGG AGGCCCAGCC CGCGGACGCC GGAAGTGACG CGGCTGAAGG GATACCTGCG GCAACGCCGA AGACGAACGA TCCGGATCCG GCCGATGACG GGTGCCGGAA CACTACCGGG TTCCTCCGTA TGAGGTTGCG GATCGGTGCC GGCGAAGCCC GCCGCCGGCT CGCCCTCGCC GAAGCCGTGT TGCCCCGGGC CGGGATCACC GGCCACCCCC AAGAACCGGA ACGGCCCGAA CTGGCAGCGG CCGTCGCATC CGGGTCCGTG GGCTCCCGGT CCGCGACCAT CATCACCCTC GCCCTGGACC GGGTCCGCCA CCACGCCACC GAAGACACCA CGGCCCGGAT GGAACACGCC CTGACCCGCA CCGCGGCAGA GCACGACACC GATTTCGTGA CCCGCATCGC CCGGCAGTGG ACAGAGGCGA TCGACCAGGA CGGCAGCGAA CCCTCCGAGG AGGAACTCCG CCACCGGCAG GGCGTGTTCA TCCGGAAACC CCGCCGCGGA CTACAGCACG TGGAATTCTT CGCCACCCCG GACCAGTACG AACCCCTCCT GACCGTCATG AACACCGCCA CCAACCCCCG CACCCAACCA GAAGCGGCTG AAGGCAACGA CGGAACCATT AGCGGCGAGG GCGGCCTGGA CCGGCGGACC CGGCCCCAGC AACTCCTGGA CGGCCTCGTC AGCGCCGCCA AAACCGCCCT CGCCACCGGA TCACTTCCGG CTGCAGGCGG TTTGCGTCCC CAGGTCATGG TCACCATCGA CTACCGCGAC CTCCTCGACA CACTCGAACA GGGCACCCCA GGCAGGGGCA CCGGCTCATT CACGTTCACC GGGCCCGTCA CCGCCGCCAC GGTCCGGAAG ATCGCCTGCG ACGCCGACAT CATCCCCGTC CTGCTCGGCG GCCAGGGCCG CGTCCTGGAC ATCGGCCGCA CCACGAGGAT CTTCCCGCCC CACATCCGCA AAGCCCTCAC CGCCCGCGAC CAGGGCTGCG CCTTCCCGGG CTGCACCATC CCCGCCCCGT GGTGCGAAGC CCACCACACC ACCTACTGGT CACACGGCGG AACCACCAGC ACCGACAACG GCACACTGCT CTGCTCCCAT CACCACCACC TGATCCACAA AGAGCAGTGG CACATCCAGG TCAAAACCGG GATCCCCTGG TTCATCCCGC CACCCCACAT CGACCCACAC CAACAACCAC GAAGGAACAG CTACTTCAGA TGCTGA
|
Protein sequence | MEAVGEFAPN QALRLGRISD APFEPAAPKR TLRAVPPAEL TVGRPKPLAF DGGGSRGLAR CLALLEAISS TAVADAAGMR FPEAADFAAR VEDISRAAEY LQVVAAGAVD RSRREAISAA VRAGAGRGAA VGWTTGWGNE TAVHGPIGWA AGTEAQPADA GSDAAEGIPA ATPKTNDPDP ADDGCRNTTG FLRMRLRIGA GEARRRLALA EAVLPRAGIT GHPQEPERPE LAAAVASGSV GSRSATIITL ALDRVRHHAT EDTTARMEHA LTRTAAEHDT DFVTRIARQW TEAIDQDGSE PSEEELRHRQ GVFIRKPRRG LQHVEFFATP DQYEPLLTVM NTATNPRTQP EAAEGNDGTI SGEGGLDRRT RPQQLLDGLV SAAKTALATG SLPAAGGLRP QVMVTIDYRD LLDTLEQGTP GRGTGSFTFT GPVTAATVRK IACDADIIPV LLGGQGRVLD IGRTTRIFPP HIRKALTARD QGCAFPGCTI PAPWCEAHHT TYWSHGGTTS TDNGTLLCSH HHHLIHKEQW HIQVKTGIPW FIPPPHIDPH QQPRRNSYFR C
|
| |