Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3968 |
Symbol | |
ID | 4447628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4482593 |
End bp | 4483966 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639691799 |
Product | HNH nuclease |
Protein accession | YP_833443 |
Protein GI | 116672510 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.737066 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGCA AGCAGGAGCC GGAAAGGGCT TCGGGAGCAG CGGACCCGGT GGCGATCACC GAGATGATCG CCTCCCTGAC TGTCGCACGG CCTGTGGTGG ACGGTGCGGG GATGATCGAC CAGCTTCGTA AACTCGAGGA GCTAAAATCC GCCGCAGCCG CGCTGCAGGC CAGGGTCGCG GTTGCCTTTG ACATATGCCA GCGCCGTGAA CAGTCCGACG TCGGAGTTCC GGCGGATCAG CTTGGCACCG GCGTCGCTGC GCAGATCGCC CTCGCCCGGC GGGAATCGCC AGCCAGGGGC AGCCGGCTTC TGGGCTTGGC GAAGGCGCTC GTCACCGAGA TGCCGCATAC CCTCGCCGGG CTGGAGGCAG GCCGGCTCAA CGAATGGCGC GCAACACTTC TGGTGCGTGA GACTGCCTGC CTGTCCGCGG CTGACCGCAG CGCCGTCGAC GAGGAGTTGG CGTCCGACGT CGGGTCGTTC AATGGGGCGG GCGACCGAGC CGTCGTCGCA GCGGCGCGGT CAGCCGCCTA TCGGCGTGAT CCGAGGTCCG TTGCGGACCG CGCCAGCCAC GCCGCCGCGG AGCGGCATGT CAGTCTTCGT CCCGCCCCGG ACACCATGTG CTACGTGACT GCATTCCTCC CGGTGGCCGA GGGCGTGGCC GTCCACGCGG CCCTCACCAG GCATGCGGAC ACCCTGCGGT CCGACGGCGA CCCGCGTTGC CGCGGCCAGC TCATGGCAGA TGCGCTGGTT GAACGCGTGA CGGGAACTGC GGGTGGAATC TGCGGGGTCG AAATCCAGCT CGTCATGACC GACCGCGCCC TGTTTCAGGG AGACAGTGAA CCGGCCCGGC TTGCGGGTTA CGGCATAGTC CCTGCGGCGT GGGGGCGGAA GATCGTTGCC CGGGACCAAG GCCAACCCAT GAACGTCTGG CTCCGACGGC TGTACACCGC ACCGGGCAGC GGTGATCTCG TGGCGATGGA GTCGAGAGCA CGCCTTTTCC CGTCCGGGCT GCGCCGCTTC ATCCAAGTCC GGGACCACAC CTGCCGCACG CCATTTTGCG ACGCTCCGAT CAGGCATCTG GATCACATCC TGCCGTGGCA CAGCGACGGA ACCACCACTC AAGGCAACGG GGCAGGGCTC TGCGAAGCCT GCAACCACAT CAAGGAGGCC CCCGGCTGGC GCTCCCGCCC CCTGCCGGGA CCACGGCACA CGTTCCAGCT GACAACGCCC GCCGGGCACG GCTACCAGTC CACGGCGCCG CCGCTGCCGG GGCATCCGCC TCCCGAAATA GCAGCCTCAC GCCGCCGCCG TGAACTCCGG CACCAGGTCA AGGCGCTCAA GCGCGTCAGG TTAAGGGCTG CGGCTGCTGC GTGA
|
Protein sequence | MDGKQEPERA SGAADPVAIT EMIASLTVAR PVVDGAGMID QLRKLEELKS AAAALQARVA VAFDICQRRE QSDVGVPADQ LGTGVAAQIA LARRESPARG SRLLGLAKAL VTEMPHTLAG LEAGRLNEWR ATLLVRETAC LSAADRSAVD EELASDVGSF NGAGDRAVVA AARSAAYRRD PRSVADRASH AAAERHVSLR PAPDTMCYVT AFLPVAEGVA VHAALTRHAD TLRSDGDPRC RGQLMADALV ERVTGTAGGI CGVEIQLVMT DRALFQGDSE PARLAGYGIV PAAWGRKIVA RDQGQPMNVW LRRLYTAPGS GDLVAMESRA RLFPSGLRRF IQVRDHTCRT PFCDAPIRHL DHILPWHSDG TTTQGNGAGL CEACNHIKEA PGWRSRPLPG PRHTFQLTTP AGHGYQSTAP PLPGHPPPEI AASRRRRELR HQVKALKRVR LRAAAAA
|
| |