Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0370 |
Symbol | |
ID | 4447164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 391445 |
End bp | 393115 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639688166 |
Product | HNH nuclease |
Protein accession | YP_829871 |
Protein GI | 116668938 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGCA GTGCAGCTGT GGAGACGATG GAGGCCGTTG AGGCATCCAT CGCTGTGCTG GCCGCTTGTG TCGGCCGCGG GGCGGGCAAC CCGGGCTCCA CCGGTGACGA CCCCCTTCGG GAGCAGGCGG ATGCGTGCCT GGACGGGCTG GCTGCGGCGG CGCGGGTGGA AGCCGGGATG GCAGCCCTGA AAGTGCACCT CGCTGCTGGA TACGCCCGTG CTGCCAGAGC CATGGCGCCG CCCGCCCTGT CCCCGCAGGA ACACACCGTG CAGGACATGG CCGTGACCGC TGAGGTGGCG TGTGTCCTGA CCGTCAGCGA ACGCACCGCC GGTGCCCTCC TGGCGGAAGC CCATGCACTA ACCACCACGC TGCCGCTGAC GTTGTCCGCG CTGCAGTCCG GCAGCATTTC CTGGCAGCAT GCCCGGGTGA TGGTGGACGA AACCGACAAC CTCGACGAAG CGGGGGCTGC CGGACTAGAG GCACACTTCC TGGACCCGGA TGCGCCTTCC GCTGCGCGCG GCTGTCCTGT CGGCGGCCTA GTCCCGGCCC GGTTCCGGGC GAAGGCCCGT ACGTGGCGGG AACGGCACCA CCCGGTCAGC ATCGAAAAAC GCCATAGTCG CAGCGCCGCC GACCGCCGGG TGCAATACAC GCCGGATCGC GACGGCATGG CCTGGCTCTC CGCGTACCTG CCCGCCAGCC AGGCAGCGGG CGTCTGGGAC CGCATCACCA CAGCTGCAAG ATCCCTCCAG GGAGCTGACG AGGCCAGGAC CCTTACCCAG CTCCGCGCCG ACGTCGCCGC GACCTGGCTC CTCACTTCAG GCCTCGAAAC TGCAGCCAAC TCAGCGGCTG GAGAGTTGCC TTCCCCTGCA GCGCAGGTTC TCATTACCGT TCCAGCGCTT TCGCTGCTGG GACTGACCGA GGAACCTGCC ATGCTGGATG GGTACGGGCC GGTCCCGCCG TCCATGGGGC GAGCCCTGGT CGCCGAAGGG GCTTCGTCGT TCCTGAGGGT GCTGACTGAC CCCCGTGATG GTGCGCCGTT GGAGATCGGC CGGACGAGCT ACCGCATCCC CAAAGCGCTG CGGCAGTGGC TGCGGCTGCG GGACGGGAAA TGTCCGTTCC CCGGCTGTAA CAACCAGTCA CTGGACAATG ATGCAGACCA CCTCCTCGCC TGGGATGAAG GCGGGCCCAC GGGGATCAGC AACCTCGGCC AACCGTGCCG CAAACACCAC CGACTCAAAC ACGGCACTGC GTGGACACCC GCCGGCGCCG GAACCCATGA ACCGCCCGGC TGGACTTCAC CCATGGGCCG CCACTACGCC AGCGAACAAC CAGACTGGGA ACCACCCCTC TGGCTAGCCG AGATCCTGGC CATGGCCACC AGTCGTGATC CAGGCGACCC CGACCCGGGC AGACCCGACC CGGGCACATT GGAGCCGGGC GCGCTGTGTT CAGGCATGCC CGACTACTTG GACGAACCGC CACCCGAAGT CGATCTGACG GACCATCTGC TAGCCGAGGA CCCTTTCAAG GACTGGGAGC TCTTCCTGGC CTACGACTCC TGCGCGGCTG CACAGGAGAG TGCCGCACCG GGGAGTGCTG CATGGGAGAG TGCGGCCGGT CTCCTTTGTC TTGAAGAGCG TTGGGCGATG GACTCATATG CGGCGCCCTA G
|
Protein sequence | MESSAAVETM EAVEASIAVL AACVGRGAGN PGSTGDDPLR EQADACLDGL AAAARVEAGM AALKVHLAAG YARAARAMAP PALSPQEHTV QDMAVTAEVA CVLTVSERTA GALLAEAHAL TTTLPLTLSA LQSGSISWQH ARVMVDETDN LDEAGAAGLE AHFLDPDAPS AARGCPVGGL VPARFRAKAR TWRERHHPVS IEKRHSRSAA DRRVQYTPDR DGMAWLSAYL PASQAAGVWD RITTAARSLQ GADEARTLTQ LRADVAATWL LTSGLETAAN SAAGELPSPA AQVLITVPAL SLLGLTEEPA MLDGYGPVPP SMGRALVAEG ASSFLRVLTD PRDGAPLEIG RTSYRIPKAL RQWLRLRDGK CPFPGCNNQS LDNDADHLLA WDEGGPTGIS NLGQPCRKHH RLKHGTAWTP AGAGTHEPPG WTSPMGRHYA SEQPDWEPPL WLAEILAMAT SRDPGDPDPG RPDPGTLEPG ALCSGMPDYL DEPPPEVDLT DHLLAEDPFK DWELFLAYDS CAAAQESAAP GSAAWESAAG LLCLEERWAM DSYAAP
|
| |