Gene Arth_0370 details


Gene Information

Locus tag: Arth_0370
Symbol:
ID: 4447164
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 391445
End bp: 393115
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 68%
IMG OID: 639688166
Product: HNH nuclease
Protein accession: YP_829871
Protein GI: 116668938
COG category:
COG ID:
TIGRFAM ID:
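
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(391445, 393115)
print(length)                  # 1671
print(protein_length(length))  # 556
```

Under these assumptions, 1671 bp corresponds to 557 codons, and dropping the stop codon gives the 556 aa listed for the protein.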


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAGCA GTGCAGCTGT GGAGACGATG GAGGCCGTTG AGGCATCCAT CGCTGTGCTG 
GCCGCTTGTG TCGGCCGCGG GGCGGGCAAC CCGGGCTCCA CCGGTGACGA CCCCCTTCGG
GAGCAGGCGG ATGCGTGCCT GGACGGGCTG GCTGCGGCGG CGCGGGTGGA AGCCGGGATG
GCAGCCCTGA AAGTGCACCT CGCTGCTGGA TACGCCCGTG CTGCCAGAGC CATGGCGCCG
CCCGCCCTGT CCCCGCAGGA ACACACCGTG CAGGACATGG CCGTGACCGC TGAGGTGGCG
TGTGTCCTGA CCGTCAGCGA ACGCACCGCC GGTGCCCTCC TGGCGGAAGC CCATGCACTA
ACCACCACGC TGCCGCTGAC GTTGTCCGCG CTGCAGTCCG GCAGCATTTC CTGGCAGCAT
GCCCGGGTGA TGGTGGACGA AACCGACAAC CTCGACGAAG CGGGGGCTGC CGGACTAGAG
GCACACTTCC TGGACCCGGA TGCGCCTTCC GCTGCGCGCG GCTGTCCTGT CGGCGGCCTA
GTCCCGGCCC GGTTCCGGGC GAAGGCCCGT ACGTGGCGGG AACGGCACCA CCCGGTCAGC
ATCGAAAAAC GCCATAGTCG CAGCGCCGCC GACCGCCGGG TGCAATACAC GCCGGATCGC
GACGGCATGG CCTGGCTCTC CGCGTACCTG CCCGCCAGCC AGGCAGCGGG CGTCTGGGAC
CGCATCACCA CAGCTGCAAG ATCCCTCCAG GGAGCTGACG AGGCCAGGAC CCTTACCCAG
CTCCGCGCCG ACGTCGCCGC GACCTGGCTC CTCACTTCAG GCCTCGAAAC TGCAGCCAAC
TCAGCGGCTG GAGAGTTGCC TTCCCCTGCA GCGCAGGTTC TCATTACCGT TCCAGCGCTT
TCGCTGCTGG GACTGACCGA GGAACCTGCC ATGCTGGATG GGTACGGGCC GGTCCCGCCG
TCCATGGGGC GAGCCCTGGT CGCCGAAGGG GCTTCGTCGT TCCTGAGGGT GCTGACTGAC
CCCCGTGATG GTGCGCCGTT GGAGATCGGC CGGACGAGCT ACCGCATCCC CAAAGCGCTG
CGGCAGTGGC TGCGGCTGCG GGACGGGAAA TGTCCGTTCC CCGGCTGTAA CAACCAGTCA
CTGGACAATG ATGCAGACCA CCTCCTCGCC TGGGATGAAG GCGGGCCCAC GGGGATCAGC
AACCTCGGCC AACCGTGCCG CAAACACCAC CGACTCAAAC ACGGCACTGC GTGGACACCC
GCCGGCGCCG GAACCCATGA ACCGCCCGGC TGGACTTCAC CCATGGGCCG CCACTACGCC
AGCGAACAAC CAGACTGGGA ACCACCCCTC TGGCTAGCCG AGATCCTGGC CATGGCCACC
AGTCGTGATC CAGGCGACCC CGACCCGGGC AGACCCGACC CGGGCACATT GGAGCCGGGC
GCGCTGTGTT CAGGCATGCC CGACTACTTG GACGAACCGC CACCCGAAGT CGATCTGACG
GACCATCTGC TAGCCGAGGA CCCTTTCAAG GACTGGGAGC TCTTCCTGGC CTACGACTCC
TGCGCGGCTG CACAGGAGAG TGCCGCACCG GGGAGTGCTG CATGGGAGAG TGCGGCCGGT
CTCCTTTGTC TTGAAGAGCG TTGGGCGATG GACTCATATG CGGCGCCCTA G
 
Protein sequence
MESSAAVETM EAVEASIAVL AACVGRGAGN PGSTGDDPLR EQADACLDGL AAAARVEAGM 
AALKVHLAAG YARAARAMAP PALSPQEHTV QDMAVTAEVA CVLTVSERTA GALLAEAHAL
TTTLPLTLSA LQSGSISWQH ARVMVDETDN LDEAGAAGLE AHFLDPDAPS AARGCPVGGL
VPARFRAKAR TWRERHHPVS IEKRHSRSAA DRRVQYTPDR DGMAWLSAYL PASQAAGVWD
RITTAARSLQ GADEARTLTQ LRADVAATWL LTSGLETAAN SAAGELPSPA AQVLITVPAL
SLLGLTEEPA MLDGYGPVPP SMGRALVAEG ASSFLRVLTD PRDGAPLEIG RTSYRIPKAL
RQWLRLRDGK CPFPGCNNQS LDNDADHLLA WDEGGPTGIS NLGQPCRKHH RLKHGTAWTP
AGAGTHEPPG WTSPMGRHYA SEQPDWEPPL WLAEILAMAT SRDPGDPDPG RPDPGTLEPG
ALCSGMPDYL DEPPPEVDLT DHLLAEDPFK DWELFLAYDS CAAAQESAAP GSAAWESAAG
LLCLEERWAM DSYAAP
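
The sequences above are printed in blocks of ten characters separated by spaces and line breaks. The following is a minimal sketch of how such a block could be joined into a single string and its GC fraction computed, for comparison with the GC content field. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its GC value is not expected to match the whole-gene figure.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = "ATGGAAAGCA GTGCAGCTGT GGAGACGATG GAGGCCGTTG AGGCATCCAT CGCTGTGCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying clean_sequence to the full gene sequence and checking that its length is 1671, or to the protein sequence and checking for 556 residues, is a quick way to confirm that nothing was lost when copying the record.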