Gene Arth_1274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1274 
Symbol 
ID4446251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1411304 
End bp1412743 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID639689082 
ProductHNH endonuclease 
Protein accessionYP_830768 
Protein GI116669835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGGGA ATTCGGGGTC CGAAGCGGTA CCGGCAGCAG TGGAGTGCTG TTGTGCTTGC 
GGCCGGTGGC GTGAGCCTGC AGGGGATGCG TTTGAGGGGG ACGGTGCCCG GCTGATTGAT
GAGATCCGGG CCTTGGAAAA CCACAAGTCT GCCCTCGCTG CCCGCCAGGC CCGGCTGGCT
GTCGCATTTG ACCAGCAGCA GCGCCGCCAA CGGTCTGCTG CCGGAGCACC CGCAGACCAA
TGCGGGGCAG GTGTCGGAGC GCAGATCGCG CTCGCCCGTC GCGAATCCCC GGCTAACGGG
AGCCGGCTGT TGGGCCTTGC CAAAGCTATG GTGACCGAGA TGCCCCGGAC CCTCGCAGCC
CTGGAGACCG GGCAGTTGAA CGAGTGGCGG GCCACGTTGT TGGTGCGCGA AACCGCCTGC
CTGTCGGCGG AGGACCGCTG CGCCGTGGAC GAGGAACTCG CCCCCGACAC CGGCGCCTTT
GACGGGGCAG GGGACCGCCG AATCATCGCC GCGGTCCGAG CCGCGGCTTA CCGCCGTGAC
CCCCGTTGTG TCACCCGGAG GGCCAGTCAC GCCGCAGCCG AACGCCACGT CAGCCTTCGT
CCCGCTCCTG ACACCATGTG CTACCTGACC GCCCTGCTGC CCGTCGCGGC GGGCGTCGCC
GTTCACGCTG CCCTTGTCCT GAACGCGGAG TCTGTCCGCA GCAGCGGAGA CCCTCGCTCT
CGCGGACAAA TCATGGCCGA TGACCTGGTC GAACGCGTCA CCGGGACGCC GGGCGGCTTT
ACAGGCATCG AAATCCAGCT CGTCATGACT GATCGGGCCC TCTTTCAAGG TGACAGCGAA
CCGGCACGCC TCCCCGGCTA CGGTGTGGTC CCAAGCGGCT GGGCCAGAAA CATCATTGAC
CGTGGCGGAG CTGCGCCTGC TATCCGGGAT CAAGCCTTCA ACACTTGGAT CCGTCGTCTG
TACACGGCCC CTGCCACGGG CGAGCTGGTG GCAATGGATT CCCGCGCCCG GCTTTTCCCC
GCCGGACTCC GCCGCTTCAT CGAGGCACGC GACGACACCT GCCGCACGCC CTTCTGCGAC
GCCCCCATCC GCCACCTGGA CCACGTCGTC CCCTGGCACG GCGGCGGAGC CACAACACTG
GACAACGGCG CCGGGCTCTG CGAGGCCTGC AACCACACTA AGGAAGCGCC GGGCTGGAAG
GCTCACCCGT TGAACGCGCC AAACGCCAAG GGTGGGGCGA GGCATGCCAT CCGGTTGACA
ACGCCCACCG GCCACAGCTA CCGATCCACT GCACCGCCGT TGCCGGGAAT CCAACGCGGC
AGCCCGGCCG CCGACTCCGG AGAGCCGGAA GGGGTTCGCC AACGGAAAGA GCTTCGACAT
CGCGCCAAGA TACACAGGCG AACCGTTCGG TCGCTGCGCG GTGCTCCATG CGCGGCGTAA
 
Protein sequence
MDGNSGSEAV PAAVECCCAC GRWREPAGDA FEGDGARLID EIRALENHKS ALAARQARLA 
VAFDQQQRRQ RSAAGAPADQ CGAGVGAQIA LARRESPANG SRLLGLAKAM VTEMPRTLAA
LETGQLNEWR ATLLVRETAC LSAEDRCAVD EELAPDTGAF DGAGDRRIIA AVRAAAYRRD
PRCVTRRASH AAAERHVSLR PAPDTMCYLT ALLPVAAGVA VHAALVLNAE SVRSSGDPRS
RGQIMADDLV ERVTGTPGGF TGIEIQLVMT DRALFQGDSE PARLPGYGVV PSGWARNIID
RGGAAPAIRD QAFNTWIRRL YTAPATGELV AMDSRARLFP AGLRRFIEAR DDTCRTPFCD
APIRHLDHVV PWHGGGATTL DNGAGLCEAC NHTKEAPGWK AHPLNAPNAK GGARHAIRLT
TPTGHSYRST APPLPGIQRG SPAADSGEPE GVRQRKELRH RAKIHRRTVR SLRGAPCAA