Gene Arth_1643 details

Gene Information

Locus tag: Arth_1643
Symbol:
ID: 4445835
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1834409
End bp: 1835770
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 70%
IMG OID: 639689458
Product: deoxyribonuclease
Protein accession: YP_831137
Protein GI: 116670204
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG2265] SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase
TIGRFAM ID:
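
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1834409, 1835770)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length and Protein Length fields.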


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0120532
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCCCG AGACCCAGAC CCGCACGCAT ACCGAACTTG TGGTGGATAT CGGGCCCATC 
GCCCACGGCG GGCACTGCGT TGCCCGCCAC GAGGGCCGTG TGGTGTTTGT CCGCCACGCC
ATCCCGGGCG AGAAGGTCCG GATCCGTCTG ACGGATGCCG GCGAGGACTC CAAATTTTGG
CGAGCGGATG TTGTGGAAGT GCTCGAAGCA TCGCCCGACC GTATGCCCCA TTTCTGGCAT
GTGGCGGATT CGCTTCGGGC ATGGTCGCAC GGGCACCCTC CGGTGGGCGG AGCCGAGCTG
GGCCACGTGT CCCTTGGACG TCAGCGCAGC CTCAAGGCTG ACGTCCTGGC CGAACAGCTG
AAGCGGCTCG CCGGCGTCGA ACGCGTCACG GAGGTGGAAG CTGTCGGCGC GGCCGCCGCC
GCAGGCGACC ACAGCCCCGG TGCGCCCGGG CTGGGCTGGC GCACGCGGGC CAGCTTTGCC
GTGACACCCG CCGGGAAGCT GGGCATGCAC GCGCACAGGT CCGACCAGGT CATTGCCATC
CGCGAGATGC CATTGACCGT GTCCGCCATC AACGACCTCA GGCTTTGGGA CATCGACCTC
GCGGGCGTCG AACGCGTTGA AGTGGCCGCG CCAGCCAACG GCTCGCGCCC GCTGGTCCTG
CTGGCACCGG CCGAAGGAAC CCGCGCGAAG CGCCTTAGCG GGATCCTCGC GCAGCTTCCC
GACGAGGTCT CGGTGGCGAG CTTCGATCCG GCCAAGGGCG AGTCGCTGCA GCTGCGCGGC
CGCACCTGGG TGCAGGAGTC GGCCGCCGGG CACGAGTTCC GGGTCACGGG GGAGGGCTTC
TGGCAGATCC ACCGGGATGC TCCGGAAACA CTAGTCGGGG CGCTTAAGGG ATTCCTGCAC
GACGGCGGGT ACCTGGAGCC GGGCGCGGTG GTTGCGGACC TGTATGCCGG GGCGGGGCTG
TTCACCGCAG CGCTTGCGGA CGCCGTTGGC GTGACCGGCT CCGTGCTGTC CGTTGAGGGT
GCCCCCGGCA CCAGCCGGGA CGCGCGGAAG AACCTGCACG GGGCACCGCA GGTGGAAATT
GTGCAGGGAC GCGTGGAACG GGTCCTGCGC CAGAAGCCAC GTAACTTCGA TGCCCTGGTG
CTCGACCCGC CCCGCGCCGG CGCGGGCAAG GCAGTGGTCA GCCAGCTGAT GGCGGCCGGT
CCCCGGGCCA TCGCCTACGT GTCCTGCGAT CCGGCGTCGT TCGCCCGGGA CCTGGGGTAC
TTCCGGCAGG GAGGCTGGCA GCTCGCGGGG CTGCGGGCAT TCGACCTGTA CCCGCACACC
CACCACATGG AGACAGTGGC GTTGCTGACG CCCCCGGCTT GA
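
The gene sequence is printed in blocks of 10 bases, 60 per line, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. It assumes the GC content field of this record is the share of G and C bases in the gene sequence; the helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as a short example.
first_line = "ATGAACCCCG AGACCCAGAC CCGCACGCAT ACCGAACTTG TGGTGGATAT CGGGCCCATC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full gene sequence instead of the first line gives a value that can be compared against the GC content field.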
 
Protein sequence
MNPETQTRTH TELVVDIGPI AHGGHCVARH EGRVVFVRHA IPGEKVRIRL TDAGEDSKFW 
RADVVEVLEA SPDRMPHFWH VADSLRAWSH GHPPVGGAEL GHVSLGRQRS LKADVLAEQL
KRLAGVERVT EVEAVGAAAA AGDHSPGAPG LGWRTRASFA VTPAGKLGMH AHRSDQVIAI
REMPLTVSAI NDLRLWDIDL AGVERVEVAA PANGSRPLVL LAPAEGTRAK RLSGILAQLP
DEVSVASFDP AKGESLQLRG RTWVQESAAG HEFRVTGEGF WQIHRDAPET LVGALKGFLH
DGGYLEPGAV VADLYAGAGL FTAALADAVG VTGSVLSVEG APGTSRDARK NLHGAPQVEI
VQGRVERVLR QKPRNFDALV LDPPRAGAGK AVVSQLMAAG PRAIAYVSCD PASFARDLGY
FRQGGWQLAG LRAFDLYPHT HHMETVALLT PPA
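
The protein sequence can be derived from the gene sequence by codon translation. The following is a minimal sketch, not the pipeline used to produce this record. It builds the codon table from the standard amino acid string, which translation table 11 shares with table 1 for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate block-formatted input
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Last line of the gene sequence above, which ends in the TGA stop codon.
print(translate("CACCACATGG AGACAGTGGC GTTGCTGACG CCCCCGGCTT GA"))
```

The printed residues correspond to the end of the protein sequence shown above.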