Gene Arth_1650 details

Gene Information

Locus tag: Arth_1650
Symbol:
ID: 4445842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1843289
End bp: 1844311
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 67%
IMG OID: 639689465
Product: inosine/uridine-preferring nucleoside hydrolase
Protein accession: YP_831144
Protein GI: 116670211
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1957] Inosine-uridine nucleoside N-ribohydrolase
TIGRFAM ID:
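
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1843289
END_BP = 1844311
GENE_LENGTH_BP = 1023
PROTEIN_LENGTH_AA = 340


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1023
    print(expected_protein_length(GENE_LENGTH_BP))  # 340
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields.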


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.358595
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCATTCGG TCCTGATGGA CGTGGACACC GGGATCGACG ATGCCCTGGC CCTGGTCTAC 
CTCCTGTCCC GCCCCGACGT CCGGCTGCAG GCCATCACCT GCACCGCCGG AAACGTGGGC
GCACGCCAGG TGGCACTGAA CAACCTTGCC CTGCTCGAGT TGTGCGGCAC ATCAGGAGTC
GAAGTGGCGA TCGGGGCCGA AGTGCCGCTC GAGATCCCGC TGGTCACCAC GGAGGAAACC
CACGGACCGC AGGGAATCGG CTACGCCGAG CTGCCGGTGC CCGCGCAACA AATCTCGGAG
CGGCACGCCG TGGACGTCTG GGTGGACGAG GTGCGCAAGC ACCCGGGCGA GATCACGGGC
CTTATCACCG GCCCGCTGAC CAACTTCGCC CTCGCCCTCC GCCGGGAACC GGAACTGCCG
CAGTTGCTCA AGGGGCTGGT GATCATGGGC GGTTGCTTCT ACTACCAGGG CAACACCACC
CCGACGGCAG AGTGGAACGT CTCGGTCGAT CCGCATGCCG CGAAGGAAGT CTTTGCCGCC
TACCGGGGCC TCCCGGAGGA CAAGCTGCCG GTGGTGTGTG CCCTGGAGAC CACCGAACTG
GTCGAGATGC GGCCCGAACA CCTGCAGCGA CTGGCCGAAG CCGCCGGCAC TGGTCCGGAA
CTCGTCCTTC CGGACCAGCC GGAGGGGCTC CGCAGCAGCT CCGGCAACCC CCTGGTGGCG
TGCCTGTCCG ATGCCATCCG CTTCTACATG GAGTTCCACC GGCTCTACGA CCAGGGCTTC
GTGGCCCACG TGCACGACGC CTTCGCTGCC TGTGTGGCCG TGGGCCGGAC GGAATACACC
GCCCGGCTGG CAACGGTTGA CGTCGAGACC GGATCCGCGC TGCTGATGGG CACCACCGTC
GCCGATTACC GCGGACTGTG GGGGCTGCCG CCGAACGCCC GGATTGTGAC GTCGAACAAT
CCGAAGCAGT GCTTTGATGA GCTCATCAAT TCAGTGGGCG CACTGGCCAG GCGGCTGGCC
TAA
 
Protein sequence
MHSVLMDVDT GIDDALALVY LLSRPDVRLQ AITCTAGNVG ARQVALNNLA LLELCGTSGV 
EVAIGAEVPL EIPLVTTEET HGPQGIGYAE LPVPAQQISE RHAVDVWVDE VRKHPGEITG
LITGPLTNFA LALRREPELP QLLKGLVIMG GCFYYQGNTT PTAEWNVSVD PHAAKEVFAA
YRGLPEDKLP VVCALETTEL VEMRPEHLQR LAEAAGTGPE LVLPDQPEGL RSSSGNPLVA
CLSDAIRFYM EFHRLYDQGF VAHVHDAFAA CVAVGRTEYT ARLATVDVET GSALLMGTTV
ADYRGLWGLP PNARIVTSNN PKQCFDELIN SVGALARRLA
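
The sequences are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It assumes the amino acid assignments of translation table 11 are the same as those of the standard code (the tables differ in which start codons are permitted) and that translation stops at the first stop codon. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    fragment = clean_sequence("ATGCATTCGG TCCTGATGGA CGTGGACACC")
    print(translate(fragment))  # MHSVLMDVDT
```

Running `gc_fraction` on the full cleaned gene sequence gives a value to compare with the GC content field, and `translate` on the same input gives a string to compare with the protein sequence.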