Gene Arth_3600 details


Gene Information

Locus tag: Arth_3600
Symbol:
ID: 4443911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4041678
End bp: 4042910
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 67%
IMG OID: 639691424
Product: N-isopropylammelide isopropylaminohydrolase
Protein accession: YP_833075
Protein GI: 116672142
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
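
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are both inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4041678
END_BP = 4042910


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature whose start and end positions are both included (assumption)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Codon count minus one for the terminal stop codon (assumed present)."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1233 410
```

Under these assumptions the computed values agree with the Gene Length (1233 bp) and Protein Length (410 aa) fields.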


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCATCA CTAACGTCCG CCCCTGGGGC GGGGACAACG TAGACCTCGA GGTTCACGAG 
GGGCGTATTG CAGCCGTTCA TCCCGCGGGG ACCGGCGACG TGTCAGCTGA CGCCGCCGGC
GGCACGATCG ACGGACGCGG CCGCATCGCC TTCCCCGCTT TCACGGACGT GCACGTCCAC
CTCGACTCGA CCCGGATAGG ACTGCCGTTT CGGGAGCACA CCGCCTCTCC CGGGGTGTGG
AACATGATGT GCAACGACCG GGAAAACTGG CGTGACACGC CCATCCCGTA CGCGGATGTG
GTGGCAGGAA CGCTGGAACG GATGATTGCG CGGGGCACCA CACGCGTCCG TTCCTATGCG
CAGATCGATG TGGACTGCAA GCTTGAGCGC TTCGAAGCAG TCCTGGCGGC GAAGGAGCGC
TTCGCCCACG CGGCCGAGGT AGAAGTCATG GCTTTCCCCC AGGCGGGCCT CCTCCTCGAG
GACGGCACTG TGCCGCTCCT TGAGGAGGCC CTCCGTGCGG GGGCGACCAC CATCGGCGGC
ATTGATCCCT GCCAGCTGGA CCGCGACCCG GCCCGCCATC TGGACATCGT CTTCGAGCTG
GCCGAGAAGT ACGGGGTGGA CGTGGATATT CACCTGCACG AGCCCGGCCA TCTGGGGGTC
TTCAGTGCGG AACTCATCTT CGAACGCACC CGCGCACTGG GCATGCAGGG ACGCGTCTCG
CTTTCCCACG CCTACGATCT GGCCAACGTC CACCCCGATG TGACCGCCCG GATCGTGGAG
CAGATGGCCG AGCTGGACGT CGCCTGGGCG ACCGTGGCCC CGGCAAGTGG AGGCGCCCAG
TTCGACCTGG CCCGGATGAC GGAGGCCGGG ATCCGCGTTG GTCTGGGCGA GGATGGTCAA
CGGGATTACT GGAGTCCGTA TGGCAATTGC GACATGCTCG ACCGCACCTG GCAGCTGGCC
TTCACGCACC GGCTGCGCAA GGACCGTCTC ATCGAGCACT GCGCGGCGAT CGCCACGGTC
GGAGGCGCGT CCATCATGGA CCGCACCGTC CCGCGGCTCA CCAGCCCCGA CGACCGGCCG
GGCCTGACCC CAGGCGACCG GGCCGACGTC GTCCTGGTAG ACGGCGAAAC CGTCACCAGC
ACCGTCATGG ACCGCGGCAC CGACCGCACC GTCATACACG ACGGCAGGCT CGTCGCCGAC
GGGCTGGCTG TTCTTCCACG CGCAGCCGGG TAA
 
Protein sequence
MLITNVRPWG GDNVDLEVHE GRIAAVHPAG TGDVSADAAG GTIDGRGRIA FPAFTDVHVH 
LDSTRIGLPF REHTASPGVW NMMCNDRENW RDTPIPYADV VAGTLERMIA RGTTRVRSYA
QIDVDCKLER FEAVLAAKER FAHAAEVEVM AFPQAGLLLE DGTVPLLEEA LRAGATTIGG
IDPCQLDRDP ARHLDIVFEL AEKYGVDVDI HLHEPGHLGV FSAELIFERT RALGMQGRVS
LSHAYDLANV HPDVTARIVE QMAELDVAWA TVAPASGGAQ FDLARMTEAG IRVGLGEDGQ
RDYWSPYGNC DMLDRTWQLA FTHRLRKDRL IEHCAAIATV GGASIMDRTV PRLTSPDDRP
GLTPGDRADV VLVDGETVTS TVMDRGTDRT VIHDGRLVAD GLAVLPRAAG
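
The gene and protein sequences are printed in space-separated blocks of ten characters. As a rough sketch of how the two listings relate, the Python below strips the spacing, translates the first and last fragments of the gene sequence, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is deliberately partial: it covers only the codons found in these two fragments, whose assignments are the same in the standard code and in translation table 11. A full translation would need the complete table.

```python
# First and last fragments of the gene sequence, as printed in the record.
FIRST_BLOCKS = "ATGCTCATCA CTAACGTCCG CCCCTGGGGC"
LAST_BLOCKS = "GGGCTGGCTG TTCTTCCACG CGCAGCCGGG TAA"

# Partial codon table: only the codons that occur in the two fragments above.
CODONS = {
    "ATG": "M", "CTC": "L", "ATC": "I", "ACT": "T", "AAC": "N",
    "GTC": "V", "CGC": "R", "CCC": "P", "TGG": "W", "GGC": "G",
    "GGG": "G", "CTG": "L", "GCT": "A", "GTT": "V", "CTT": "L",
    "CCA": "P", "GCA": "A", "GCC": "A", "TAA": "*",
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODONS[dna[i:i + 3]]  # KeyError if the codon is not in the partial table
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(FIRST_BLOCKS))  # MLITNVRPWG
print(translate(LAST_BLOCKS))   # GLAVLPRAAG
```

The two translated fragments match the first and last ten residues of the protein sequence listed above. Applying `gc_fraction` to the complete gene sequence would give the value to compare against the GC content field.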