Gene Arth_3736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3736 
Symbol 
ID4443749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4209438 
End bp4210583 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content69% 
IMG OID639691560 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_833211 
Protein GI116672278 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTCCG CATCTTCCTC CGCCCAGAAC GCCGGCATCA CGTACGCCTC TGCCGGTGTT 
GACGTCGAAG CGGGCGACCG CGCCGTCGAA CTCATGAAGG ACGCCGTCAA GGCCACCCAC
AATTCCTCCG TGATCGGCGG AGTGGGCGGC TTTGCCGGAC TCTATGACGT TTCGAGGCTC
CTCACCTTCA AGCGGCCGCT GCTCGCAACC TCCACGGACG GCGTGGGCAC CAAGGTGGCC
ATCGCCCAGG CCATGGACAT CCACGACACC ATTGGCTTCG ACCTCGTGGG CATGGTGGTG
GACGACATCG TAGTGGTGGG CGCCGAACCG CTCTACATGA CCGACTACAT CGCCTGCGGA
AAGGTTGTCC CCGAGCGCAT CGCGGACATC GTCCGCGGCA TCGCGGCAGC CTGCTCCGTG
GCCGGCACCG CCCTGGTGGG CGGCGAAACC GCAGAGCACC CTGGCCTGCT AGGTGAGCAC
GAGTACGACG TCGCCGGTGC CGCCACCGGT GTTGTCGAGG CCGACGCCCT GCTGGGGCCG
GACCGCGTCC GCGCCGGCGA CGTAGTGATC GGCATGGCCT CCTCGGGCCT GCACTCCAAC
GGCTACTCCC TGGTCCGCCG CGTCATCAAC CACGCCGGCT GGGCCCTGGA CCGCCAGGTC
TCCGAACTCG GACGCACGCT GGGCGAGGAA CTGCTCGAGC CCACCCGGGT CTACGCCGCA
GACTGCCTGG ACCTGGCCCG CACCTTCCCG GTTACGGCAG GCGCGGCCGT CCACGGCTTC
AGCCACGTCA CCGGCGGCGG CCTCGCGGCC AACCTGGCCC GCGTCCTCCC CCAGGGCCTC
ATCGCCACGG TGGACCGCGC CACCTGGGAA CTCCCCGCCA TCTTCAAGCT GGTTTCGGAA
CTGGGCAACG TCCCGCTGGC CGACCTCGAG CGCACGCTGA ACCTCGGCGT GGGCATGGTG
GCGATCGTCT CCCCCGAAGC GGCCGACGCC GCAGTGAACC GCCTCAATGA CCGCGGCCTG
CCGTCCTGGG TCATGGGCAC CGTGGAGGAG AACTCGGACT CGATCGTGAA GACCGGCCCG
GACTATGTCC AGGGTGCCAA GGGTGTTGAC GGCGGCGCAG TCCGACTGGT GAACACGTAC
GCCTGA
 
Protein sequence
MTSASSSAQN AGITYASAGV DVEAGDRAVE LMKDAVKATH NSSVIGGVGG FAGLYDVSRL 
LTFKRPLLAT STDGVGTKVA IAQAMDIHDT IGFDLVGMVV DDIVVVGAEP LYMTDYIACG
KVVPERIADI VRGIAAACSV AGTALVGGET AEHPGLLGEH EYDVAGAATG VVEADALLGP
DRVRAGDVVI GMASSGLHSN GYSLVRRVIN HAGWALDRQV SELGRTLGEE LLEPTRVYAA
DCLDLARTFP VTAGAAVHGF SHVTGGGLAA NLARVLPQGL IATVDRATWE LPAIFKLVSE
LGNVPLADLE RTLNLGVGMV AIVSPEAADA AVNRLNDRGL PSWVMGTVEE NSDSIVKTGP
DYVQGAKGVD GGAVRLVNTY A