Gene Arth_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1189 
Symbol 
ID4446320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1289709 
End bp1290950 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content67% 
IMG OID639688996 
Productphosphoribosylaminoimidazole carboxylase 
Protein accessionYP_830683 
Protein GI116669750 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTGAAGC CCGGAACACC CCTGCTCTTC TGTAGGCTGG CTCTTGTGAC TTTTCCAGTA 
ATAGGCGTAG TTGGCGGCGG CCAGCTAGCC CGCATGATGG CCCCCGCCGC AACGGCCCTG
GGCTTTGAAC TCCGTGTCCT GGCCGAAGGC GAGGACGTTT CCGCGGTTTC CGCAGTGCCG
ACGTCGCCGG TGGGCGACTA CAAGGACCTT GACGCCCTCC TCGAGTTCTC CCGGGGGCTG
GACGTCATGA CCTTTGACCA CGAGCACGTC CCCAACGACC ACCTGCGGGC ACTGCAGGAG
GCCGGCGTCA ACGTCCAGCC CGGCCCGGAC GCCCTGGTCC ACGCGCAGGA CAAGCTGGTG
ATGCGGGCAG CCATCGACCG GCTTGAGCTG CCCAACCCGG CCTGGGCCTC CGTTGCCGAC
GTCGAGGCCC TGGTTGCCTT CGGCGAGAAG ACCGGGTGGC CGGTGGTGTT GAAGACGCCC
CGCGGCGGTT ACGACGGCAA AGGGGTCCGC ATGGTCGGAT CGGCTGAGGA AGCCGCCGAC
GCCGCCGACT GGTTTGCGGC CATGACCCCG CTGCTGGCCG AGGCCAAGGT GGAGTTCAGC
CGCGAACTGT CCGCACTCGT AGCGAGGACT CCTGACGGTG AATCCCGCGC CTGGCCCGTG
GTCCACACCA TCCAGGTGGA CGGCGTCTGC GACGAAGTGA TCGCCCCGGC CCAGGACATT
CCGCTTGAAG TCGCCGCGGC CGCCGAAGAC GCCGCAATCC GCATCGCCAA CGAACTCGGA
GTCACCGGCG TCATGGCCGT GGAGCTCTTC GAAACCCCCG GCGTCGGCTC CGGCTTCCTG
ATCAACGAGC TCGCAATGCG CCCGCACAAC ACCGGCCACT GGACCCAGGA CGGATCGGTC
ACGAGCCAGT TCGAACAGCA CCTGCGTGCC GTGCTGAACC TCCCGCTCGG TGCCACCGAC
GCTCTGGGAC AGATTGTTGT GATGAAGAAC TTCCTTGGCG GCGAGAACCA GGAACTGTTC
TCGGCGTATC CGCTGGCCAT GGCCAGCGAG CCGGCCGCGA AGATCCACTG CTACGGCAAG
GCCGTCAGGC CCGGCAGGAA GATCGGCCAC GTCAACCTGG TGGGGGCAGC CGCTTCCGAT
GTCGACTCCG TCCGGCAGCG CGCCACCACC GTCGCCAACA TCATCAGGGA CGGCCGTGCT
CCGGCCCGAC CTGCACCAGG GAACTCCGAG GAGACCGTAT GA
 
Protein sequence
MVKPGTPLLF CRLALVTFPV IGVVGGGQLA RMMAPAATAL GFELRVLAEG EDVSAVSAVP 
TSPVGDYKDL DALLEFSRGL DVMTFDHEHV PNDHLRALQE AGVNVQPGPD ALVHAQDKLV
MRAAIDRLEL PNPAWASVAD VEALVAFGEK TGWPVVLKTP RGGYDGKGVR MVGSAEEAAD
AADWFAAMTP LLAEAKVEFS RELSALVART PDGESRAWPV VHTIQVDGVC DEVIAPAQDI
PLEVAAAAED AAIRIANELG VTGVMAVELF ETPGVGSGFL INELAMRPHN TGHWTQDGSV
TSQFEQHLRA VLNLPLGATD ALGQIVVMKN FLGGENQELF SAYPLAMASE PAAKIHCYGK
AVRPGRKIGH VNLVGAAASD VDSVRQRATT VANIIRDGRA PARPAPGNSE ETV