Gene Daro_0451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0451 
Symbol 
ID3568363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp494032 
End bp495192 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content63% 
IMG OID637678892 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_283678 
Protein GI71906091 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTAA AGTCTGCAAT GATTCTTCCT CCAGCTACCC TTGGCATGCT CGGTGGCGGC 
CAGCTCGGCC GCTTCTTCGT TTCGGCCGCC CACGAACTGG GCTATCAGGT CTGGGTGCTC
GATCCGGACA AGAATTCGCC GGCCGGCCAG ATCGCCGAAC GCCATTTTTG TGTTGATTAC
AACGACTATG CGGCGCTTGA CGAGTTTGCC GCCGGTTGTG CCGCAATCAC CACCGAGTTT
GAGAACGTGC CCGCCGATAC GCTGGATTAT CTAGCCAAGT TCGTGCCGGT ACGCCCGTCG
GCAGCGGCCG TCGGCATTTG TCAGAACCGC ATCGCCGAAA AGTCCTTCCT GCGCGACAAC
GGCCTGCCGC ACGGTCCATT CGCCGCCATC CGTTGCGAAG ACGACATTCG TAATGCCGAT
GCCTCGCTGT TTCCGGCCAT CCTGAAAGTG GCCCGCTTCG GCTACGACGG CAAGGGGCAG
GCCACGGTCC ATAACCGCGA GGAAGCGCTG GTCGCCTTCG GTCAGTTCAA GGGCGAACAG
TGCGTGCTGG AACAGCGCCT GACGCTCGAC TACGAGGTCT CGGTCGTCCT CGCCCGTGAC
GAGCGCGGCC GGGTCGCCTG CTTCCCGACC GGCGAAAATC AGCACACCAA GGGCATCCTC
GACGTTTCCA TCGTGCCGGC GCGCACCACC GCTTGCGTCA AGAGTGATGC CGAGGAGGTC
GCTGCCCGTA TTGCTGAAAA GCTCGGCTAC ATCGGCACCA TGGGCGTCGA GTTCTTCATC
AGCCGCGGCC AGTTGATCGT CAACGAAATG GCACCGCGGC CGCACAACAG CGGCCACTAC
ACCATTGACG CCTGCGTGAC CGACCAGTTC GAGCAGCAGG TGCGTGCCCT GTGCGGCTTG
CCGCTTGGCG AGCCGCGGGC GCACTCGGCC TCGGTCATGG TCAATCTGCT CGGTGACCTT
TGGTACGACG GCGAAACCTA CCGCGAGCCG GACTGGGCCA AGCTGCATGC CGTGCCCAAC
TTGAAGCTGC ACCTCTACGG CAAGCACCAC GCCCGCCCGG GACGCAAGAT GGGCCACTTC
ACGGTGATCG GGGACAACGC CGAGGCCGTG CAAAAGGCCG CTCTGGCTGC CCGTGCCGCC
ATCGGCATCA GGGACGAATG A
 
Protein sequence
MPVKSAMILP PATLGMLGGG QLGRFFVSAA HELGYQVWVL DPDKNSPAGQ IAERHFCVDY 
NDYAALDEFA AGCAAITTEF ENVPADTLDY LAKFVPVRPS AAAVGICQNR IAEKSFLRDN
GLPHGPFAAI RCEDDIRNAD ASLFPAILKV ARFGYDGKGQ ATVHNREEAL VAFGQFKGEQ
CVLEQRLTLD YEVSVVLARD ERGRVACFPT GENQHTKGIL DVSIVPARTT ACVKSDAEEV
AARIAEKLGY IGTMGVEFFI SRGQLIVNEM APRPHNSGHY TIDACVTDQF EQQVRALCGL
PLGEPRAHSA SVMVNLLGDL WYDGETYREP DWAKLHAVPN LKLHLYGKHH ARPGRKMGHF
TVIGDNAEAV QKAALAARAA IGIRDE