Gene Information |
Locus tag | Dret_2031 |
Symbol | |
ID | 8419876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 2330350 |
End bp | 2331108 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645038619 |
Product | 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR) carboxylase |
Protein accession | YP_003198893 |
Protein GI | 258406151 |
COG category | [R] General function prediction only |
COG ID | [COG1691] NCAIR mutase (PurE)-related proteins |
TIGRFAM ID | |
|
|
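The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2330350
end_bp = 2331108

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # compare with "Gene Length"
print(protein_length)  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the 759 bp and 252 aa listed in the table.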
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0016635 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0000823444 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACATTC ACGACCTCTT GCAGCGCGTG GCCCAAGGGC GCCTCTCTGT CGAGGACGCC GCGCGCGAAC TCCGTTTTGC CCCGTTCACC GCATCCGGGG AGGGCGTCTG CCTGGACGGC CACCGCGGGC TGCGCACCGG TGTTCCCGAA GTCGTCTTTG GGTCGGGGAA ATCGGAGGCG CAGCTCGAAC GGGCTGTTCA AGGCCTGGCT GATCAGGGAG CCTCGGTGCT GGTCACCAAG CTGAGTCCCG GTCAAGGCGA AGCGTTACAC GCCTTGTTTC CCGAGGGGAA GGTGCATCCC CAGGCCGGCC TTTTCACCCT CGGCGCTGAT CTCTCGCTGA CCCCTCCTTG GCCCGAACAA GGTGAGGCGC TGGTCCTCAG CGCCGGGGCC AGCGATCTCC CCGTGGCCCT GGAAGCTTAC GCCACAGCCC AATATTTCGG ACTGACGGCC GGACTGGTCA GTGACGTCGG CGTCGCCGGG CTGCACCGGC TTTTGCCCCA TCTCCAGGCC TGTGACCAGG CCCGAGTGCT CATCGTGGTT GCCGGGATGG AGGGGGCCTT GCCCTCTGTC GTCGCCGGCC TCACGGACAA GCCGGTCATC GCTGTGCCCA CGTCAGTGGG CTACGGGGTC TCGTTTCAGG GAGTGACCGC ACTTTTGGGC ATGCTCAGCA GTTGCGCCCC AGGCGTGGCG GTGGTCAATA TCGACAACGG GTTCGGGGCT GCCGCCATGG CCCGTAAATT GTGTCACCTG AAGCCCTGA
|
Protein sequence | MDIHDLLQRV AQGRLSVEDA ARELRFAPFT ASGEGVCLDG HRGLRTGVPE VVFGSGKSEA QLERAVQGLA DQGASVLVTK LSPGQGEALH ALFPEGKVHP QAGLFTLGAD LSLTPPWPEQ GEALVLSAGA SDLPVALEAY ATAQYFGLTA GLVSDVGVAG LHRLLPHLQA CDQARVLIVV AGMEGALPSV VAGLTDKPVI AVPTSVGYGV SFQGVTALLG MLSSCAPGVA VVNIDNGFGA AAMARKLCHL KP
|
| |
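The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below, using only the Python standard library, shows one way to strip the spacing, estimate GC content, and translate the coding sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used here as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 matches the standard code for amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the "Gene sequence" field.
prefix = "ATGGACATTC ACGACCTCTT GCAGCGCGTG"
print(translate(prefix))    # compare with the start of "Protein sequence"
print(gc_fraction(prefix))  # GC fraction of this prefix only
```

Passing the full Gene sequence field to the same functions would allow a comparison with the listed Protein sequence and GC content.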