Gene Information |
Locus tag | Csal_1031 |
Symbol | |
ID | 4027802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1161490 |
End bp | 1162407 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637966208 |
Product | allophanate hydrolase subunit 2 |
Protein accession | YP_573087 |
Protein GI | 92113159 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
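The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are assumptions that happen to match the values in this record, not statements taken from the database documentation. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1161490
END_BP = 1162407


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 918 305
```

The printed values agree with the Gene Length (918 bp) and Protein Length (305 aa) rows.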
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGGT TGGTCGTCGA ACGCGCGGGA CCGTTGGCGC TGATTCAGGA TGGCGGGCGC TTCGGCGTGC GTCACCTTGG CGTGACCCAG GGAGGCGCCG CCGATTGGGT GTCGCTGGGC TGGGCCAACT GGTTGCTGGG GAACGCGCCT CAGGCCGCGG GGCTGGAGAT TACCGTGGGC GGCCTGACGC TCTGTGCCGA GACCTCGGCC ACGCTGGCCC TGACGGGCGC GGATCTCGGT GCCACGCTCG ACGAGACGCC GCTGGCACCG GGGGCCAGTT TCACCATCGC ACCCGGCCAG CGCCTGGCCT TCGAGCGTCC GCGTGCCGGT CTCCGCGCTT ATCTCGCGTT TCCCGGCGGG CTCGAGGCGC CCTCCGTGCT GGGCAGTGTG GCCAGCACGG TACGCGAGGG CCTGGGCGGT CTCGAGGGGC AGGGCGGGAC GCTTGGCGAG GGCGACAGGC TGGTCTGGTC GGGCCATGGC GAGCCCGGAC GCCGTCTGCC GGATGACGCC AGCACCCTGC CGGGCGAAAA CCCGCATCTG GCCCTGGTGA GCGGCGCGCA GATCGCCGGC TTCAGCGGTA CCAGCGTGTT CGAGGCGTTC AATGCCGCCT GGACCGTCGA CCAGCGTGCC GATCGCATGG GCGTGCGACT GACCGGCCCC GAACTGCGCT ACCTGGGGGC GGGCATGGTG TCGGAAGGCA TTCCCCTGGG AGCCGTTCAG GTACCTGCCG ACGGCCAACC CATCATCCTG CTCAACGACC GCCAGACCAT CGGCGGCTAT CCGCGTCTCG GCGCCCTGAC GCCACTGGCG TGCGCGCAAC TGGCGCAATG CGTGCCGGGC ACGCAGTTAC GCCTGAGGCC GGTCACCGCC AGCCAGGCCC GCGAGGCGCA TTTACGGCAA CTCGCGCGCT GGCGCTGA
|
Protein sequence | MKGLVVERAG PLALIQDGGR FGVRHLGVTQ GGAADWVSLG WANWLLGNAP QAAGLEITVG GLTLCAETSA TLALTGADLG ATLDETPLAP GASFTIAPGQ RLAFERPRAG LRAYLAFPGG LEAPSVLGSV ASTVREGLGG LEGQGGTLGE GDRLVWSGHG EPGRRLPDDA STLPGENPHL ALVSGAQIAG FSGTSVFEAF NAAWTVDQRA DRMGVRLTGP ELRYLGAGMV SEGIPLGAVQ VPADGQPIIL LNDRQTIGGY PRLGALTPLA CAQLAQCVPG TQLRLRPVTA SQAREAHLRQ LARWR
|
| |
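The gene and protein sequences above can be related with a small translation routine, and the GC content field can be recomputed from the gene sequence in the same way. This is a minimal sketch, not the database's own method: the helper names are hypothetical, the codon table is written out in the usual NCBI ordering (the amino acid assignments of translation table 11 are the same as those of the standard code), and alternative start codons and ambiguous bases are not handled. Only the first three 10-base blocks of the gene sequence are used in the example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGAAAGGGT TGGTCGTCGA ACGCGCGGGA"
print(translate(first_blocks))  # prints: MKGLVVERAG
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence rows to be checked in full.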