Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2947 |
Symbol | |
ID | 5900402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3196644 |
End bp | 3198839 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641563444 |
Product | molydopterin dinucleotide-binding region |
Protein accession | YP_001684572 |
Protein GI | 167646909 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.975721 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCGC CGACCAGCAT GGTGCGCACT GGCGTCTGTC CTCTTTGCGA AGCGATGTGC GGCCTGGAAA TGACCGTCGA CGGCGACCGC GTGGTCGGCG TGCGGGGCGA CGCCAGCGAC CCCTTGAGCG CCGGCTTCCT GTGCCCCAAG GCCGTCGCCC TGGAGGACCT GTCGGCTGAT CCGCGCCGCA TCGATCAACC CATGCTGCGG ACCGGCGACC AGCGCCGGCC GATCCCCTGG GACGAGGCCC TGGACCTGGC CGGGACCAGG CTGGGCGACC TCCAGCGCCG ACATGGCGTC GACGCGGTGG CCTTCTACAG CGGCAATCCC AACCTTCACT CCTACGCCGC CCAGTTGTCG GAACTGACCT TCAAGCGCGC CCTGCGCAGC CGCAACTGCA GCTCCACCGC CTCGGTCGAT CACCTGCCGC ACATGGTGGC CGCCGCGGCG ATGTTCGGCC ACCCGCTGCT GCTGCCCGTG CCGGACCTCG ACCGCACTGA CTTCTTCCTG GTCCTGGGCG CCAATCCCGC CGACTCCAAC GGCAGCCTGA TGGGCGCGCC AGGCATGCCG CTTCGGCTGC ACAAGCTGCG CCAGCGCGGG ACCCGGGTGG TGGTGGTCGA TCCCTGCCGT ACCCGCACCG CCGACCTGGC CGACACCCAC CTGTTCATCC GCCCGGGCAC GGACGCCCTG CTGCTGCTGG CCCTGGTCCG CATCCTGTTG ACCGAAAACC TGGTCAGGCT TGGGCGGCTG TCCGGCCTGG TCGACGCCGT CGAGGCCCTG CGCCAGGCCA GCGCGCCGTT CTCGCCGGAC CGCATCGCGC CGGTCACCGG CCTGGATCCT CGCGCCGTCG TCGATCTGGC GCGAGACCTC GCCGCCGCGC CGACGTCGGT CGTCTATGGC CGCGTCGGGG TCTGCACCCA GGAGCACGGC GCGCTCTGCG CCTGGTTGAT CAACGTCCTG AACATCCTGA CCGGCAATCT CGACCGCCCC GGCGGCGCGA TGTTCCCCAC CCCGGCCGTC GATATCGTCG CCGCCGCCTC GATGCTGGGC GCGGCCGGTG GGATGGAGCC CGGGCGCAGC CGGGTGCGCG GTCTGCCCGG CTTCAACGGC GAGCTGCCGC TCTCGACCCT GGCCGAGGAG ATCGACACCC CGGGCGAGGG CCAGGTGCGG GGCCTGATCG TCTCGGCCGG CAATCCGGTG CTGTCCGGCC CCAACGGCCG ACGACTGGAA GCGGCCCTGC CACGTCTCGA CTTCATGGTC GCCATCGATC GCTATGTGAC CGAGACCACT CGCCATGCCG ACCTCATCCT GCCGGCCGCC ATGCCGCTGG AGCGCGACCA CTACGACGTG GTGTTCCGGG CCTTCGGCGT GCGCGACACC GCTCGCTTCC AGGAAGCGAT CCTGCCCCGG CCGCCCGGCG TACGGGAGGA CTGGCGGATC TATTGCGGCC TAGGCGAGCG GATCGCCCGC CGGCGGGGTC TGGAAGGTCA CCTTTCGGCC CTGTCGCTGT CGGCCCTGCA GGCCGTCACC CCGCGCCGCA TCCTCGATCT GCTGCTGCGC CTGGGTCCCC ACGGCCTGAG CCTGAACCAG CTGGCGCGGA GCCCTCACGC CGTGGATCTC GGCCCGCTGC GCCCCGCCCT GCCCGCGCGG CTGCGCACGC CCGACAAGCG GATTCAGCTC ACGCCGCCAG TCCTGCTCGA AGCCCTTCCG GGCCTGGCCG CGCGCCTGGA CATGCCGCAA CCGCCCAGCG ACGCCTTGGT GCTGATCGGC CGCCGGCAGC TGCGCAGCAA CAACAGCTGG ATGCACCACC TGCCTCGCCT GCAGCGGGGC AGCAATCGCT GCACCCTGCT GATCCATGAG ACCGACGCCC GCCGCCGGGG CCTGGCCCCT GGTCAGACGG TCGAGATCCG CGGCCGTACC GGCGCCGTCG AGGCGCCGGT CGAGATCACC AACAGGATCA TGCCCGGCGT CGTCAGCCTG CCGCACGGCT TTGGCCACGG ACGCATAGCC GCCGAGCGAG CGATCCCGCG GGATTGGATC GGCGCCAGCC TCAACGACCT TACTGACGAC ACGCGTCTGG ACCGCCAGTC CGGCGCGGCC GCTTTCAGCG GCGTGCCGGT CGAGGTCGCC GCCGTTCCTC GCGAACTCGC CGAAGCGAGC TCGTCCATGC TCCACCCCGC GCCCAGTCAA ACCTAG
|
Protein sequence | MSPPTSMVRT GVCPLCEAMC GLEMTVDGDR VVGVRGDASD PLSAGFLCPK AVALEDLSAD PRRIDQPMLR TGDQRRPIPW DEALDLAGTR LGDLQRRHGV DAVAFYSGNP NLHSYAAQLS ELTFKRALRS RNCSSTASVD HLPHMVAAAA MFGHPLLLPV PDLDRTDFFL VLGANPADSN GSLMGAPGMP LRLHKLRQRG TRVVVVDPCR TRTADLADTH LFIRPGTDAL LLLALVRILL TENLVRLGRL SGLVDAVEAL RQASAPFSPD RIAPVTGLDP RAVVDLARDL AAAPTSVVYG RVGVCTQEHG ALCAWLINVL NILTGNLDRP GGAMFPTPAV DIVAAASMLG AAGGMEPGRS RVRGLPGFNG ELPLSTLAEE IDTPGEGQVR GLIVSAGNPV LSGPNGRRLE AALPRLDFMV AIDRYVTETT RHADLILPAA MPLERDHYDV VFRAFGVRDT ARFQEAILPR PPGVREDWRI YCGLGERIAR RRGLEGHLSA LSLSALQAVT PRRILDLLLR LGPHGLSLNQ LARSPHAVDL GPLRPALPAR LRTPDKRIQL TPPVLLEALP GLAARLDMPQ PPSDALVLIG RRQLRSNNSW MHHLPRLQRG SNRCTLLIHE TDARRRGLAP GQTVEIRGRT GAVEAPVEIT NRIMPGVVSL PHGFGHGRIA AERAIPRDWI GASLNDLTDD TRLDRQSGAA AFSGVPVEVA AVPRELAEAS SSMLHPAPSQ T
|
| |