Gene Caul_2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2947 
Symbol 
ID5900402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3196644 
End bp3198839 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content72% 
IMG OID641563444 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_001684572 
Protein GI167646909 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.975721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCGC CGACCAGCAT GGTGCGCACT GGCGTCTGTC CTCTTTGCGA AGCGATGTGC 
GGCCTGGAAA TGACCGTCGA CGGCGACCGC GTGGTCGGCG TGCGGGGCGA CGCCAGCGAC
CCCTTGAGCG CCGGCTTCCT GTGCCCCAAG GCCGTCGCCC TGGAGGACCT GTCGGCTGAT
CCGCGCCGCA TCGATCAACC CATGCTGCGG ACCGGCGACC AGCGCCGGCC GATCCCCTGG
GACGAGGCCC TGGACCTGGC CGGGACCAGG CTGGGCGACC TCCAGCGCCG ACATGGCGTC
GACGCGGTGG CCTTCTACAG CGGCAATCCC AACCTTCACT CCTACGCCGC CCAGTTGTCG
GAACTGACCT TCAAGCGCGC CCTGCGCAGC CGCAACTGCA GCTCCACCGC CTCGGTCGAT
CACCTGCCGC ACATGGTGGC CGCCGCGGCG ATGTTCGGCC ACCCGCTGCT GCTGCCCGTG
CCGGACCTCG ACCGCACTGA CTTCTTCCTG GTCCTGGGCG CCAATCCCGC CGACTCCAAC
GGCAGCCTGA TGGGCGCGCC AGGCATGCCG CTTCGGCTGC ACAAGCTGCG CCAGCGCGGG
ACCCGGGTGG TGGTGGTCGA TCCCTGCCGT ACCCGCACCG CCGACCTGGC CGACACCCAC
CTGTTCATCC GCCCGGGCAC GGACGCCCTG CTGCTGCTGG CCCTGGTCCG CATCCTGTTG
ACCGAAAACC TGGTCAGGCT TGGGCGGCTG TCCGGCCTGG TCGACGCCGT CGAGGCCCTG
CGCCAGGCCA GCGCGCCGTT CTCGCCGGAC CGCATCGCGC CGGTCACCGG CCTGGATCCT
CGCGCCGTCG TCGATCTGGC GCGAGACCTC GCCGCCGCGC CGACGTCGGT CGTCTATGGC
CGCGTCGGGG TCTGCACCCA GGAGCACGGC GCGCTCTGCG CCTGGTTGAT CAACGTCCTG
AACATCCTGA CCGGCAATCT CGACCGCCCC GGCGGCGCGA TGTTCCCCAC CCCGGCCGTC
GATATCGTCG CCGCCGCCTC GATGCTGGGC GCGGCCGGTG GGATGGAGCC CGGGCGCAGC
CGGGTGCGCG GTCTGCCCGG CTTCAACGGC GAGCTGCCGC TCTCGACCCT GGCCGAGGAG
ATCGACACCC CGGGCGAGGG CCAGGTGCGG GGCCTGATCG TCTCGGCCGG CAATCCGGTG
CTGTCCGGCC CCAACGGCCG ACGACTGGAA GCGGCCCTGC CACGTCTCGA CTTCATGGTC
GCCATCGATC GCTATGTGAC CGAGACCACT CGCCATGCCG ACCTCATCCT GCCGGCCGCC
ATGCCGCTGG AGCGCGACCA CTACGACGTG GTGTTCCGGG CCTTCGGCGT GCGCGACACC
GCTCGCTTCC AGGAAGCGAT CCTGCCCCGG CCGCCCGGCG TACGGGAGGA CTGGCGGATC
TATTGCGGCC TAGGCGAGCG GATCGCCCGC CGGCGGGGTC TGGAAGGTCA CCTTTCGGCC
CTGTCGCTGT CGGCCCTGCA GGCCGTCACC CCGCGCCGCA TCCTCGATCT GCTGCTGCGC
CTGGGTCCCC ACGGCCTGAG CCTGAACCAG CTGGCGCGGA GCCCTCACGC CGTGGATCTC
GGCCCGCTGC GCCCCGCCCT GCCCGCGCGG CTGCGCACGC CCGACAAGCG GATTCAGCTC
ACGCCGCCAG TCCTGCTCGA AGCCCTTCCG GGCCTGGCCG CGCGCCTGGA CATGCCGCAA
CCGCCCAGCG ACGCCTTGGT GCTGATCGGC CGCCGGCAGC TGCGCAGCAA CAACAGCTGG
ATGCACCACC TGCCTCGCCT GCAGCGGGGC AGCAATCGCT GCACCCTGCT GATCCATGAG
ACCGACGCCC GCCGCCGGGG CCTGGCCCCT GGTCAGACGG TCGAGATCCG CGGCCGTACC
GGCGCCGTCG AGGCGCCGGT CGAGATCACC AACAGGATCA TGCCCGGCGT CGTCAGCCTG
CCGCACGGCT TTGGCCACGG ACGCATAGCC GCCGAGCGAG CGATCCCGCG GGATTGGATC
GGCGCCAGCC TCAACGACCT TACTGACGAC ACGCGTCTGG ACCGCCAGTC CGGCGCGGCC
GCTTTCAGCG GCGTGCCGGT CGAGGTCGCC GCCGTTCCTC GCGAACTCGC CGAAGCGAGC
TCGTCCATGC TCCACCCCGC GCCCAGTCAA ACCTAG
 
Protein sequence
MSPPTSMVRT GVCPLCEAMC GLEMTVDGDR VVGVRGDASD PLSAGFLCPK AVALEDLSAD 
PRRIDQPMLR TGDQRRPIPW DEALDLAGTR LGDLQRRHGV DAVAFYSGNP NLHSYAAQLS
ELTFKRALRS RNCSSTASVD HLPHMVAAAA MFGHPLLLPV PDLDRTDFFL VLGANPADSN
GSLMGAPGMP LRLHKLRQRG TRVVVVDPCR TRTADLADTH LFIRPGTDAL LLLALVRILL
TENLVRLGRL SGLVDAVEAL RQASAPFSPD RIAPVTGLDP RAVVDLARDL AAAPTSVVYG
RVGVCTQEHG ALCAWLINVL NILTGNLDRP GGAMFPTPAV DIVAAASMLG AAGGMEPGRS
RVRGLPGFNG ELPLSTLAEE IDTPGEGQVR GLIVSAGNPV LSGPNGRRLE AALPRLDFMV
AIDRYVTETT RHADLILPAA MPLERDHYDV VFRAFGVRDT ARFQEAILPR PPGVREDWRI
YCGLGERIAR RRGLEGHLSA LSLSALQAVT PRRILDLLLR LGPHGLSLNQ LARSPHAVDL
GPLRPALPAR LRTPDKRIQL TPPVLLEALP GLAARLDMPQ PPSDALVLIG RRQLRSNNSW
MHHLPRLQRG SNRCTLLIHE TDARRRGLAP GQTVEIRGRT GAVEAPVEIT NRIMPGVVSL
PHGFGHGRIA AERAIPRDWI GASLNDLTDD TRLDRQSGAA AFSGVPVEVA AVPRELAEAS
SSMLHPAPSQ T