Gene Caul_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2067 
Symbol 
ID5899522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2207303 
End bp2208745 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content69% 
IMG OID641562556 
Productglycoside hydrolase family protein 
Protein accessionYP_001683693 
Protein GI167646030 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5520] O-Glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0826948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCA AAGTCCCTGT TCGACGCCCA GCCGCGCGCC TCGGCGCGGC GGCCCTGCTG 
CTAGGTCTGG CGGCCTGCGC GACCACGCCG CCGGTCGTCC CCACCCACGC AGGAGCATGG
ATCACCACGG CGGACCGGAG CCAGATGCTG GCGGCGCAGC CCGCCCTGGT CTTCGGGTCA
GAGGACGTCG CCCTGGGGCT GCCGGTGATC ACCGTCGACG CCGCGGAACG CCACCAGAGC
ATGGTCGGTT TCGGCGCGGC GATCACCGAC GCCTCGGCCT GGCTGATCCA GAACCGGCTG
ACGCCCGACC AGCGCGAGCA ATTGCTCAGG GAGCTCTATG GGCGGGGCGA GGGGGAGCTC
GGCTTCAGCT TCACGCGCCT GACGATCGGC GCCTCGGATT TCTCGTCCGA ACACTACAGC
CTCGACGACG CGCCGGGCGG CGCGGCCGAT CCGGAGCTGG CTCACCTGTC GCTGGGCCGT
CCGGCGCAGG CGGTCTTCCC GACCGTCAGG CAGGTCCTGG CGATCAATCC TGATCTCAAG
GTCATGGCCT CGCCCTGGAG CGCCCCGGCC TGGATGAAGA CCACCGGCTC GCTGATCAAG
GGCCAGCTGA AATCCGAGGC CTATCCGACC TATGCCCGCT TCTTCGTGCG CTATGTTGAC
GGGGCGGCCA AGCTGGGCGT GCCGATCGAC TATCTGAGCA TCCAGAACGA GCCGGACTTC
GAGCCGGAGA ACTATCCTGG CATGCGCTGG GGCGCGGCCG ATCGGGCGCG GTTCATCGGC
GAGAACCTGG GGCCGGCGTT CAAGCAACAT GGCGTGCGGA CCCGGATCCT CGAATGGGAC
CACAACTGGG ACCAGCCGCA GCAGCCCCTG ACCGCGCTGG CCGATCCGAA GGCCGCGCCG
TTCATCGCCG GCGTGGCCTG GCACTGTTAC GCCGGCGACG TCGCCGCCCA GGCCAAGGTC
GCCGGCGCCC ATCCCGACAA GGACGTGTTC TTCACCGAAT GCTCGGGCGG CGACTGGTCG
GGTCCGTTCG ACGAAAGCTT CGGCTGGCTG ATGCGCAACC TGGTGATCGG CTCCACCCGC
AACGGCGCCC GGGGCGTGCT GATGTGGAAC CTGGCGCTTG ACGAAACCCA CGGACCGCAC
AAGGGCGGAT GCGGGGACTG TCGGGGCGTG GTGACCATAG ACAGCCGCAC CGGCGCGATC
ACCCGCAACC CCGAATACTA CGCCTTTGGA CACGCCAGCC GGTTCGTGAG GCCGGGCGCC
GTACGGATCG ACTCGTCAGA AACCGCGAGC CTGCCGAGCG TCGCCTTCCG CAATCCCGAC
GGCGGTCGCG TGCTGGTCGT CTTCAATTCC GGCAAGGATC GCCAGGCGTT TAGCGTCCGC
GAGGGCGGTC GGGTCGCCAA GACCTCGCTG CCAGGCGGCG CCGCGGCGAC GTTTGTCTGG
TAG
 
Protein sequence
MTIKVPVRRP AARLGAAALL LGLAACATTP PVVPTHAGAW ITTADRSQML AAQPALVFGS 
EDVALGLPVI TVDAAERHQS MVGFGAAITD ASAWLIQNRL TPDQREQLLR ELYGRGEGEL
GFSFTRLTIG ASDFSSEHYS LDDAPGGAAD PELAHLSLGR PAQAVFPTVR QVLAINPDLK
VMASPWSAPA WMKTTGSLIK GQLKSEAYPT YARFFVRYVD GAAKLGVPID YLSIQNEPDF
EPENYPGMRW GAADRARFIG ENLGPAFKQH GVRTRILEWD HNWDQPQQPL TALADPKAAP
FIAGVAWHCY AGDVAAQAKV AGAHPDKDVF FTECSGGDWS GPFDESFGWL MRNLVIGSTR
NGARGVLMWN LALDETHGPH KGGCGDCRGV VTIDSRTGAI TRNPEYYAFG HASRFVRPGA
VRIDSSETAS LPSVAFRNPD GGRVLVVFNS GKDRQAFSVR EGGRVAKTSL PGGAAATFVW