Gene Caul_5274 details

Gene Information

Locus tag: Caul_5274
Symbol:
ID: 5897458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 213701
End bp: 214804
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 68%
IMG OID: 641555377
Product: glycosidase PH1107-related
Protein accession: YP_001676708
Protein GI: 167621923
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2152] Predicted glycosylase
TIGRFAM ID:
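
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information fields above.
length_bp = gene_length(213701, 214804)
print(length_bp)                  # 1104
print(protein_length(length_bp))  # 367
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.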


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000268053
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCAAGC GCGCGACCGT GCCAGATTGG GCCATCGGCC CGTTTTCGCC ACCCCAACGG 
ATCCTGGGTC CGCGACCGGA CCTGCGCTTT GACTGTCCTG TCTCGGGCGA CGCCGTCGCT
TGGGCCGCCA AGGACGTGTT CAACGCCGGC GCGGTCGTCC ACGAGGGCCG GGTCTGCCTT
CTGGTGCGGG CTGAAGACAC CGTTGGCCGA TACGCTGGCG TCTCGCGCAT CGGGCTGGCC
ACTAGCGCCG ACGGCGTGAC GTTCGACCTT GAGTCCAAGC CGGTCCTTTA TCCGGACGAC
GACCGCTGGC AGGCCTGGGA GTGGCCGGGG GGCCTGGAGG ATCCGCGTAT CGTGGTCGGG
CCCGACGGCA CGTTCGTCTG CGCTTACACC GCGTTCGACG GGAAGGTCGG CTGCTTGTTC
ATCGCCACCT CCCGCGATCT TCGCCAGTGG ACCAAGCACG GTCCGGCCTT TGCCGGGTCG
CCCTATGCGC GCCTGGCCAC CAAGGCCGGC GCAATCGTCA CCGAGCTGGT CGAGGGGCGA
CTGGTGGCCG CGCGCATCGA CGGCCGCTAT TGGATGTACT GGGGCGAGGG CGCGCTTTAT
GCGGCGACCT CCGAGGACCT GGTGCGCTGG ACCCCGGTGG AAGCCGACAG CGCGCCGGAC
AAGTACCTGA CTTGGGATCC CGAACACCGC GGCCCGATGG GCGCCTGGAC CTTGGAACGT
CCACCTGGAC CCAGGGGCGT CCGCCTCCTG GCGGGACCGC GCAGGCATCG CTTCGATTCC
CTGTTGGTCG AGCCAGGCCC GCCGGCGATC TTGACGCCTG AGGGCGTGGT TCTGATCTAC
AACGGCGGCA ATCACGTGGT GGATGGCGAC CCCGACATCG AGCCCTTCGC CTATCAGCCC
AGCCAGATGC TGTTCGACGC CCGCGATCCC ACGGCCCTGA TCGCCCGGGC GCGCGAGCCG
TTCCTGGGTA TTCCCAGGCA CGAGGCCGAG GGGCAGGTGG GCAACGTCTG TTTCGCGCAG
GGTCTGGTGA CCTTCCAGGG GCAATGGCGA CTTTATCTGG GCCTTGCCGA CTCCAGGCTG
GGGGTTTCCA CAGCGCCATT CTGA
 
Protein sequence
MTKRATVPDW AIGPFSPPQR ILGPRPDLRF DCPVSGDAVA WAAKDVFNAG AVVHEGRVCL 
LVRAEDTVGR YAGVSRIGLA TSADGVTFDL ESKPVLYPDD DRWQAWEWPG GLEDPRIVVG
PDGTFVCAYT AFDGKVGCLF IATSRDLRQW TKHGPAFAGS PYARLATKAG AIVTELVEGR
LVAARIDGRY WMYWGEGALY AATSEDLVRW TPVEADSAPD KYLTWDPEHR GPMGAWTLER
PPGPRGVRLL AGPRRHRFDS LLVEPGPPAI LTPEGVVLIY NGGNHVVDGD PDIEPFAYQP
SQMLFDARDP TALIARAREP FLGIPRHEAE GQVGNVCFAQ GLVTFQGQWR LYLGLADSRL
GVSTAPF
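
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the database's own method: it assumes that translation table 11 uses the same codon assignments as the standard genetic code (differing only in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first and last blocks of the gene sequence are used here; pasting in the full sequence works the same way.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence above.
first_block = "ATGACCAAGC GCGCGACCGT GCCAGATTGG"
last_block = "GGGGTTTCCA CAGCGCCATT CTGA"
print(translate(first_block))  # MTKRATVPDW
print(translate(last_block))   # GVSTAPF
```

The two outputs match the start and end of the protein sequence listed above; calling `gc_content` on the full gene sequence gives a value to compare with the GC content field.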