Gene Caul_0936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0936 
Symbol 
ID5898391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp986693 
End bp988333 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content70% 
IMG OID641561419 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_001682565 
Protein GI167644902 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.789253 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGC AAACGATCCA GAACCCCATC CTGCGGGGCT TCAATCCCGA CCCGTCGATC 
ATCCGAGTGG ATGAGGACTA CTACGTCGCC ACCTCGACCT TCGAGTGGTT CCCGGGCGTA
CAGATCCACC ACTCGCGCGA CCTGGTGAAC TGGCGGCTGC TGACCCGGCC CCTGACCCGG
GCGAGCCAGC TGAACATGCT GGGCGCGTCC GACGGCTGCG GGGTCTGGGC CCCGTGCCTG
ACCCACGCCG ACGGCAAGTT CTGGCTGATC TATACCGACG TGAAGCGCTA TGGCCGCACC
ACGGTGGGCG GGGCCTCGGG CGCATCCCTG CGCGACTTCC ACAACTATCT GGTCACCGCC
GACCACATCG AGGGACCGTG GTCGGACCCG GTCTATCTCA ACAGCAGCGG CTTCGACCCG
TCGCTGTTCC ACGACGACGA CGGCCGCAAG TGGCTGCTGA ACCAGCTGTG GGACCACCGG
CCGGGCCGCA ACCGCTTCGC CGGCATCGTG GCCCAGGAAT ACGACGCCGC CGCCCAGGCG
CTGGTGGGGC GGCGCGTGAA CATCTTCCCC GGCACCCCGC TGGGCCTGAC CGAGGCGCCG
CATCTCTACA AGCGGGACGG CTGGTACCAC CTGATCACCG CCGAGGGCGG CACCGGCTTT
GGCCACGCGG TGACCATGGC CCGGTCCCGA ACCTTGGAGG GTCCCTACGA GGTTCACCCC
GACGGACCGG TCCTGACCGC CCGCGACCGC CCGCATGCGC CGCTGCATCG GGCCGGCCAC
GCCGATCTGG TCGAGACGGC CGACGGCCAG ACCTGGATGG CCTATCTGTG CGGCCGGCCG
CTGCCCAACC GGGGACGCTG CGTGCTGGGC CGCGAGACGG CGATCCAGCC GATGACCTGG
GGCGACGACG GCTGGCTGCG CACCCGCGAC GGCTCGGGCG ATCCGGAGCT GACCCCGCCG
TCGCCCGGCC TGCCGCCCGC GCCGTTCCCG GCCGCGCCGG CCTGGGAGGA TTTCGACGGC
CCCGACCTGC CCCTCGACTT CCAGTGGCTG CGCTCGCCGT TCCCCGAGGA GCTGTTCAGC
CTGACGGCCC GGCCGGGCTG GCTGCGGCTG TTTGGCCGCG AGACGATCGG TAGCCAATAC
CGCCAGGCCC TGGTCGCCCG CCGCCAGCAG GCGTTCTGCT ACTCCGCCCG CACGGTGTTG
GACTTTTCGC CCGAGCACTT CCAGCAGGTG GCCGGGCTGC TCTGCTACTA TGGGGCAAGC
AAGTTCCACT ACCTGTTCGT GTCGCGCGAC GACGAGAACG GCCGCCATGT CCAGGTGATG
TCGGCCCTGC CCGACAGTCC CCAGGCCGAC GCCTTCACCC CGCCGATCCC GCTGCCCGAC
ACGGGCCTGG TCCACCTGCG CGTCGAGGTC GATTTCGAGC GTCTGCGCTT CGCCTTCAGC
CTGGACGGCC AGGCCTGGAC TTGGCTGGAG CAGGTGTTCG ACGCCTCGAT CCTGTCCGAC
GAGGCCACCA GCCCCGGCGC GCCCAACTTC ACCGGCGCCT TCGTCGGCAT GGCCTGCCAG
GACTTGGCCG GCACGGCGCG GGCGGCCGAT TTCGACGGGT TTGGGTATGT CGAGCGCGAG
TATCAGGCGA CGGTGGGGTG A
 
Protein sequence
MTPQTIQNPI LRGFNPDPSI IRVDEDYYVA TSTFEWFPGV QIHHSRDLVN WRLLTRPLTR 
ASQLNMLGAS DGCGVWAPCL THADGKFWLI YTDVKRYGRT TVGGASGASL RDFHNYLVTA
DHIEGPWSDP VYLNSSGFDP SLFHDDDGRK WLLNQLWDHR PGRNRFAGIV AQEYDAAAQA
LVGRRVNIFP GTPLGLTEAP HLYKRDGWYH LITAEGGTGF GHAVTMARSR TLEGPYEVHP
DGPVLTARDR PHAPLHRAGH ADLVETADGQ TWMAYLCGRP LPNRGRCVLG RETAIQPMTW
GDDGWLRTRD GSGDPELTPP SPGLPPAPFP AAPAWEDFDG PDLPLDFQWL RSPFPEELFS
LTARPGWLRL FGRETIGSQY RQALVARRQQ AFCYSARTVL DFSPEHFQQV AGLLCYYGAS
KFHYLFVSRD DENGRHVQVM SALPDSPQAD AFTPPIPLPD TGLVHLRVEV DFERLRFAFS
LDGQAWTWLE QVFDASILSD EATSPGAPNF TGAFVGMACQ DLAGTARAAD FDGFGYVERE
YQATVG