Gene Caul_2410 details

Gene Information

Locus tag: Caul_2410
Symbol:
ID: 5899865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2629526
End bp: 2631601
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 65%
IMG OID: 641562901
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_001684035
Protein GI: 167646372
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID:
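
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the unspliced CDS includes its stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length(2629526, 2631601)
print(length)                           # 2076
print(expected_protein_length(length))  # 691
```

Both results agree with the Gene Length (2076 bp) and Protein Length (691 aa) fields.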


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000106036
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.27509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATATG GATTGATGTT GGCCGCCTGC GCCGGCTTGA CGATGGCCGC CTCGACGGCG 
GCGATGGCCA CAACCCCGGT CAAGGTAGCG ATCGATGGCG CCAGCCGGGC CGCGCCGGTC
ACGAAATACG AATACGGCAT GTTCATCGAG CCCATCGGGG GGCTGGTGGC GCGCACGCTG
TGGGCGGAAA TGCTCGACGA CCGGAAGTTC TACTATCCCG TCATGGCGCC GGCCTTCGAC
AAGCCGCCGC CGCTTAACGC CGAAGGTCGG CCAGGCGTCA GCTATCGCAA GTGGCGGCCG
ATCGGCGGCG ACGCCGCGGT GACCATGGAC ACCAAGGCGC CCTATGTCGG GACGCAGAGC
CCTCGCGTCG CCGTCGATCC GTCGATGCGC AAGGGTTTCT CACAATCGGG AATCAGCGTG
GCCAAGGGAG AGCGCTACGA CGGCTACCTG CTGATGACGG GCGATCCAGG GGCGAGGGTC
GAGGCGGCGC TCGTCTGGGG TCCTGGACCC AATGATCGTC AGGCCATAGC CCTGCCCGAA
CCCGGTGACG ACTGGCGCCG AGCCGATTTC AGCTTCACGC CGACGGTGGA CGCGGCGGAC
GCCCGCCTGG AGATCACGGG CCTGGGTTCG GGATCCTTCC GAATCGGCGC GGTCTCGCTG
ATGCCCGCCG ACAACATTCG CGGCTGGCGC GCCGACACCA CCGCGATCGC CCGCTCGCTG
AATTCGGGCA TGTGGCGCCT GCCCGGCGGC AATTTCCTGT CGGATTGGGA CTGGCATGGC
GCCATTGGAC CGAGGGACAA GCGCGCGCCG ATGTTCGACC ACGCCTGGAG CGCGATGCAA
CCCAACGACA TCGGCATGGA CGAGTGGATG GATCTGACCA AGATCATCGG CGTCGAGCCC
TACGTCACGG TCAATGCTGG CCTTGGCGAC GCCAACTCGG CCGCCGAGGA GGTGGAATAT
CTAAACGGCT CGGCCAGCAC CCCCTGGGGC GCGCGCCGAG CCGCCAATGG CCATCCACAG
CCCTATGGCG TGAAGTACTG GAACATCGGC AACGAGCCGT ACGGCTGGTG GCAGATCGGC
AAGACCTCGC TCGACTATTT CATGATCAAG CACAATGCGT TCGCCGAGGC GATGCGCGCG
GTCGATCCGA CGATCACCCT GATCGGCTCG GGGGCCATGC CTGACCAAGG TCATCCGCGC
GGTACAAAGG AAAACGCGTC GATCGAGAGC GTCGCGCCGA AGTTCGGCAC CGAATGGGAT
TGGACCGGTG GGCTCTTGGA AAAGGCCTGG GGCAATTTCG ACGGCATTTC CGAGCATTGG
TACGATCAAC CCGAGGCTCG CCCCGACGCG CCCGCCGACG CTGAACTGAT CGAGTACGCT
CGCTCGCCCT CCAACCAGGT CCGGATGAAA GCCGATCAAT GGAAGATCTA CCAGAAACGC
TTCCCGGCCA TGAAGGACAA GGCGATCTTC CTGTCGATCG ACGAGTACGC CTATTTCGGC
CAAGTGAACC TGAAGTCAGC GCTGGCCTAC GCGATGGTCC TGCAGGAGAT GCTTCGCCAC
ACCGATTTCC TGACCATGGG AGCCTTCACC ACGGGGGTCT CGACCATGGA CATCACGCCC
ACCGACGCGG TGCTCAACAC GACGGGCCAG GTCTTCAAGC TCTATGGCGA ACATTTCGGC
GCCGGCGTCG TTCCTGTGAC GGTGCAAGGC GACTCGCCTC AGCCCGAACC GCGCTATCCC
GTCGGCTACA ATCACCCCAA GGTGCGCGCG GGAAGTCCGA CCTATCCGCT CGATGTAGTC
GCGGGCCTGA GCCCCGACGG CAAGACCCTG CGGATCGCCG TGGTGAATCC GACGCTCACG
CCCCAAACCC TGAAGCTTGA CCTGAAGGCG CTGGCCGCGC GGGGCGCTGG ACGCAAATGG
GCACTGAGCG GCGCGTCGCT GAACGCCCAG AACACGGTTG GCGCCGCAGC CGGAGTCACT
ATCACCCAGA GCGCAGCGCC GCGTCCGGGC GGCGAACTCG TCGTCTCGCC GATTTCGGCG
ACCGTGTTCG AGTTTCCGAT CGAGGCCAAG CGATAG
 
Protein sequence
MRYGLMLAAC AGLTMAASTA AMATTPVKVA IDGASRAAPV TKYEYGMFIE PIGGLVARTL 
WAEMLDDRKF YYPVMAPAFD KPPPLNAEGR PGVSYRKWRP IGGDAAVTMD TKAPYVGTQS
PRVAVDPSMR KGFSQSGISV AKGERYDGYL LMTGDPGARV EAALVWGPGP NDRQAIALPE
PGDDWRRADF SFTPTVDAAD ARLEITGLGS GSFRIGAVSL MPADNIRGWR ADTTAIARSL
NSGMWRLPGG NFLSDWDWHG AIGPRDKRAP MFDHAWSAMQ PNDIGMDEWM DLTKIIGVEP
YVTVNAGLGD ANSAAEEVEY LNGSASTPWG ARRAANGHPQ PYGVKYWNIG NEPYGWWQIG
KTSLDYFMIK HNAFAEAMRA VDPTITLIGS GAMPDQGHPR GTKENASIES VAPKFGTEWD
WTGGLLEKAW GNFDGISEHW YDQPEARPDA PADAELIEYA RSPSNQVRMK ADQWKIYQKR
FPAMKDKAIF LSIDEYAYFG QVNLKSALAY AMVLQEMLRH TDFLTMGAFT TGVSTMDITP
TDAVLNTTGQ VFKLYGEHFG AGVVPVTVQG DSPQPEPRYP VGYNHPKVRA GSPTYPLDVV
AGLSPDGKTL RIAVVNPTLT PQTLKLDLKA LAARGAGRKW ALSGASLNAQ NTVGAAAGVT
ITQSAAPRPG GELVVSPISA TVFEFPIEAK R
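
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The following is a minimal sketch of removing that layout whitespace and computing a GC fraction, which can be compared against the GC content field of this record. The helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of spaces or newlines.
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ATGCGATATG GATTGATGTT GGCCGCCTGC GCCGGCTTGA CGATGGCCGC CTCGACGGCG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.1%}")   # GC fraction of this first line only
```

Passing the complete Gene sequence block to `clean_sequence` in the same way would give the whole-gene value.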