Gene Caul_1040 details


Gene Information

Locus tag: Caul_1040
Symbol:
ID: 5898495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1099968
End bp: 1101290
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 69%
IMG OID: 641561522
Product: mannanase, putative
Protein accession: YP_001682668
Protein GI: 167645005
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3934] Endo-beta-mannanase
TIGRFAM ID:
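
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Sketch: check that the coordinate and length fields agree with each other.
# Assumption: Start bp and End bp are 1-based, inclusive positions.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

length_bp = gene_length(1099968, 1101290)
print(length_bp, "bp")                  # Gene Length field
print(protein_length(length_bp), "aa")  # Protein Length field
```

With the values listed for Caul_1040, this gives 1323 bp and 440 aa, matching the Gene Length and Protein Length fields.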


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.41477
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCGA GACGTCACCT GATAGCGACC GGCGCGGCCG CCATGGCCGC GGGCGGAGCC 
CACGCCGCCC CGCCCTCGCG GGACTTCGTC ACCGTCCACG AGGGCCGCCT GGCCCTGAAC
GGCAAGCCCT ATCGCTTCGT CGGCGCGAAC GTCTGGTACG GGGCCTGGCT GGGCTCGCCA
GGGGCGACGG GCGACGTCGC GCGGCTGGGG CGCGAGCTGG ACCGGCTGAA GGCCCTGGGC
GTCACCAACC TGCGAGTCCT GGGTTCGGGC GAGAAGTCGC CGGCCAAGGT GGCCATCGAC
CCCACCTTCC GCGGGCCGGG CCAGGACTAT AACCAGGACC TGCTCAAGGG CCTGGACGTG
CTGCTGGCCC AGATGGCCAA GCGCGACATG AAGGCAGTGA TCTACGTCAA CAACTTCTGG
GACTGGTCGG GCGGCATGCC GGCCTATCTG CGTTGGACCG GCAATGGCGA GTGGTTCCAG
CAGGGCGACC CCGCCCACCC CTGGCCGCAG TTCGCCGACT ATTCGGCCCG CTTCTATGGC
GACGCCAAGG CCCAGGCGCT GTTCCGTCAC TATGTCCGCG CCCTGGTCAC CCGCACCAGC
AGCGTCACCG GCAAGCCCTA TCGCGACGAT CCGACGATCA TGGCCTGGCA ACTGGCCAAC
GAACCCCGCC CCGGCGGCAG CGACGCCTTC GGGGTTCCCA ACCTGCCGAC CTATTACCGC
TGGATCGCCG AGACCTCGGC CTTCATCAAG ACGCTGGATC CGCACCACCT GGTCACCACC
GGCAGCGAGG GCGCCATGGG CTGTCTGCGG CGCGAGGCCT GCGTCGTCGA GGCCCACAAG
CCGGCCAGCA TCGACTACAT CACCCTGCAC GTCTGGCCCA ACAACTGGGG CTGGATCGAC
CCCAAGAACC AGACCGCCAC CTACGAGGCC GGCGAGGCCC GCTGCCGCGA CTATGTCGTC
GACCACATCG CCATCGCCCG CCAATTGGGA AAGCCGCTGG TGATCGAGGA GTTCGGCCTG
GTGCGCGACG GCCGCACGTT CGAGCCGGGC GGCCCCACGG TCTATCGCGA CCGGTTCTAT
TCCCGGATCT ACGCCCTGGC CCTGGCCGAC ATGCAGGTCG ACGGCCCGAT CGCCGGGACC
AACTTCTGGG CCTGGAACGG CGAAGGCCGC GCCCAGCACG ACGACGCCTG GTTCAAGATG
GGCGACAAGG CCTATGCCGG CGACCCGCCG CAGGAGGAGC AGGGCCTGTT TGGGGTGTTC
GACGCGGATG TATCGACGCT GAACGTGGTG CGGGAGCATG CGAAGGCGGT GGCGGCGCTT
TAG
 
Protein sequence
MLSRRHLIAT GAAAMAAGGA HAAPPSRDFV TVHEGRLALN GKPYRFVGAN VWYGAWLGSP 
GATGDVARLG RELDRLKALG VTNLRVLGSG EKSPAKVAID PTFRGPGQDY NQDLLKGLDV
LLAQMAKRDM KAVIYVNNFW DWSGGMPAYL RWTGNGEWFQ QGDPAHPWPQ FADYSARFYG
DAKAQALFRH YVRALVTRTS SVTGKPYRDD PTIMAWQLAN EPRPGGSDAF GVPNLPTYYR
WIAETSAFIK TLDPHHLVTT GSEGAMGCLR REACVVEAHK PASIDYITLH VWPNNWGWID
PKNQTATYEA GEARCRDYVV DHIAIARQLG KPLVIEEFGL VRDGRTFEPG GPTVYRDRFY
SRIYALALAD MQVDGPIAGT NFWAWNGEGR AQHDDAWFKM GDKAYAGDPP QEEQGLFGVF
DADVSTLNVV REHAKAVAAL
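
The sequences above are printed in space-separated blocks of ten characters. The following minimal Python sketch shows one way to strip that layout, compute GC content, and translate the gene sequence so the result can be compared with the listed protein sequence. It is an illustration, not the database's own pipeline. The function names are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids as the standard code for internal codons (the two differ in their permitted start codons, which does not matter here because this gene starts with ATG).

```python
# Sketch: clean a space-grouped sequence, compute GC content, translate it.
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
# Assumption: table 11 uses the standard assignments for internal codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()

def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)

def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCTGTCGA GACGTCACCT GATAGCGACC")
print(translate(fragment))  # MLSRRHLIAT
```

Passing the full gene sequence through `clean_sequence` and `translate` gives a string that can be checked against the cleaned protein sequence, and `gc_content` of the cleaned gene sequence can be compared with the GC content field.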