Gene Caul_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4090 
Symbol 
ID5901552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4438481 
End bp4440325 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content69% 
IMG OID641564610 
Productalpha amylase catalytic region 
Protein accessionYP_001685712 
Protein GI167648049 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.904888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAATCC AGAGCGCATG GCGGAGGGGA TCGGCGAGCG CCTTGCTGGC GGTGGTCGCC 
TTGGGCGCGG GCCTGGCCCA GGCCGCCCCG GCCGCCCGTC CCGCCCGTCC CGCCTATCTC
GACCGCGCGC CGCGGGACGA GGTGATCTAC TTCGTCCTGC CCGATCGGTT CGCCAACGGC
GACGCGGCCA ATGATCGCGG CAACCTCGCG GGAGATCGCT TGAAGACGGG CTTCGATCCG
GCCGACAAGG GCTTCTTCCA CGGCGGCGAC CTAGCCGGCC TGACCGCCAG GCTGGGCTAT
ATCCAGGGCC TGGGCGCGAC GGCGATCTGG CTGGGACCGA TCTTCAAGAA CAAGGTGGTC
CAGGGGCCGC CGGGCCAGGA GTCGGCCGGC TATCACGGCT ATTGGATCAC CGACTTCACC
GACGTGGACC CGCACTTCGG CACGCGAGCC CAGATGAAGA GCTTCGTGGA CGCCGCCCAT
GCGCGCGGCC TGAAGGTCTA TCTGGACATC GTCATCAACC ACACCGCCGA CGTCATCCAG
TACCGCGACT GCCCGGCCGG CGGCTGCGAC TACCGGTCCA AGGCCGACTA TCCCTTTGTC
CGCAAGGGTG GGCCACAGGG ACCGGCGATC AACGACGGCT TCCTGGGCGA CCAGGCCAAG
CGGCAGACGG CCGGCAACTT CGCGCGGCTG ACGCGGCCCG ACTACGCCTA CACGCCCTTC
GTCCCCAAGG ACGAGGAGGG CGTCAAGAAG CCGGCGTGGC TGAACGATCC GATCTGGTAC
CACAATCGCG GCGACAGCCG GTTCGTGGGC GAGAGCTCGA CCTATGGCGA CTTCTCGGGT
CTGGACGACG TGGCCACCGA GAACCCGCGC GTGGTGCAGG GCTTCATCGA CATCTACGGC
CAGTGGATCG ACGACTTCGG CGTCGATGGC TACCGGATCG ACACCGCCCG CCACGTGAAC
CCCGAGTTCT GGCAAGCCTT CGTGCCGGCC ATGCTGGCCC GGGCCAAGGC GCGCGGCATC
CCGAACTTCC ACATCTTCGG CGAGGTCGCC GAGACCGAAC CGGGCATGTT GGCCAATTTC
ACGCGGGTGG ACGGCTATCC GGCGGTGCTC GACTTCGCCT TCCAGGGCGC GGTGGCCGAT
GTCGTCAACG GCAAGGTCGG GACCGACCGC CTGGCCCACC TGTTCGCCCA GGACGCGCTC
TATCAGGGCG GTGAGGCGGC GGCCCTGCAG CTGCCGACCT TCCTGGGCAA TCACGACATG
GGCCGCATCG GCCACTTCGT TCGCGGCGCC CACCCCGAGG CTTCGGAGGA CGAGATCGCC
CGGCGCGTCG TCCTGGCCCA CGCCTTCCTG ATGTTCACCC GGGGCGTGCC GGTGGTCTAT
TACGGCGACG AGCAGGGATT CGCTGGCGTC GGGGGCGACA AGGACGCCCG CCAGGACATG
TTCGCCAGCC AGGTGGCGGC CTACAACGCC GACAAGCTGG TCGGCGGCGC GCCGGCGACC
GGGGATCACT TCAAGACCGA CACCGTGCTT TACCAGGCGA TCTCGGCGAT GGCCGGGCTG
CGCCAGGCCA ATCCGGCGCT GCGCGGCGGC CGGCAAGTGG TGCGGGCCTC CAGCGACAAG
CCGGGCTTGT TGGCGATCTC GCGATCCACT GGCGCCGGCG AGACCCTTGT GGTGTTCAAT
ACCGGGCTGA CGCCGCTCGA GGCCCAGATC GAGGTCGATG CGACCTCGCG GACCTGGCGC
GCCGCCCACG GAGCCTGCGC CGCCGCCGCC TCGGCGCCGG GCAGCTATAG GGTCCAGATC
GGCCCGCTCG ACTACATGAT TTGCGTTTCG GAGGCTGGCC AGTGA
 
Protein sequence
MQIQSAWRRG SASALLAVVA LGAGLAQAAP AARPARPAYL DRAPRDEVIY FVLPDRFANG 
DAANDRGNLA GDRLKTGFDP ADKGFFHGGD LAGLTARLGY IQGLGATAIW LGPIFKNKVV
QGPPGQESAG YHGYWITDFT DVDPHFGTRA QMKSFVDAAH ARGLKVYLDI VINHTADVIQ
YRDCPAGGCD YRSKADYPFV RKGGPQGPAI NDGFLGDQAK RQTAGNFARL TRPDYAYTPF
VPKDEEGVKK PAWLNDPIWY HNRGDSRFVG ESSTYGDFSG LDDVATENPR VVQGFIDIYG
QWIDDFGVDG YRIDTARHVN PEFWQAFVPA MLARAKARGI PNFHIFGEVA ETEPGMLANF
TRVDGYPAVL DFAFQGAVAD VVNGKVGTDR LAHLFAQDAL YQGGEAAALQ LPTFLGNHDM
GRIGHFVRGA HPEASEDEIA RRVVLAHAFL MFTRGVPVVY YGDEQGFAGV GGDKDARQDM
FASQVAAYNA DKLVGGAPAT GDHFKTDTVL YQAISAMAGL RQANPALRGG RQVVRASSDK
PGLLAISRST GAGETLVVFN TGLTPLEAQI EVDATSRTWR AAHGACAAAA SAPGSYRVQI
GPLDYMICVS EAGQ