Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4090 |
Symbol | |
ID | 5901552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4438481 |
End bp | 4440325 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564610 |
Product | alpha amylase catalytic region |
Protein accession | YP_001685712 |
Protein GI | 167648049 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.904888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAATCC AGAGCGCATG GCGGAGGGGA TCGGCGAGCG CCTTGCTGGC GGTGGTCGCC TTGGGCGCGG GCCTGGCCCA GGCCGCCCCG GCCGCCCGTC CCGCCCGTCC CGCCTATCTC GACCGCGCGC CGCGGGACGA GGTGATCTAC TTCGTCCTGC CCGATCGGTT CGCCAACGGC GACGCGGCCA ATGATCGCGG CAACCTCGCG GGAGATCGCT TGAAGACGGG CTTCGATCCG GCCGACAAGG GCTTCTTCCA CGGCGGCGAC CTAGCCGGCC TGACCGCCAG GCTGGGCTAT ATCCAGGGCC TGGGCGCGAC GGCGATCTGG CTGGGACCGA TCTTCAAGAA CAAGGTGGTC CAGGGGCCGC CGGGCCAGGA GTCGGCCGGC TATCACGGCT ATTGGATCAC CGACTTCACC GACGTGGACC CGCACTTCGG CACGCGAGCC CAGATGAAGA GCTTCGTGGA CGCCGCCCAT GCGCGCGGCC TGAAGGTCTA TCTGGACATC GTCATCAACC ACACCGCCGA CGTCATCCAG TACCGCGACT GCCCGGCCGG CGGCTGCGAC TACCGGTCCA AGGCCGACTA TCCCTTTGTC CGCAAGGGTG GGCCACAGGG ACCGGCGATC AACGACGGCT TCCTGGGCGA CCAGGCCAAG CGGCAGACGG CCGGCAACTT CGCGCGGCTG ACGCGGCCCG ACTACGCCTA CACGCCCTTC GTCCCCAAGG ACGAGGAGGG CGTCAAGAAG CCGGCGTGGC TGAACGATCC GATCTGGTAC CACAATCGCG GCGACAGCCG GTTCGTGGGC GAGAGCTCGA CCTATGGCGA CTTCTCGGGT CTGGACGACG TGGCCACCGA GAACCCGCGC GTGGTGCAGG GCTTCATCGA CATCTACGGC CAGTGGATCG ACGACTTCGG CGTCGATGGC TACCGGATCG ACACCGCCCG CCACGTGAAC CCCGAGTTCT GGCAAGCCTT CGTGCCGGCC ATGCTGGCCC GGGCCAAGGC GCGCGGCATC CCGAACTTCC ACATCTTCGG CGAGGTCGCC GAGACCGAAC CGGGCATGTT GGCCAATTTC ACGCGGGTGG ACGGCTATCC GGCGGTGCTC GACTTCGCCT TCCAGGGCGC GGTGGCCGAT GTCGTCAACG GCAAGGTCGG GACCGACCGC CTGGCCCACC TGTTCGCCCA GGACGCGCTC TATCAGGGCG GTGAGGCGGC GGCCCTGCAG CTGCCGACCT TCCTGGGCAA TCACGACATG GGCCGCATCG GCCACTTCGT TCGCGGCGCC CACCCCGAGG CTTCGGAGGA CGAGATCGCC CGGCGCGTCG TCCTGGCCCA CGCCTTCCTG ATGTTCACCC GGGGCGTGCC GGTGGTCTAT TACGGCGACG AGCAGGGATT CGCTGGCGTC GGGGGCGACA AGGACGCCCG CCAGGACATG TTCGCCAGCC AGGTGGCGGC CTACAACGCC GACAAGCTGG TCGGCGGCGC GCCGGCGACC GGGGATCACT TCAAGACCGA CACCGTGCTT TACCAGGCGA TCTCGGCGAT GGCCGGGCTG CGCCAGGCCA ATCCGGCGCT GCGCGGCGGC CGGCAAGTGG TGCGGGCCTC CAGCGACAAG CCGGGCTTGT TGGCGATCTC GCGATCCACT GGCGCCGGCG AGACCCTTGT GGTGTTCAAT ACCGGGCTGA CGCCGCTCGA GGCCCAGATC GAGGTCGATG CGACCTCGCG GACCTGGCGC GCCGCCCACG GAGCCTGCGC CGCCGCCGCC TCGGCGCCGG GCAGCTATAG GGTCCAGATC GGCCCGCTCG ACTACATGAT TTGCGTTTCG GAGGCTGGCC AGTGA
|
Protein sequence | MQIQSAWRRG SASALLAVVA LGAGLAQAAP AARPARPAYL DRAPRDEVIY FVLPDRFANG DAANDRGNLA GDRLKTGFDP ADKGFFHGGD LAGLTARLGY IQGLGATAIW LGPIFKNKVV QGPPGQESAG YHGYWITDFT DVDPHFGTRA QMKSFVDAAH ARGLKVYLDI VINHTADVIQ YRDCPAGGCD YRSKADYPFV RKGGPQGPAI NDGFLGDQAK RQTAGNFARL TRPDYAYTPF VPKDEEGVKK PAWLNDPIWY HNRGDSRFVG ESSTYGDFSG LDDVATENPR VVQGFIDIYG QWIDDFGVDG YRIDTARHVN PEFWQAFVPA MLARAKARGI PNFHIFGEVA ETEPGMLANF TRVDGYPAVL DFAFQGAVAD VVNGKVGTDR LAHLFAQDAL YQGGEAAALQ LPTFLGNHDM GRIGHFVRGA HPEASEDEIA RRVVLAHAFL MFTRGVPVVY YGDEQGFAGV GGDKDARQDM FASQVAAYNA DKLVGGAPAT GDHFKTDTVL YQAISAMAGL RQANPALRGG RQVVRASSDK PGLLAISRST GAGETLVVFN TGLTPLEAQI EVDATSRTWR AAHGACAAAA SAPGSYRVQI GPLDYMICVS EAGQ
|
| |