Gene Information |
Locus tag | Caul_1803 |
Symbol | |
ID | 5899258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1905260 |
End bp | 1906336 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562293 |
Product | acetamidase/formamidase |
Protein accession | YP_001683430 |
Protein GI | 167645767 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
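The listed lengths can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1905260
END_BP = 1906336


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of 3; the stop
    codon, if included in the CDS, does not encode an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1077
print(protein_length(length_bp))  # 358
```

Under these assumptions the results agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields.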
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0385546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTCA TCTGCGAACC CGGCCAGACC CGTCCCGACG TGATGCCCGG CGCCCTGCAC ACGCTGAAGG CCACGCCGGC CACCGTGCAC TGGGGCTATT TCGATCCGTC CATCAAGCCG TCGCTGCGGA TCAAGAGCGG CGACCTGGTC AGCGCCGAGG CGATCACCCA CCACGCCGGC GACGCCCCCG ACCTGATGAT GGACGAGGCG GTCACCCGCA TCTTCACCGA GATCCCCGAG GACGACCGCA ATCCCGGCGT CCACATCATG ACCGGCCCGA TCTATGTCGA GGACGCCAAG CCCGGCGACG TGCTGGAGGT ACGCTACCTG CGCATGGTCC CGCGCAACAA CTACGGCTCC AACCTCGCGG CCAACTGGGG CTATCTCTAC AAGGAGTTCG GCGAGAAGGA GCGGGTGACG ATCTACGAGC TGGATCAGAA CACCAACACC GCCAGCGCCC TCTACGCCTA CGACTTCGAA GGCAAGTACC TGATCCCCGG GGCGATCACC AACTGCCCCG AGTGCGACCG CCAGCCGGCC CTGGGCGGCA TCCGCGTGGC GGCCCGTCCG CACCTGGGCA CGGCCGGCGT GGCGCCCGCC GTCGACGGCC GGGTCAGCAC CATCCCGCCC GGCGCCCACG GCGGCAATAT CGACAACTGG CGGATCGGGG CGGGGGCGAC CATGTACTAC CCCGTCCAGG TCGAAGGCGG GCTGTTCTCG ATCGGCGACC CCCACGTCAG CCAGGGCGAC GGCGAAATCT CCGGCACGGC CATCGAGAGC TCGCTGAACG TCCTGATGCA GATCGTGCTG CGCAAGGACT TCGTCTCGCC CGGACCCTTG CTGGAGACGC CTAAGTACTG GATCGTCCAC GGCTTCGACG AGGATCTTAA TGTCGCCATG CGCGACGCCT CGCTGAACAT GCTGACCCTG CTGAGCGACC ATGTGGGCCT GTCGAAGAAC GACGCCTATT CGCTGATGAG CGTGGCTTCC GACTTCGGCG TCACCCAGGT GGTCGATGGC AAGCAGGGCT GCCATGTGCG CATTCCTCGC GACATCTTTC CCAAGATGAA GGGCTAA
|
Protein sequence | MPFICEPGQT RPDVMPGALH TLKATPATVH WGYFDPSIKP SLRIKSGDLV SAEAITHHAG DAPDLMMDEA VTRIFTEIPE DDRNPGVHIM TGPIYVEDAK PGDVLEVRYL RMVPRNNYGS NLAANWGYLY KEFGEKERVT IYELDQNTNT ASALYAYDFE GKYLIPGAIT NCPECDRQPA LGGIRVAARP HLGTAGVAPA VDGRVSTIPP GAHGGNIDNW RIGAGATMYY PVQVEGGLFS IGDPHVSQGD GEISGTAIES SLNVLMQIVL RKDFVSPGPL LETPKYWIVH GFDEDLNVAM RDASLNMLTL LSDHVGLSKN DAYSLMSVAS DFGVTQVVDG KQGCHVRIPR DIFPKMKG
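Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content using only the Python standard library. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which does not matter here because this gene starts with ATG).

```python
# Standard codon-to-amino-acid assignments in TCAG order
# (the same assignments are used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGCCCTTCA TCTGCGAACC CGGCCAGACC"
print(translate(first_blocks))  # MPFICEPGQT
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete Protein sequence and GC content fields to be checked in the same way.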
|
| |