Gene Information |
Locus tag | Caul_3661 |
Symbol | |
ID | 5901116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3951107 |
End bp | 3952513 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564172 |
Product | peptidase M28 |
Protein accession | YP_001685286 |
Protein GI | 167647623 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
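The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
START_BP = 3951107
END_BP = 3952513
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # True
print(computed_protein_length == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the listed gene length (1407 bp) and protein length (468 aa) agree with the start and end coordinates.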
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.21215 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCCTGT TCCGCACTCT GCTGACCGCC GCCGCCGTTT CAGCCCTGAT GGCCGGGGCG TCTCACGCCC AGGACGCTCG CACCGCCGAG GCGTTGCGGG ACAAGGCCCT GCTGGACCGC ACCGCCTGGG ATGTCACCGA GGACCTGACC ACCAGCATTG GCCCGCGCCT CGTCGGCTCG CCGGCCATGG CCCGCGCCAA GGACTGGGGC GTGGCCAAGT TCAAGGCCCT GGGCTTCACC AACGTCAAGG TCGAGGAATT CGCCAAGCCG TCGTGGACGC GGGGCGAGGA GAGCGCCGAA CTGGTCGCGC CCTATCCGAT GAAGCTGAGC ATCGTGGGCC TGGGCCGCAC GGTCCCGACC CCGGCCGCCG GCATCGAGGC CGAGGTCGCG CTGTTCAAGA CCTATGCCGA GCTGATCGCC GCGCCAGAGA GCGCCGTGAA GGGCAAGATC GTGGTGATCA CCCAGCCGAT GGTCCGCGCC CAGGACGGTG CCGGCTATGG CGTGGCCGGG ATCTCGCGCC GGTCCGGTCC AGTCGAGGCC GCCAAGCGCG GCGCCGTCGC CCTGCTGATC CGTTCGGTCT CGACCTCCGA CTCCACCGTG CCGCACACCG GCGTCACCGC CTTCGGCGAC GGCGTCGTCT CGATCCCCTC GGCCGCCCTG GGCGTGCCGG AAGCCGAACA GTTGGAGCGC CTAGCCGCCA AGGGTCCGCT GCGCATCAAG CTGAAGCTGG CGTCAACGAT CGACCCGGCC GACGTGGCCT GGAACATCTC GGGCGAGATC AAGGGCTCGG AAAAGCCCGA CGAGGTGATC GTCATTGGCG GCCACCTGGA CAGCTGGGAC GTCGGCACCG GCGCCCTGGA CGACGCCACG GGCATCGCCA TCACCACGGC CGCCGCCAAG CTGATCGGCG ACCTGCCCAA GCATCCCAAG CGCACTATCC GCGTGGTGAT GTTCGGCTCG GAAGAAAGCG GCGGCTCGTC GGAGGCCTAT CTGGCCGCCC ACAAGGACGA GGTGTCCAAG ATCGTCCTGG CCGGCGAGAG CGACAGCGGG GCCGACCGTA TCTACAGCCT GCAGATCCCC AAGGGTTCGG CGGGGCACCC GGCGATGCAG GCCGCCGCCC GCGTGCTGAC CCCGCTGAAG ATCTATGTCG ACCGCGCTCC GCCGGCCCAC GCCGGCGCGG ACATCGAGGG ACTGGAAGAA GCCGGCGTGC CGGTGATCGC CCTGAACCAG GACGCCAGTC GCTACTTCGA CTACCACCAC ACCATGGACG ACACTCTCAA CAAGGTGCGC CCGGACGAAC TGGCCCAGAA CGTCGCGGCG TGGGCCAGCT TCCTCTACCT GGTGGCCGAC AGCGACATCG ACTTCCGGAC GCTGAGCGCG GCCGCTCCTG CGGCTCCGGC GCACTGA
Protein sequence | MRLFRTLLTA AAVSALMAGA SHAQDARTAE ALRDKALLDR TAWDVTEDLT TSIGPRLVGS PAMARAKDWG VAKFKALGFT NVKVEEFAKP SWTRGEESAE LVAPYPMKLS IVGLGRTVPT PAAGIEAEVA LFKTYAELIA APESAVKGKI VVITQPMVRA QDGAGYGVAG ISRRSGPVEA AKRGAVALLI RSVSTSDSTV PHTGVTAFGD GVVSIPSAAL GVPEAEQLER LAAKGPLRIK LKLASTIDPA DVAWNISGEI KGSEKPDEVI VIGGHLDSWD VGTGALDDAT GIAITTAAAK LIGDLPKHPK RTIRVVMFGS EESGGSSEAY LAAHKDEVSK IVLAGESDSG ADRIYSLQIP KGSAGHPAMQ AAARVLTPLK IYVDRAPPAH AGADIEGLEE AGVPVIALNQ DASRYFDYHH TMDDTLNKVR PDELAQNVAA WASFLYLVAD SDIDFRTLSA AAPAAPAH
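The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a rough sketch of how one might strip the block spacing, compute a GC percentage comparable to the GC content field, and run a simple sanity check on a coding sequence. All function names are hypothetical, and the start codon set is limited to the three most common bacterial start codons, which is an assumption rather than the full set allowed by translation table 11.

```python
# Assumption: the three most common bacterial start codons only.
START_CODONS = ("ATG", "GTG", "TTG")
STOP_CODONS = ("TAA", "TAG", "TGA")


def clean_sequence(raw: str) -> str:
    """Remove the block spacing (and any other whitespace) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """True if length is a multiple of 3 and start/stop codons are present."""
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence above, used only as a short demo.
first_blocks = "ATGCGCCTGT TCCGCACTCT"
print(clean_sequence(first_blocks))  # ATGCGCCTGTTCCGCACTCT
print(gc_percent("ATGCGCCTGT"))      # 60.0
```

To check the whole record, paste the complete Gene sequence value into `gc_percent` and `looks_like_complete_cds`; the short fragment here is only for illustration and says nothing about the full-length result.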