Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4786 |
Symbol | |
ID | 5902248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5167729 |
End bp | 5168949 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641565306 |
Product | hypothetical protein |
Protein accession | YP_001686404 |
Protein GI | 167648741 |
COG category | [S] Function unknown |
COG ID | [COG2311] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.6743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAAGG ATCGTATCGT TCTGCTGGAC TCCCTGCGGG GCCTGGCGGT GTTGGGCATC CTCCTCTGCA ATATTCCCCT GGTGGCGGTC CCAGAGGCGG TCGGGGTCAG CCTGACCCTG TGGCCGCACG GCATGGCGCC GGCCTCGGTG GCGGTCTGGC TCGTCACCCA GCTGTTCTTT CAGCAGAAGT TCTACTCGCT GTTCGCCATG CTGTTCGGCG CCTCGATCCT GCTGGTCGGC GGCGAGGGCG GCGATGGCGA CCGTCGCAGG ATCCTGATCC TGCGCCTGGT CAGCCTGCTG GCCATCGGCC TGTTCCACGG CTTCGTCATC TGGCAGGGCG ACGTGCTCAA CACCTATGCG ATCGTCGGCC TGCTGGCGAT GTGGGCGCGC TCGTGGCCGG CCAAGCGCCT GCTCCAGGCA GGGATCGGCC TGCATCTTGG TCTGTCGGCC TGGAGCGGCT GGAACCTGCT GACGCGCGTC GCCAAGGGCG GCGGCGATCC GCCCCCCGCG GCCATGGCCA AATACCTGGC TGAAGCCCAG GCCGACGGCG CGCAATTTGC GGGAACCTTC GCCCAGTCCC TGGTCCAGAA CGCCAAGGAC TATGGCGAGT TCGTGGTCGG GTCGTTCACC CACTGGCCGC CGACCTGGCC GCTGCTGGTG CTGTCGCTGA TCCTGATCGG CATGGGCCTC TACAAGCTCG GCGTCCTGAC CGGCAAGGCC TCGACGGGCC TCTATCAGGG GCTGATTGGC GCGGGTCTCG GCGCGCTGGT GCTCGCCGGC ATGGCCGAGA CGATATACGT GCTGCTGCCG AGCCACGACT GGACGATCCG CGGCGTGGCC CGCTGGCTGC AGAGCGCCAC CGCCCCGGTG GTCACCCTGG GCTATGTGGG CCTGATGGTG CTGGCGACGC GGACCCGGGT CTGGAAGGCA ATCCCCGCCG TGCTGGCCCC GGTCGGCCAG ATGGCCTTCA CCAACTATCT GACCCAGTCG ATCCTGATGA CCGTGTTGCT GTATGGCGGG CGCGGGCCGG GCCTGTACGG CAAGGTCGAT CGCCCCGCAC TGGCCTTGGC GGTCCTGGCC ATCTGGACCC TGCAGATCCT GTGGTCGCGC TGGTGGATGG CGCGCTTCAC CATGGGGCCG CTGGAGTGGC TGTGGCGGCT GGCCTATCGC GGGCCGATGC CGCTGCGTCG CGCGCCGGCG ACGGCTGCGG TCACGGCCTA G
|
Protein sequence | MVKDRIVLLD SLRGLAVLGI LLCNIPLVAV PEAVGVSLTL WPHGMAPASV AVWLVTQLFF QQKFYSLFAM LFGASILLVG GEGGDGDRRR ILILRLVSLL AIGLFHGFVI WQGDVLNTYA IVGLLAMWAR SWPAKRLLQA GIGLHLGLSA WSGWNLLTRV AKGGGDPPPA AMAKYLAEAQ ADGAQFAGTF AQSLVQNAKD YGEFVVGSFT HWPPTWPLLV LSLILIGMGL YKLGVLTGKA STGLYQGLIG AGLGALVLAG MAETIYVLLP SHDWTIRGVA RWLQSATAPV VTLGYVGLMV LATRTRVWKA IPAVLAPVGQ MAFTNYLTQS ILMTVLLYGG RGPGLYGKVD RPALALAVLA IWTLQILWSR WWMARFTMGP LEWLWRLAYR GPMPLRRAPA TAAVTA
|
| |