Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3384 |
Symbol | |
ID | 5900839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3656882 |
End bp | 3657832 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563890 |
Product | hypothetical protein |
Protein accession | YP_001685009 |
Protein GI | 167647346 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1462] Uncharacterized protein involved in formation of curli polymers |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.632344 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.255786 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGGT CGTCGAGTCT TGTGGCGGTC GCCGCCACCC TCTCCATTCT CGCGCCCGCC GGAGCCTTCG CCCAGGCCAA GCCGAGCAAG GCGCAGAACA TGCAAACCCA GGCCATGGCC GAGATCCCGC ATTGCGCCCG CAAACTGGGC ACGGTGTCGA TCATGGACGG CGACGACACG CGCGGCTGGA TGCAATACAA TCTGGCCTCG CCGCAAAAGC TGCTGAAGGC GATCGTCCAG AAGTCGGGCT GCTTCAACCT GGTCGACCGG GGCGCGGGTC TCAACGCCGC CCAGATCGAA CGCAATGTCG GCGGCAATCT GGGCCTGCAG CGCGGCTCCA ACGTCGGCCA GGGCCAGATC AAGGCCGCGG ACTACGTGCT GGTGGCCGAG GTCCAGGCCA GTGACAGCAA CGCGGGCGGC GGCGCGGTGG CCGGCGCCCT GGGCGGGCTC ATCGGCGGCC GGGCCGGCGG GCTGATCGGC GGCATCAAGA CCAAGAAGCT GGAGGCCAAC ACCGTCCTGT CCCTGACCAA TGTGCGCACC ACCGAGACGG TGGCCGTCCA GGACGGCTAC GCGGTGAAGA ACGACATCGG CTGGGGCGCG GGCGGCGGCT ACGGTTTCGC CGGCGCGGTC GGCGGCGGCT ATGAAAGCAC CGATATCGGC CGGATCGTCA CCCTGGCGTT CATCAACAGC TACACCAAGA TGGTCAGCGA TCTGGGCCTG CTGTCCAACG ACGTGCCCGC CGCCCAGGCC GCGCCGTCCA AGACCTTCGT GGCCACCCGG GTGGTCAACC TGCGGGCCTC CCCCGCGCCA GGCGGCAAGC TGCTGCGCGC CCTGCCCGCC GGCTCGACCG TCTATCCCAC CGGCAAGAAG CAGGACATGT GGTGGGAAGT GGCCGACGAG AACGACAATG TCGGCTGGGT GCTGAACGCC GGGCTCGAGC CCGCGCGGTA G
|
Protein sequence | MKRSSSLVAV AATLSILAPA GAFAQAKPSK AQNMQTQAMA EIPHCARKLG TVSIMDGDDT RGWMQYNLAS PQKLLKAIVQ KSGCFNLVDR GAGLNAAQIE RNVGGNLGLQ RGSNVGQGQI KAADYVLVAE VQASDSNAGG GAVAGALGGL IGGRAGGLIG GIKTKKLEAN TVLSLTNVRT TETVAVQDGY AVKNDIGWGA GGGYGFAGAV GGGYESTDIG RIVTLAFINS YTKMVSDLGL LSNDVPAAQA APSKTFVATR VVNLRASPAP GGKLLRALPA GSTVYPTGKK QDMWWEVADE NDNVGWVLNA GLEPAR
|
| |