Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3224 |
Symbol | |
ID | 5900679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3484486 |
End bp | 3485496 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641563729 |
Product | hypothetical protein |
Protein accession | YP_001684849 |
Protein GI | 167647186 |
COG category | [S] Function unknown |
COG ID | [COG4093] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0804136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATA ACGCCGCCGC TCTGTCCCGC AAGCCCCGGC GTCGCGGTCT GCTGGCGCCT TTCGTGGTGT TGGCATTAGT TGCTCTGGGG TGGAGCGCCG GCTGGGTGTG GCTGCGCGGC CAGGCCGAGC AGCGAATGGA CGCCACCGCC CTGTCGCTGA AATCGCGCGG CTACGACCTG TCGTGGGACG TCCGGACCTT CAGCGGCTAT CCGTTCCGCA TGGACGTGCG CCTGACCAAC GCCCGGGTCG CCGAGCCGTC GGGCTGGGCG TTGCGGGCGC CGGAGCTGAC GGGCGAGGCC ATGGCCTACG ACATCGGTCA CTGGGTGGTC GTCGCCCCGG CCGGCGTGGT CATGACCCGG CCGATCAATG GCGACGTGGC GATCACCGGC CAGGCGCTGC GGGCCAGCTT CGCCGGCTTT GACAAGTACC CGCCGCGCAT CTCGGTGGAG GGCGCGAACC TGATCTTCAC CACCGCCCCT GGCGTCGCGC CCTTCCCGCT GCTGTCGACG GCGGGGCTGC AACTGCACAT CCGCCCCGGT CCGGACGACC AGGGCGCGAT CTTCTTCGAG GCCAAGGGCG CCAAGGCCCG CTTCACCGGC CTGATGGGCC GGATTGCCGA GGATCGCACC GCCGACCTGA TCTGGGATTC CAAGATCAGC AAGGTCAGCG CCCTGCGCGG CAGAAACTGG GCCGACGCAG TGGGCGACTG GTCCAAGGCC GGCGGAACCC TGACCGTGCA GCAGGGCAAG CTCAACGCCG GCGAGGCGCT GCTGGAAGCC AAGTCCGGCG CCCTGACCGT CGGCGACGAC GGCCGCCTGC AGGGCGCGCT CGACGTCACC GTGCGCGAGG TTCCCTCGCC CGGCGAGGCG CTGAAGAGCC CCGACGCCGC CGCGGCGGCC GTCGCCCAGG CGCTCGGCCG CGACCCGACC CTGTCGGCCA CCCTGAAGTT CGAAAATGGC CGCACCCGGC TGGGACTGTT CGACACCGGG CCTTCGCCGC GGGTTTATTG A
|
Protein sequence | MTHNAAALSR KPRRRGLLAP FVVLALVALG WSAGWVWLRG QAEQRMDATA LSLKSRGYDL SWDVRTFSGY PFRMDVRLTN ARVAEPSGWA LRAPELTGEA MAYDIGHWVV VAPAGVVMTR PINGDVAITG QALRASFAGF DKYPPRISVE GANLIFTTAP GVAPFPLLST AGLQLHIRPG PDDQGAIFFE AKGAKARFTG LMGRIAEDRT ADLIWDSKIS KVSALRGRNW ADAVGDWSKA GGTLTVQQGK LNAGEALLEA KSGALTVGDD GRLQGALDVT VREVPSPGEA LKSPDAAAAA VAQALGRDPT LSATLKFENG RTRLGLFDTG PSPRVY
|
| |