Gene Information |
Locus tag | Caul_4571 |
Symbol | |
ID | 5902032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4946560 |
End bp | 4947855 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641565090 |
Product | hypothetical protein |
Protein accession | YP_001686189 |
Protein GI | 167648526 |
COG category | [S] Function unknown |
COG ID | [COG1322] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
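The coordinates and lengths in the table above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon, neither of which the record states explicitly. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4946560
END_BP = 4947855


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1296 431
```

Under those assumptions the result agrees with the listed Gene Length (1296 bp) and Protein Length (431 aa).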
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.775043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACTTTT CCGATCCGTT CCTGATCCTC GCCATCGTCT TCGCCCTGCT GGCCGCCGGC GCCGTGCTGT GGGCCCTGGC CAGCCAGCGG CGCGCGACCG GCGCCGACGC CCGCGCGTGG GAGCTGAACG CCAGGCTGGT CCAGGCCGAC GAGCGGGCGC GGCTGCTGGA AGACCAGGCC GTCACCCAGG GCGAGCTGAT CCGCGCCCAG GCCGCCCAGC AGGCGACCAT GACCGCCAAC ACCGTCGCCG AGGCGCTGAT CAAACGCACC GAGGAGAACT TCAAGAGCCG CGAGGCGCTG TCCCAGGCCC GGCTCGAAGC TCAGTTGAAG CCGGTGGCCG AGACCCTGGC CAAGTTCGAG GCCCAGGTCA CCGCCGTCGA AAAGGCCCGC GCCGAGGAGA CCGGCGGCCT GAAGGCCCAG ATCAACGCCC TGATGGAGGC CTCGGTCGCC ACCCAGTTCG AGGCCCGCAA GCTGTCGGCC GCCCTGCGGC GCGGGGCCGG GGTCCAGGGC CGCTGGGGGG AGCAGACTTT ACGTAACGTT CTCGAGGCCG CCGGCCTCAA CAACCGCTTC GACTTCGAGG AGCAGTTCAG CGTCGAGAGC GACGAGGGCC GTCGTCGTCC CGACGTCAAG GTCAAGATGC CGGGCGGCGG GGTGTTCGTG ATCGACGCCA AGTGCTCGCT GAACGCCTTC CTCGAGGCCC AGGAAGTGAC CGAGGAGCAC CTGCGCGAGG CGGCCATGAT CCGTCACGCC GCCAGCGTCC GCGCCCACAT GCAGGGTCTT TCCGCGAAGG CCTATTGGGA CCAGTTCGCC GGCGAGGGCT CGCCCGACTT CGTGGCCATG TTCGTGCCCG GCGACGGATT CCTGGCCGCC GCCCTGGACC GCCTGCCCGA CCTGATGACC GAGGCCATGG ACCGCCGGGT GCTGCTGGTC ACCCCGACCA CCCTGTTCGC TCTCTGCAAG GCCGTCGCCT ATGGCTGGCG GGCCGAGGAC CAGGCCAAGA ACGCCGCCGC CATCGTCGCG GTGGGCCGCG AGCTCTATAA GCGCATCGCC GTGATGGGGG CCCATGCCGG CTCGGTGGGC AAGGCGCTGG AGGCTGCCGT CGGCCGCTAC AACCAGTTCG TCGGCTCGCT GGAAAGCCAG GTCCTGACCC AGGCTCGCCG CTTCGAGGAC CTGTCGGTGG ATCACGAGGG CAAGGAGATC GGCGAGCTGG CCCCGGTCGA GAACGCCGTG CGGCCGCTGG TCAAGCTGGC CGAGGCGCCG GCGGAGCCCG TGGCTCGCCT GCAGGCCAAG CCTTAG
Protein sequence | MNFSDPFLIL AIVFALLAAG AVLWALASQR RATGADARAW ELNARLVQAD ERARLLEDQA VTQGELIRAQ AAQQATMTAN TVAEALIKRT EENFKSREAL SQARLEAQLK PVAETLAKFE AQVTAVEKAR AEETGGLKAQ INALMEASVA TQFEARKLSA ALRRGAGVQG RWGEQTLRNV LEAAGLNNRF DFEEQFSVES DEGRRRPDVK VKMPGGGVFV IDAKCSLNAF LEAQEVTEEH LREAAMIRHA ASVRAHMQGL SAKAYWDQFA GEGSPDFVAM FVPGDGFLAA ALDRLPDLMT EAMDRRVLLV TPTTLFALCK AVAYGWRAED QAKNAAAIVA VGRELYKRIA VMGAHAGSVG KALEAAVGRY NQFVGSLESQ VLTQARRFED LSVDHEGKEI GELAPVENAV RPLVKLAEAP AEPVARLQAK P
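The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the gene sequence is given on the coding strand (the record lists the strand as "-", but the sequence begins with ATG and ends with TAG). The helpers `clean`, `gc_content`, and `translate` are hypothetical names, not functions from an existing library. The amino-acid string follows the NCBI codon ordering, which assigns the same residues under translation table 11 as under the standard table; alternative start codons are not handled here, which does not matter for a gene that starts with ATG.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their residues.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGAACTTTT CCGATCCGTT CCTGATCCTC"
print(translate(prefix))  # prints: MNFSDPFLIL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.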