Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0007 |
Symbol | |
ID | 5897719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 9533 |
End bp | 10543 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641560490 |
Product | putative DNA-binding protein |
Protein accession | YP_001681643 |
Protein GI | 167643980 |
COG category | [R] General function prediction only |
COG ID | [COG3943] Virulence protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.341702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAG GCGAGATCGT TCTCTACAAT ACCGAAGACG GGCTCGCGCG CGTGCAACTG CGGGCGGCCG ATGGGACGGT TTGGCTGACC CAGGCGCAGA TGGCCGATCT GTTCCAGACG TCGCGGCCCA ATATCACCCA GCATCTCATC GCGATCTTCG AATCGGGCGA ATGTCTGGAG GCAGCAGTAT GTAAGCAAGA CTTACTAACT GCCGCTGATG GCAAACGATA TCAGACAACG CTCTATCGCC TCGAAGCCAT CCTCGCTATC GGCTACCGGG TGCGATCAGC GCGCGGCGTG CAGTTCCGAC AATGGGCGAC AACCCGTCTG CGGGACTATC TGGTCAAGGG CTTCGTGCTC GACGAGCAGC GCCTGCGTGA GCCAGAACCC TTCGATTATT TCGACGAACT GCTTGAGAAG ATCCGCGACA TCCGGGCCTC GGAGAAGCGG TTCTACCAGA AGGTCCGCGA TGTCTACGCA ATGGCCGCCG ACTACGATGG GCGATCCGAA GCCGCCCAGA TCTTCTTCGC CACTGTTCAG AACAAGATGC TCCACGCTGT CTCGGGCAAG ACGGCGGGAG AGCTGATCGT AGCCCGCGCT GATCCTATCG CGCCAAATAT GGGGCTGCGA AGCTGGAAGG GCGGCAAGGT TCGCAAAGGC GACGTCGATA CCGCCAAGAA TTATCTAAGC GCTGACGAGG TCAGCGATCT TAATCGCATC GTGACGATGT TTCTGGACTT CGCCGAGGAT CAGGCTCGCC GTCGCAAGCC GATGACCATG GCCGAGTGGG CGGCGCGCCT GGACGCCTTC TTGGCCTTCA ATGATCGTGA CGTCCTGGCC AGCGCCGGTC GGGTCACCGC GGACGAGGCC AAACGTGTCG CGCACAAGCG GTTCGAGGTA TTCGACACCG GACGACGCGC GGCGGAATCG CACGCGGCCG ATGCCGAACA TGTCGAGGAA ATGCTTCGGC TAACCGAAGA GCTCAAGCCC GCCTCGAAGA AACGCTTCTA G
|
Protein sequence | MSEGEIVLYN TEDGLARVQL RAADGTVWLT QAQMADLFQT SRPNITQHLI AIFESGECLE AAVCKQDLLT AADGKRYQTT LYRLEAILAI GYRVRSARGV QFRQWATTRL RDYLVKGFVL DEQRLREPEP FDYFDELLEK IRDIRASEKR FYQKVRDVYA MAADYDGRSE AAQIFFATVQ NKMLHAVSGK TAGELIVARA DPIAPNMGLR SWKGGKVRKG DVDTAKNYLS ADEVSDLNRI VTMFLDFAED QARRRKPMTM AEWAARLDAF LAFNDRDVLA SAGRVTADEA KRVAHKRFEV FDTGRRAAES HAADAEHVEE MLRLTEELKP ASKKRF
|
| |