Gene Information |
Locus tag | Caul_5108 |
Symbol | |
ID | 5897338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | - |
Start bp | 26403 |
End bp | 27353 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641555211 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001676542 |
Protein GI | 167621757 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.310256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTCG GCGCCGCGCT CCAGGTCAAA GATGAACCTT GGGGCGCGCG TTTGGCGCGG GAGCCGTCAG GCGGTCGCTG GCGCGAAGTC TTCGAGCCGG ACTACGTCGG CGTCACCGTC TCCAGCGTGC AGCAAAAGAC CTGGAGCGGG CTGTCGGCGG TCGTACGCGA GCTGCGCGTG GACGCGCCGT TCGACGTGGA ATGCACCGCC GACTTCTCCC GTCTGCTCGT GGTCCTCGAC GAGGTGGGCG GACGTATGCG TGGCCGGACC ACCAAGAGCT GCTCGCCTGA TAATCCGCCG ACCAACGCGA TGTATTTCGT TCCGGCCGGC GCCCAGGTTT GGCAGTGCGC CAATCAGCTG CGCTACATGC GCCACATCAG TTTGCAGTTT GACGCGGCGT CCCTGGACCA GTTGCTCGGC GAGGCCGCGC CGGCTCTGCC CGACGGACCC CGAATGGGAT TTTCCGATCC AAGCCTGTTG GCGATCGCCA GACTGTTCGA GGCTGAATGC CAGAGCGAGC GTCCAGCCGA CCTGTTGTTT GGAGACGGCC TGTCGCTCAG TCTCTTGTCG GCGCTGGGGC GTCTGGCCCA GGTTCCAAGC CAGCGGCCCG CTGGACGGGG CGGGTTGACG CCGCGCCATC TCAAGCAGGT CATCGACTAT ATGGATGCTC ACCTTGGCGA GGCGATCAGT CCCCGTGATC TTGGCGACTT GGTCCACTTC TCGCCCTCGC ACCTGGGACG GGCCTTCAAG GTCTCGATGG GCGTGTCGCC CTACGCGTGG CTGATCGAAC GCCGGGTGCG GCGGGCCGCG GAGTTGTTAC TGGATCCGCA GCGCTCCATC GCCGAGGTCG CTCTGGCGGT CGGGTTCTCC GACCAACCGC ACTTTACGCG CGCCTTTGCG CGCGTTTTTG GCGCCAGCCC CGGCGCATGG CGCCGCCAGT CCCTGGCCTA G
|
Protein sequence | MSLGAALQVK DEPWGARLAR EPSGGRWREV FEPDYVGVTV SSVQQKTWSG LSAVVRELRV DAPFDVECTA DFSRLLVVLD EVGGRMRGRT TKSCSPDNPP TNAMYFVPAG AQVWQCANQL RYMRHISLQF DAASLDQLLG EAAPALPDGP RMGFSDPSLL AIARLFEAEC QSERPADLLF GDGLSLSLLS ALGRLAQVPS QRPAGRGGLT PRHLKQVIDY MDAHLGEAIS PRDLGDLVHF SPSHLGRAFK VSMGVSPYAW LIERRVRRAA ELLLDPQRSI AEVALAVGFS DQPHFTRAFA RVFGASPGAW RRQSLA
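The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS includes its stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace (such as the 10-base grouping) is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Coordinates taken from the record above (Caul_5108 on NC_010335).
length = gene_length(26403, 27353)
print(length)                  # 951, matching "Gene Length"
print(protein_length(length))  # 316, matching "Protein Length"
```

Passing the Gene sequence field to `gc_fraction` should give a value comparable to the reported GC content, though the database may round or compute it differently.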