Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3970 |
Symbol | |
ID | 5901432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4299033 |
End bp | 4300049 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564491 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001685593 |
Protein GI | 167647930 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0584916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGAAC CTGCGACGAC GCGCGAGGGG CCGCTCCCGC CCCTGGCGAG AACCAATTTC AGTACGCGAA GCGTTCCGCC GGAAGAGCGG CACGACTACT ATCGCCGTGA GGTGCTGTCA GCGCTGGACG CGCGCGATCC CGAGCCGGGC TTTTCGGCCA ACATCACCTC GCTGCGTCTG GGGTCCCTGG CCTTCTACGT CACCGAGACC GGCGGCCACA CCATGTTCCG CACGCCGGAG ATGATCGCCG CCGACGGTCG CGACCACTAC ATCGTGCAGT TCAACATCGC CGGCTCGCAT ACCGGCGACT TCGACGGCGT GCCCTTTTCG GCCGGTCCGG GCGAGGTCGG CATCTGCGAC CTGTCGCGGC CGATGCTGCT GCACAGCACG GCGGTCAAGG TGCTCTCGAC CTTCCTGCCG CGCGCCGAGG TCAAGGCGGT GGCGCCCGAC ATCGAACTGC ACGGCATGGT GCTGGACGCC AACCGCGCCG GCCTGCTGAT CGAGCACCTG GCCTCGGTCA CCCGGTGGTT CCCGCGACTG CTGCCCGAGA CCCTGCCGGG CATCACCCGC GCCACCATCG AGCTGCTGGG CGCGTGCCTG GTCATGGAAG CCAGCCGGGC GGACTTCGGC GTGCGCGAGT CGCCGGTGCT GATGCGGGCT CGCGCCTATG TCGAGCACAA CCTGCTGGAG CCTACCCTCA ACCCGGCCAA GATCAGCGAA GCGCTGGGCG TGTCGCGCTC GACGCTCTAC CGCCTGTTCG AACCGCTGGG CGGGGTGACG GCCTATGTCT GGGACCGCCG CCTGCACCTG GCGCGCGCCG CCCTGCTGGA CCCCAAGCGA GCCCGGCGGA TCAGCGAGAT CGCCTTCCAG TGCGGCTTCA GCAGCGAGGC CCATTTCAGC CGCAGCTTCC GCAAGGCCTT CAACATCCGG CCCAGCGACC TGCGCTCGCT GCAGCCCAGC CTGGCCGACG AGCCCGACAG CCCGTTCGCC AAGTGGACCG AGGCGGCGAA GGGGTAG
|
Protein sequence | MIEPATTREG PLPPLARTNF STRSVPPEER HDYYRREVLS ALDARDPEPG FSANITSLRL GSLAFYVTET GGHTMFRTPE MIAADGRDHY IVQFNIAGSH TGDFDGVPFS AGPGEVGICD LSRPMLLHST AVKVLSTFLP RAEVKAVAPD IELHGMVLDA NRAGLLIEHL ASVTRWFPRL LPETLPGITR ATIELLGACL VMEASRADFG VRESPVLMRA RAYVEHNLLE PTLNPAKISE ALGVSRSTLY RLFEPLGGVT AYVWDRRLHL ARAALLDPKR ARRISEIAFQ CGFSSEAHFS RSFRKAFNIR PSDLRSLQPS LADEPDSPFA KWTEAAKG
|
| |