Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1398 |
Symbol | |
ID | 5898853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1488607 |
End bp | 1489497 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641561885 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001683026 |
Protein GI | 167645363 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins [COG3449] DNA gyrase inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.584508 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGA TGACGGCGGC GCTCGACCAA TACCATGCCC GGATGCAAAG GGTGCTGGAT CATATCGACA GGCATCTGGA TGGCGATCTG GACCTGGAGG CGATGAGCAG CGTCGCGGCG TTTTCCAAGT TTCATTTCCA CCGGCAGTTC AAGGCGACCT TCGGGGTGTC CCTGCATCGC TACGTCCAGC TGGCCCGCCT ACGGCGGGCT TCGAAACGGC TGGCCGACGG GCAGGGGCAG AGCGTCACCG ATATCGCCCT GGACGCCGGC TACGAGACGC CCGACGCCTT CGCCCGCGCC TTTCGCCAAA GGTTCGCGCA GTCGCCGTCG CGTTTCCGGA AATCTCCCGA CTGGGAGCCG TGGCTTGCGG CCTTCGGGCC GCTCAACGCA GCCAGGAGCA AGATCATGCA GACGACCTTC ACCCCCGATC AGGTGGCCCT CCGCGAGGTG GCCCCGACCC GCGTGGCGAT CTTCGAGCAC CGGGGCGATC CCGAAACCCT CGACGCCACC ATCCAGCGGT TCATCGCCTG GCGCAAAGCC GCCGGGCTGT CGCCCCGGAC CAACCCGACC TTCAACATCT GGCATTCCGA GCGACGCCCG CCCGATCCCG CCGACTACAG CATGGATCTC TGCGTCGGCG TCGGGGCCGA TCAACCGATC GACTCCAGCA GCGAGACGGT CAAGGCCGGC GAAATTCCGG GCGGACGCTG CGCGGTGCTG CGCGTGACCG GCGACACCCA TAACCTGGAG CCCGCCGCCC TCTATCTCTA CCGCGACTGG CTGCCGGCCA GCGGCGAAGA GATGCGGGAC TTTCCCATCT ACTGCCAACG CTTCTTCCTG GACGCACCGG AACAGGGCAC GGCGGCGGAG CTGTTTTTGC CGTTGAAGTA G
|
Protein sequence | MRTMTAALDQ YHARMQRVLD HIDRHLDGDL DLEAMSSVAA FSKFHFHRQF KATFGVSLHR YVQLARLRRA SKRLADGQGQ SVTDIALDAG YETPDAFARA FRQRFAQSPS RFRKSPDWEP WLAAFGPLNA ARSKIMQTTF TPDQVALREV APTRVAIFEH RGDPETLDAT IQRFIAWRKA AGLSPRTNPT FNIWHSERRP PDPADYSMDL CVGVGADQPI DSSSETVKAG EIPGGRCAVL RVTGDTHNLE PAALYLYRDW LPASGEEMRD FPIYCQRFFL DAPEQGTAAE LFLPLK
|
| |