Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3995 |
Symbol | |
ID | 5901457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4324861 |
End bp | 4325919 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564516 |
Product | LacI family transcription regulator |
Protein accession | YP_001685618 |
Protein GI | 167647955 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.917064 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAC GGAACGAACG TCAGCGGCGC CGCACGACCC AGAGCGCGAC CATTCGCGAC GTGGCCGCCC TGGCTGGCGT GTCGCCGATG ACCGTGTCGC GGGTGATCAA CCGCGAGACG ACGGTCAAGT CGGAGACCAA GGCCCTGGTC GACGCGGCGA TCCGCGACCT CAACTACGCG CCCAACCCCG CCGCCCGCAG CCTGGCCGGC TCGGCCCCGT TCCGCATCGG CCTGCTCTAC GACAACCCCT CGACCGGCTA TCTGTCGGAA TTCCTGGTCG GGGCGCTGGA CGAGAGCAGC CGGACGGGCG CGCAAGTGGT GATCGAGAAG TGCGCCGAGC CCGAACTGGC CGGCGCCACC CTGACGCGGC TGCTGAAGAC CGGCGTCGAC GGCCTGATCC TGCCCGCCCC GCTCTGCGAG TCCGCCCAGG TCCTGGCCGA GGTCAAGGCG GCCGGCGCCG CCGCGGTGGC CGTGGCGCCC GGCATGCCCA GCGCCGACAT GGCCACCATC CGCATCGACA ACGAGGCCGC CGCCTTCGAA CTGGCCCAGC ACCTGCTGGC CCTGGGCCAT AAGCGGTTCG GGATCATCAA GGGCCACCCC AACCAGACGG TCAGCCAGCA GCGCCTGGAC GGCTTCATGT CGGCCCTGAA GGCGGCCGGG ATCCCGGACA AGGCCGTGCG CATCGAGCAG GGCTATTTCA CCTATCGCTC GGGCCTGGAG GCGGCCGAGC GGCTGCTGGG CGCCGACGAC CGGCCCACCG CCATCTTCGC CGGCAATGAC GACATGGCCG CGGCCACCGC CGGGGTCGCC CACCGGATGG GCCTGGACGT GCCGGAAGAC GTGTCGATCG TCGGCTTCGA CGACACCTCG ATCGCCGCCA ATATCTGGCC GGCCCTGACC ACGGTCCACC AGCCGATCGC CGCCATGGCC CGCGCCGCCG TCGATCTGGT GCTGGAGGAG ATCCGCCGCA AACGCGGCAA GGCGGGCGAG CCGCGCCAGC TGATGCATCC GCATACGCTG ATCGTGCGGG ATTCGACCGG GCCGGCGCCG GAAGGGTGA
|
Protein sequence | MSERNERQRR RTTQSATIRD VAALAGVSPM TVSRVINRET TVKSETKALV DAAIRDLNYA PNPAARSLAG SAPFRIGLLY DNPSTGYLSE FLVGALDESS RTGAQVVIEK CAEPELAGAT LTRLLKTGVD GLILPAPLCE SAQVLAEVKA AGAAAVAVAP GMPSADMATI RIDNEAAAFE LAQHLLALGH KRFGIIKGHP NQTVSQQRLD GFMSALKAAG IPDKAVRIEQ GYFTYRSGLE AAERLLGADD RPTAIFAGND DMAAATAGVA HRMGLDVPED VSIVGFDDTS IAANIWPALT TVHQPIAAMA RAAVDLVLEE IRRKRGKAGE PRQLMHPHTL IVRDSTGPAP EG
|
| |