Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1016 |
Symbol | |
ID | 5898471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1075801 |
End bp | 1077168 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561498 |
Product | two component, sigma54 specific, Fis family transcriptional regulator |
Protein accession | YP_001682644 |
Protein GI | 167644981 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.963955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTCC TGGTCGTCGG AAAACTGAAC GGACAGCTCT CGGTCGCCGT GAAGATGGCG ATGAACACCG GGGCCAAGGT CTCGCATGTC GAGACCATCG AGGCGGCGAC CCACGCCCTG CGGGCCGGGC AGGGGGCCGA CCTGCTGATG GTCGACTACG CGCTCGACAT CGCGGGCCTG ATCGCCGCCA ACGAGTCCGA GCGGATCCGG GTGCCGGTGG TGGCCTGCGG CGTCGACGCC GACCCCATGC GGGCCGCCGC GGCGATCAAG GCCGGGGCCA AGGAATTCAT CCCGCTGCCG CCGGACGCCG AGCTGATCGC CGCCGTCCTG GCCGCCGTGA CCGACGACAA CCGTCCGATG ATCGTCCGCG ATCCGGCCAT GGGCGACGTC ATCCGCCTGG CCGACCAAGT CGCGGGGTCG GAAGCCTCGA TCCTGATCAC CGGCGAAAGC GGCTCGGGCA AGGAGGTCAT GGCCCGCTAC GTCCACGCCA AGTCGCGTCG GGCCAAGGCG CCGTTCATCT CGGTCAACTG CGCCGCCATC CCCGAGAACC TGCTGGAAAG CGAGCTGTTC GGCCACGAGA AGGGCGCCTT CACCGGCGCC GTGGCCCGCC GCATCGGCAA GTTCGAGGAA GCCAATGGCG GCACGCTACT GCTGGACGAA ATCAGCGAGA TGGACACCCG CCTGCAGGCC AAGCTGCTGC GCGCCATCCA GGAGCGCGAG ATCGACCGGG TCGGCGGCTC AAAGCCGGTC AAGGTCGATA TCCGCATCCT GGCCACCTCC AACCGCGACC TGACACAGGC GGTGAAGGAC GGCACGTTCC GCGAGGACCT GCTCTACCGT CTCAACGTCG TGAACCTGCG CCTGCCGCCG CTGCGCGACC GCCCGGGCGA CGTCATCACC CTGTGCGAGC ACTTCGTGAA GAAGTACTCG GCCGCCAACG GCTTGCCGGA AAAGCCGATC GCGGCCGAGG CCAAGCGCCG GCTGATCGCC CACCGCTGGC CGGGCAACGT CCGCGAGCTG GAGAACGCCA TGCACCGCGC GGTGCTGCTG TCGCCGGGCG CCGAGATCGA GGAGTTCGCC ATCCGCCTGC CGGACGGCCA GCCCCTGGCC CCGGCCCCGG ACGTCGCGGT GGCCCGCGGC GCCCAGATGG CGGCCGACGC CGTCTCGCGC ACCTTCGTCG GCTCGACCGT GGCCGAGGTC GAGCAGCATC TCATCATCGA AACGTTGGAG CACTGCCTGG GCAACCGCAC CCACGCGGCC AACATCCTGG GCATCTCGAT CCGCACCCTG CGCAACAAGC TGAAGGAATA TTCCGAAGCC GGCGTCGCCG TGCCCGCCCC GCAAGGCGGC GTGACCAACG CGGCCTGA
|
Protein sequence | MRLLVVGKLN GQLSVAVKMA MNTGAKVSHV ETIEAATHAL RAGQGADLLM VDYALDIAGL IAANESERIR VPVVACGVDA DPMRAAAAIK AGAKEFIPLP PDAELIAAVL AAVTDDNRPM IVRDPAMGDV IRLADQVAGS EASILITGES GSGKEVMARY VHAKSRRAKA PFISVNCAAI PENLLESELF GHEKGAFTGA VARRIGKFEE ANGGTLLLDE ISEMDTRLQA KLLRAIQERE IDRVGGSKPV KVDIRILATS NRDLTQAVKD GTFREDLLYR LNVVNLRLPP LRDRPGDVIT LCEHFVKKYS AANGLPEKPI AAEAKRRLIA HRWPGNVREL ENAMHRAVLL SPGAEIEEFA IRLPDGQPLA PAPDVAVARG AQMAADAVSR TFVGSTVAEV EQHLIIETLE HCLGNRTHAA NILGISIRTL RNKLKEYSEA GVAVPAPQGG VTNAA
|
| |