Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3978 |
Symbol | |
ID | 5901440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4308206 |
End bp | 4309939 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564499 |
Product | signal transduction histidine kinase |
Protein accession | YP_001685601 |
Protein GI | 167647938 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.301159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATC ACGACCCCGC ATCGGACGCG CCGCCCCGGG CCGCCGGCAC CGTGCGAAGA CGCCTGCTGC TGATGGCCCT GAGCCTGGTG GTCCCGGCCG TGATCTTCAT GACGCTGTTG GCGCGGGCCG AGTTCGGCGA GAGCCAGGCC CGCTACGAGC GGCAGTTGAT CGCCACCACC CGCGCCCTGG TGCTGGCGAC CGACCGCCAG ATCGGCCAGG GGCAAAGCGT TCTGCAGGCC CTGGCGGTCT CGCCCGCCCT GGTTTCCGGC GACATTTCGG CCTTCGAACG CCAGGCGCGC GCGGCCGTGC AGGGCGGCGA GGGCTGGATC GTATTGCTCG ACAATGAGCG GCAGCTGGTC AACACCCGGC GACCGGCGGG CGCGCCGCCG CCCAAGGTGG GCCTGCCCGA CTATCGGTGG CGGACCATCC GCGCCGGCCG CACGTCGGTG TCGAACCTGG TGCTGCCCAA GACGCCCGGC CAGTTCCCGC CCTTCGTGTC GATCGACATG CCGGTCATCG TCGACGGCAA GCTGTACGAC CTGGCCTACC AGCAATCGCC CAGGGCCTTC TCGTCGATCT TCGCCGGCCA GAACATCCCG CGCAGCTGGA CGGCCAGCAT CGTCGACCGC GAGGCCACGC TGGTTTCGCG ATCCAAGGAC CAGGATCGCT TCCTGGGCCA CAAGGTCAGC CCCAACACCT ATGCGGCCAT GGCCCGCGGC GCCGAGGGAG TGGTGCTGAG CCGGACCCTG GATGGCACGC CCACGCTCTC GGCCTTCAGC CGTTCGCCGA CCACCGGCTG GGCGTTCATT GTCGGAGTGC CGCGCGCCGA GCTGAACCGG GCCAACTGGT CGTCGATTGG GCTGCTGAGC CTGGCCAGCG CGGTGCTGCT GACCTTCGGC GTGGCGGTGG CGCTGGTGTT CTCGCGCGAC ATCTCGGCGA CGGTGCGCGG CCTGGCGGTC GACGCCAAGG CGGTGGCGGC CGGCGAGGAA ATCGCCCCCA CCCCCGATCG CCCCGACCAG TTCATCGAAA TCGCCGAGGT GCGCGCGGCC CTGCACAAGG CCGCCCTCCA GCTGCGGACC CGCGAGGCCG AGGAACAGCG CGCCCATCAG CGCCAGCAGC TGATGATCAA CGAGCTGAAC CATCGGGTGA AGAACACCCT GTTCACGGTG CAGTCCCTGG CCCGCCAGAG CCTGGGACGG CCGGCCGACA CGCCCGGCCT GACGGCCTTC AACGAGCGTC TGATGGCCCT GGCCCGCGCC CACGACCTGC TGACCCGGAG CGTCTGGGAG GGCGCCGAGC TGAGGGAGAT CCTCGAGGAG ACGCTCGAGC CGTATCTGGA CCGGACCGTG CTGGCCGGAC CGCTGGCGGC GCTGTCGCCG AACGCCGCCC TGGCCCTGTC GATGGTGTTC CACGAGCTCG CCACCAACGC CGTCAAATAC GGCGCCCTGT CGGTTCCCGA CGGCACGGTG ACGGTCGTCT GGCACGTCGA CCCCGGCGCG GCGCACCGGC TGACCCTGCA CTGGGAGGAA CGGGGCGGAC CCAAGGTGTC GCCGCCCAGC CGCTCGGGGT TCGGCTCGCG CCTGATCGCC GCCAGCCTCA AGTCCGACCT CAACGGCGAG GCGCGCATCG ACTACCGGCC CACCGGCCTG GTCTGCGTGC TGACCCTGTC GCTGCCCCAG ACCGGCAAGG AGCAGGCGGC GGCCGAGACG GCGTCCGGAC CGGTGGAAAG CTAG
|
Protein sequence | MADHDPASDA PPRAAGTVRR RLLLMALSLV VPAVIFMTLL ARAEFGESQA RYERQLIATT RALVLATDRQ IGQGQSVLQA LAVSPALVSG DISAFERQAR AAVQGGEGWI VLLDNERQLV NTRRPAGAPP PKVGLPDYRW RTIRAGRTSV SNLVLPKTPG QFPPFVSIDM PVIVDGKLYD LAYQQSPRAF SSIFAGQNIP RSWTASIVDR EATLVSRSKD QDRFLGHKVS PNTYAAMARG AEGVVLSRTL DGTPTLSAFS RSPTTGWAFI VGVPRAELNR ANWSSIGLLS LASAVLLTFG VAVALVFSRD ISATVRGLAV DAKAVAAGEE IAPTPDRPDQ FIEIAEVRAA LHKAALQLRT REAEEQRAHQ RQQLMINELN HRVKNTLFTV QSLARQSLGR PADTPGLTAF NERLMALARA HDLLTRSVWE GAELREILEE TLEPYLDRTV LAGPLAALSP NAALALSMVF HELATNAVKY GALSVPDGTV TVVWHVDPGA AHRLTLHWEE RGGPKVSPPS RSGFGSRLIA ASLKSDLNGE ARIDYRPTGL VCVLTLSLPQ TGKEQAAAET ASGPVES
|
| |