Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3588 |
Symbol | hisD |
ID | 5901043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3871232 |
End bp | 3872572 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641564098 |
Product | histidinol dehydrogenase |
Protein accession | YP_001685213 |
Protein GI | 167647550 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.85846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.909122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTTGC GCATCAAGAC TTGCACCCGT CCCAAAAACC CGCATCTAGG CCCCATGCGT CGCTTCAATT CCGCCGATCC CGCCTTTCCC GCCGCCTTCA AGGCCTTCCT CGACGAGCGC CGGGGCGCGC CGGAGAATGT CGATGCGGCC GCGCGCAACG TGCTGGACGC CGTGAAGGCC GACGGCCTGG CCGCCGTGCT CAGCTTCACC AGCCGCTTTG ACGGGGTGGA TCTGACCGAA GACACCATCC GCGTCACCGC CGAGGAGATC GAGGCCGGCG CCGCCCAGGC TTCGTCCGAG GTGCGCGAGG CCATCGCCTT CGCCGCCAAG CGCATCCGGG CCTTCCACGT CCGCCAGCGC CCCGAAGACC AGTCGTGGAC CGATGAGGCC GGCGTGCAGT TGGGCTGGCG CTGGACGCCG CTGGAGGCGG TGGGCGTCTA TGTGCCCGGC GGCCGCGCGG CCTATCCCTC GACCGTGCTG ATGAACGCCG TGCCCGCCCA AGTGGCCGGG GTCGACCGCA TCGCCATGGT CACCCCGCCC GGCAAGCTGG AGCCGGCGGT GCTGGCCGCC GCCAAGGAGG CCGGGGTCAC CGAGATCTGG CGGATCGGCG GGGCCCAGGC CGTCGCCGCC CTGGCCTACG GCGCCGGCCC GATCCAGCCG GTCGACAAGA TCGTCGGCCC TGGCAACGCC TATGTCACCG CCGCCAAGCG CCGCCTCTAC GGCGTGGTCG GCATCGACGC CCTGGCCGGT CCGTCCGAGA TCGTCGTGGT CGCCGATGCG AAGAACGACC CCGCCTGGAT CGCCGCCGAC CTGCTGAGCC AGGCCGAACA CGACCCGGCC GCCCAGTCGA TCCTGATCAC CGACGACGAG GCCTTCGCCC AGGCGGTGTC CGACGCCGTC GACGCCCTGC TGGGGACCCT GGCCACCGGC GCCGACGCCG CCGAATCCTG GCGCGACCAC GGCGCCATCA TCCTGTGCCC GCTGGACGAC AGCCCGCGCC TGGTCGACCT GCTGGCGCCC GAGCACGTCG AGTTCGCGAT CGACGCGCCC GAGCGCCTGG CCGACCGGGT GCGCCACGCC GGGGCGATCT TCCTGGGCCG CCTGACGCCG GAAGCCATCG GCGACTATGT GGCCGGCTCC AACCACGTGT TGCCCACCAG CCGCGCGGCG CGCTTCCAGT CGGGCCTGTC GATCTACGAC TTCCTCAAGC GCACCTCGAT CGTGAAGTGC GATGCGGCCG CGTTCGGCGT CCTGGGTCCG CACACCGTGG CCCTGGCCAA GGCCGAGGGC TTGCCGGCCC ACGCCCTGTC GGCGTCGATT CGGTTGCCTT CCAAGCCGTA A
|
Protein sequence | MVLRIKTCTR PKNPHLGPMR RFNSADPAFP AAFKAFLDER RGAPENVDAA ARNVLDAVKA DGLAAVLSFT SRFDGVDLTE DTIRVTAEEI EAGAAQASSE VREAIAFAAK RIRAFHVRQR PEDQSWTDEA GVQLGWRWTP LEAVGVYVPG GRAAYPSTVL MNAVPAQVAG VDRIAMVTPP GKLEPAVLAA AKEAGVTEIW RIGGAQAVAA LAYGAGPIQP VDKIVGPGNA YVTAAKRRLY GVVGIDALAG PSEIVVVADA KNDPAWIAAD LLSQAEHDPA AQSILITDDE AFAQAVSDAV DALLGTLATG ADAAESWRDH GAIILCPLDD SPRLVDLLAP EHVEFAIDAP ERLADRVRHA GAIFLGRLTP EAIGDYVAGS NHVLPTSRAA RFQSGLSIYD FLKRTSIVKC DAAAFGVLGP HTVALAKAEG LPAHALSASI RLPSKP
|
| |