Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0121 |
Symbol | |
ID | 5897833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 131815 |
End bp | 133362 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641560605 |
Product | HemY domain-containing protein |
Protein accession | YP_001681757 |
Protein GI | 167644094 |
COG category | [S] Function unknown |
COG ID | [COG3898] Uncharacterized membrane-bound protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGCG CGGCGATCGT CTTCTTCCTG GTGGCGGCCA TCGCCGTGGC CGTGGTGGCC CTGACCGGCG AGCCGGGCGT GGCCAGCCTC GACTGGATGG GCTGGCGGGT CGAGATGACC GCCGCCGCCG CCGCCCTGCT CACCCTGTTC ACAGCCCTGC TGGCCACCAT CCTGTGGCGC GCCCTGCTGT GGGTGATCGA GGCCCCGCAG CGCGCCGCCC GCGCCCGCGC CGAGGCCAAG CGCAAGCAAG GCGTCGAGGC CCTGTCGCGC GGCTTTCTGG CCGTGGCGGC CGGCGACGGG TCCGAAGCCC GGCGCCTGGC CCAGAAGGCG GCCGAACTGG CCGAGGACGC CCCCGCCCTG ATCCGCGTCC TGGCCGCCCA GGCCGCCGAG GCGGCCGGCG ACCACACCGC CGCGCGCCAG GCCTACAACG CCATGCTCGG CTTTCCCGAG ATGCGCCTGG CCGGCCTGCG GGGCCTGATG CAGACCGCCC TGGCCGAGGG CGACAAGGGC CTGGCCCTGA AGCACGCCGA GACCGCCTAC GGCCTGGCCA AGACCGCGCG CTGGGCCTGG CGGGCCTTGC TGGAGGCGCG GCTGGAGGCC GGCGACTGGA AGGCCGCCCT CGACCTGGTC CAGGGCGCGC TGGAACGCAA GATCGTGCCG CCGCTGGTCG CGGAACGGGC GCGCGCCGCC CTGCTGGCCG CCTCGGCCGC CAGTCTGGAA GAGTCCGACG ATCCCAAGAC CCTGGCCCAG GCCCTGGACT TCGCCGTCCA GGCCGCCAGG CTCAAGCCCG ACTTCGCGCC CGGCGTGGTC ATGGCCGCCC GCCTGCAGGC CGCCGACGGC AAGGCCGCCA AGGCCGGGAG CTTGATCGAG GCCGCCTGGA AGCTCAGCCC TCATCCGGCC CTGTGGCTGG CCTATCGCGA CCTGAAGACC AGCGAGACGC CCAAGCTGCG CGCCGCGCGC CTGGCGGCCC TGGCCGGCCT GAAGCCCGAG GCCCGCGAAA GCCGCCTGCT CAAGGTCGAG AGCGCCCTGA TCGGCGGCGA TCCCGTGGCG GCCCGCGCCG CCGCCCGGCT GCTTCCGGAA GAGGGGCTGA CCGCGCGGGT CGCCGGCCTG ATGTCCCGCG TCGCCTACGC CAACGGCGAG GCCGACGAGG CCCGCGCCTG GATCGCCCGA GGCGTCGCCG CGGCGCAGGA GCCCGACTGG TCGGATCTCG ACCCCGAAGG TCGCGCCTTC GCCTACGCCC GCGACGACTG GGCGCGGCTG ACGGTCAGCT ACGCCGAGAC CGGCGAGCTC ATCCATCCAA GGTTCGAGCG CAGCGAACGC ACGATGAGCG AACTGCCCGA ACTGCCCTCG GCCTATGCGG AGTCCACGCC CTTCGTCCGC GCGGCCGAGA CCGGCGGGGC GCTGATGCCG ATCCCGGACG ATCCCGGAAT TGGCCCAGGC GTGTTCGACG CGGCCGCTCC GGCGGAGGGC GGAAACGACG GTCCGGCCCC TCGTCGCAAC ACGGGTTCAC GCCGACGCTT GGCAAGCGGC CCTCGCGCCG CTAAATAG
|
Protein sequence | MIRAAIVFFL VAAIAVAVVA LTGEPGVASL DWMGWRVEMT AAAAALLTLF TALLATILWR ALLWVIEAPQ RAARARAEAK RKQGVEALSR GFLAVAAGDG SEARRLAQKA AELAEDAPAL IRVLAAQAAE AAGDHTAARQ AYNAMLGFPE MRLAGLRGLM QTALAEGDKG LALKHAETAY GLAKTARWAW RALLEARLEA GDWKAALDLV QGALERKIVP PLVAERARAA LLAASAASLE ESDDPKTLAQ ALDFAVQAAR LKPDFAPGVV MAARLQAADG KAAKAGSLIE AAWKLSPHPA LWLAYRDLKT SETPKLRAAR LAALAGLKPE ARESRLLKVE SALIGGDPVA ARAAARLLPE EGLTARVAGL MSRVAYANGE ADEARAWIAR GVAAAQEPDW SDLDPEGRAF AYARDDWARL TVSYAETGEL IHPRFERSER TMSELPELPS AYAESTPFVR AAETGGALMP IPDDPGIGPG VFDAAAPAEG GNDGPAPRRN TGSRRRLASG PRAAK
|
| |