Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5288 |
Symbol | |
ID | 5897454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | + |
Start bp | 229489 |
End bp | 230430 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641555391 |
Product | luciferase family protein |
Protein accession | YP_001676722 |
Protein GI | 167621937 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03571] luciferase-type oxidoreductase, BA3436 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.537455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0470255 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAAT TTACTCCCCA TCCGGTGTTC GACAGGGTGT TTTCGCCCGG GCGCTTGTCG CTGGGTCTGG TGTTGCCGAT GGGCCGCGTG GGGGAAATAC AGCCTGACCC GCGCGAGCAG TTGGAGCTGG CGCGCCTGGC CGACGCGCTC GGGTTTGCGG CTCTGTGGGT GCGCGATGTG CCGATCAACA GCCCCGACTA TCCCGACCCC GTCGCCCATC TGGATCCTTG GACGCAGCTG GCGGCGCTGG CCGTGGTCAC CCAAAGCATC GCCTTGGTCA CCGGCGCGAT CGTCGCGCCG TTGCGCCATC CCCTCCACGT GGCCAAGGCC GCGCTGTCGA TCGACAGACT GTCGGCAGGG CGGATGATTC TGGGCCTTGG GTCAGGGGAC CGCCCCTCAG AGTTCCAGGC CTTTGGGCTG GACGTGTCGC AAGCGCCCGA GCGGTTGCGT CAAGCCTGGG CGACGATCGA GGGCTTGGTG GGCGACAACG CCTTGATCGA CCCCGCCGGC GTTGGGGCCG ATCCGGCCGC CACCCTGCTT CCCCGCCCCG TCCACGGACG CGTTCCGATC CTCGCCGTCG GCTCGGCCGG ACAGACGATC GGCTGGGTGG CGCGCAACGC CAACGGCTGG GCCACGTACT ATCGCCCCCT GGCCAAGCAG AAGGATCGAT TTGGCCTGTG GGCGGCGGCG GTGGAGAAGG CCCAACCAAG CGGCGCGCCC GCCTTCGCCT CGGCGATGGT GTTGGCGCTA TTGGCCGACC CGAACGCGCC GGCCGAGCCT TTGGGCCTTG GCCTCCGGAC CGGTCGCAAC GCCTTGATCG CCGAGCTTTC GGCCCTGCGC GATCTGGGTG CCCACCATGT GATGTTCAGC CTCGTCAGGA CCGATCGGAA CCTCGATGAG GTTCTGGGCG AGCTGGCCCA AGAGGTTCTG CCACGCCTCT GA
|
Protein sequence | MAQFTPHPVF DRVFSPGRLS LGLVLPMGRV GEIQPDPREQ LELARLADAL GFAALWVRDV PINSPDYPDP VAHLDPWTQL AALAVVTQSI ALVTGAIVAP LRHPLHVAKA ALSIDRLSAG RMILGLGSGD RPSEFQAFGL DVSQAPERLR QAWATIEGLV GDNALIDPAG VGADPAATLL PRPVHGRVPI LAVGSAGQTI GWVARNANGW ATYYRPLAKQ KDRFGLWAAA VEKAQPSGAP AFASAMVLAL LADPNAPAEP LGLGLRTGRN ALIAELSALR DLGAHHVMFS LVRTDRNLDE VLGELAQEVL PRL
|
| |