Gene Caul_5288 details


Gene Information

Locus tag: Caul_5288
Symbol:
ID: 5897454
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 229489
End bp: 230430
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 68%
IMG OID: 641555391
Product: luciferase family protein
Protein accession: YP_001676722
Protein GI: 167621937
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03571] luciferase-type oxidoreductase, BA3436 family
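
The coordinates and lengths listed in this record agree with one another, which a short calculation can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive; that is an assumption about the database's convention, chosen because it is the reading under which the numbers match. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(229489, 230430)   # Start bp, End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 942 313
```

The result matches the listed Gene Length (942 bp) and Protein Length (313 aa): 942 bp is 314 codons, the last of which is the stop codon.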


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.537455
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0470255
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAAT TTACTCCCCA TCCGGTGTTC GACAGGGTGT TTTCGCCCGG GCGCTTGTCG 
CTGGGTCTGG TGTTGCCGAT GGGCCGCGTG GGGGAAATAC AGCCTGACCC GCGCGAGCAG
TTGGAGCTGG CGCGCCTGGC CGACGCGCTC GGGTTTGCGG CTCTGTGGGT GCGCGATGTG
CCGATCAACA GCCCCGACTA TCCCGACCCC GTCGCCCATC TGGATCCTTG GACGCAGCTG
GCGGCGCTGG CCGTGGTCAC CCAAAGCATC GCCTTGGTCA CCGGCGCGAT CGTCGCGCCG
TTGCGCCATC CCCTCCACGT GGCCAAGGCC GCGCTGTCGA TCGACAGACT GTCGGCAGGG
CGGATGATTC TGGGCCTTGG GTCAGGGGAC CGCCCCTCAG AGTTCCAGGC CTTTGGGCTG
GACGTGTCGC AAGCGCCCGA GCGGTTGCGT CAAGCCTGGG CGACGATCGA GGGCTTGGTG
GGCGACAACG CCTTGATCGA CCCCGCCGGC GTTGGGGCCG ATCCGGCCGC CACCCTGCTT
CCCCGCCCCG TCCACGGACG CGTTCCGATC CTCGCCGTCG GCTCGGCCGG ACAGACGATC
GGCTGGGTGG CGCGCAACGC CAACGGCTGG GCCACGTACT ATCGCCCCCT GGCCAAGCAG
AAGGATCGAT TTGGCCTGTG GGCGGCGGCG GTGGAGAAGG CCCAACCAAG CGGCGCGCCC
GCCTTCGCCT CGGCGATGGT GTTGGCGCTA TTGGCCGACC CGAACGCGCC GGCCGAGCCT
TTGGGCCTTG GCCTCCGGAC CGGTCGCAAC GCCTTGATCG CCGAGCTTTC GGCCCTGCGC
GATCTGGGTG CCCACCATGT GATGTTCAGC CTCGTCAGGA CCGATCGGAA CCTCGATGAG
GTTCTGGGCG AGCTGGCCCA AGAGGTTCTG CCACGCCTCT GA
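
The GC content given in the gene information (68%) is the fraction of bases that are G or C. The following is a minimal sketch of how that figure could be recomputed from the sequence as printed here, in blocks of ten separated by spaces. `gc_content` is a hypothetical helper, not part of the database's own tooling.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Small check on a toy string: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75

# First line of the gene sequence above; paste all lines to get the gene-wide value.
first_line = "ATGGCGCAAT TTACTCCCCA TCCGGTGTTC GACAGGGTGT TTTCGCCCGG GCGCTTGTCG"
print(round(100 * gc_content(first_line)), "% GC in the first 60 bases")
```

A single line is only a sample, so its value need not equal the 68% reported for the whole gene.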
 
Protein sequence
MAQFTPHPVF DRVFSPGRLS LGLVLPMGRV GEIQPDPREQ LELARLADAL GFAALWVRDV 
PINSPDYPDP VAHLDPWTQL AALAVVTQSI ALVTGAIVAP LRHPLHVAKA ALSIDRLSAG
RMILGLGSGD RPSEFQAFGL DVSQAPERLR QAWATIEGLV GDNALIDPAG VGADPAATLL
PRPVHGRVPI LAVGSAGQTI GWVARNANGW ATYYRPLAKQ KDRFGLWAAA VEKAQPSGAP
AFASAMVLAL LADPNAPAEP LGLGLRTGRN ALIAELSALR DLGAHHVMFS LVRTDRNLDE
VLGELAQEVL PRL
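
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not come into play. `translate` and the constant names are illustrative, not the database's implementation.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored. Characters other than A, C, G, T raise KeyError.
    Alternative start codons are not handled (not needed for an ATG start).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGCGCAAT TTACTCCCCA TCCGGTGTTC"))  # MAQFTPHPVF
print(translate("CCACGCCTCT GA"))                     # PRL
```

The two outputs match the first ten and last three residues of the listed protein sequence, and the final codon TGA is the stop.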