Gene Caul_5124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5124 
Symbol 
ID5897398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp44571 
End bp45551 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content66% 
IMG OID641555227 
Productluciferase family protein 
Protein accessionYP_001676558 
Protein GI167621773 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCA TTTCCGTTCT CGACTTCGTG CGCATCACCC GCGAGACAAA CGCACGTCAG 
GCCTTGGACC AAGCCCGCGA GCTGGCGGCT CATGCCGAGG CGCTCGGCTA TCGCCGCTAT
TGGGTGCCTG AGCACCATAA TTTCCCGGGC ATCGTCGGGG CGGCCACTTC TGTTGTGCTG
AGCCACATCG CGGCGGGCAC GCGCACCATT CGGATCGGCG CGGGCGGCGT GATGATGCCC
AATCATCCGC CGCTGGTGGT GGCCGAGCAG TTCGGCACCT TGGCCCAGCT CTTTCCAGAC
CGCATCGATT TGGGGATCGG ACGCGCGCCC GGCGGAGACC AGAACGTGAT CCGTGCCTTG
AGGCGTCCGG CGGGCGGCGG CGATCTGATG GCCGACGCCG TGGAGCTTTT GGCCTATTTC
GGGGAAGAGG GCCAAGCCAA GGGTGTGCGC GCCATGCCGG CGGCGGCCAC CAAGGTCCCC
CTCTGGATCT TGGGCTCCAG TCTCTATGGT GCGCGGCTGG GGGCGGAGCT GGGCCTGCCT
TACGCCTTCG CCTCGCATTT CGCGCCCGAG GCTCTTCTGC CGGCGCTGCA AACCTATCGC
GACCGTTTCA AACCCTCGGT CCACCTGGAG CGGCCCTATG CGATGATGGG GGTCAACATC
GTCGCGGCCG AGACGGACGC GGAGGCGGTG CGCCTGGCCA CCACACAGCA GATGACCTAC
ACCGATCTCA TCCGAGGCCG TCCAGGCGTC AGCCAGCCGC CCCTCGACGA CATCAACACC
TATTGGTCCC CGGTCGAACG CGACCACGTC ACGCGCATGT TGGGCTGCTC GATCATTGGA
TCGCTGGCCA CGGTGCGCGC GGCCATCGCC GCCCTCGTCG CCCAGACCGG AGTCGACGAA
CTGATCATCG ACTCCGACCT CTATGATCAC GGGCGACGCA TGACGTCCTT GGAGATCATC
GCCGAGGCGG TGGCGACCTA G
 
Protein sequence
MTAISVLDFV RITRETNARQ ALDQARELAA HAEALGYRRY WVPEHHNFPG IVGAATSVVL 
SHIAAGTRTI RIGAGGVMMP NHPPLVVAEQ FGTLAQLFPD RIDLGIGRAP GGDQNVIRAL
RRPAGGGDLM ADAVELLAYF GEEGQAKGVR AMPAAATKVP LWILGSSLYG ARLGAELGLP
YAFASHFAPE ALLPALQTYR DRFKPSVHLE RPYAMMGVNI VAAETDAEAV RLATTQQMTY
TDLIRGRPGV SQPPLDDINT YWSPVERDHV TRMLGCSIIG SLATVRAAIA ALVAQTGVDE
LIIDSDLYDH GRRMTSLEII AEAVAT