Gene Clim_1451 details


Gene Information

Locus tag: Clim_1451
Symbol:
ID: 6354764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1556775
End bp: 1557782
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 55%
IMG OID: 642669061
Product: hypothetical protein
Protein accession: YP_001943489
Protein GI: 189346960
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID:
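
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 1556775
END_BP = 1557782
GENE_LENGTH_BP = 1008
PROTEIN_LENGTH_AA = 335


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks hold for this record: 1557782 - 1556775 + 1 = 1008 bp, and 1008 / 3 - 1 = 335 aa.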


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000857119
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATTG ATGATGTACT GAACAAGACA GGATCGGGCG AATTGCTTTC GCGCGACGAG 
ATGGTTTTTC TGCTTGATTT TCCGTCTGAT TCTATCGGGA CCTATATGGT TATGGCTGAA
GCAAACCGGA TATCGAAGGA GGTATCGCAG GGAAAAGCTG AAGTTCATGC CCAGTTCGCC
CTCAATCTTG CGCCGTGCAG TTGCGATTGT CTGTTTTGTT CGTTTGCGGA AGTGAACGGG
GTTTTCACTG CGTCAACGGC GTTGAGTTCC GACCAGGCTG TCGCTTATGC GCGACAGTTC
GAAAAGGATG GCGCGAACGC TCTTTTTCTG ATGTCGACGG CACACTATCC GTTTGAGCGT
TTTCTGGAAA TATCAGGGGA GGTGCGCAAA AACCTGAAGC CGGAAACGAC CTTGATTGCC
AACGTGGGCG ACCAGTCGAT CAAGAGCGCC CTCAAGCTGA AAGATGCCGG GTTCAGCGGC
GTGTATCATG CGGTTCGGCT GCGCGAAGGA ATCGATACCA CTCTCGATGT CGGCAGGAGG
AGGCAGAGTA TTGCGAATTT CAGGGAGGCC GGTCTTGAAG TAGGGACATG CGTTGAGCCT
GTAGGGCCCG AACATACCAA TGAGGAGCTT GCCGATATGA TCGCATTTAC GGCATCGTTC
AATCCCGCCT ATAGCGGCGC GGCGCGCAGG ATTCCCATAC CGGGCACCAG ATTGGCCGCG
CTCGGCATTA TCAGCGAGCT GCGGATGGCT CAGATCGTGG CCGTTACCAG ACTGGGTATG
CCGAGAAGCG TTTTGGGCAA TTGCACCCAT GAACCGTGTA CATTAGGCGC TATCGCCGGA
GCGAATCTGT TCTGGGCCGA AGTCGGGGCC AACCCCCGTG ACGTTGAGGC GAAGACGGAG
GAGGGAAGGG GAGAAAGTGT GATCAGCTGC CGTTCTGTTT TTCAGGAAAG CAATTGGGAG
GTGCTGCGGG GTCCGTCACG GTTTTATAAT CGAGCAAGTA GCCGGTAG
 
Protein sequence
MKIDDVLNKT GSGELLSRDE MVFLLDFPSD SIGTYMVMAE ANRISKEVSQ GKAEVHAQFA 
LNLAPCSCDC LFCSFAEVNG VFTASTALSS DQAVAYARQF EKDGANALFL MSTAHYPFER
FLEISGEVRK NLKPETTLIA NVGDQSIKSA LKLKDAGFSG VYHAVRLREG IDTTLDVGRR
RQSIANFREA GLEVGTCVEP VGPEHTNEEL ADMIAFTASF NPAYSGAARR IPIPGTRLAA
LGIISELRMA QIVAVTRLGM PRSVLGNCTH EPCTLGAIAG ANLFWAEVGA NPRDVEAKTE
EGRGESVISC RSVFQESNWE VLRGPSRFYN RASSR
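
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below strips that formatting, computes a GC fraction, and translates DNA to protein. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which are not modelled here). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, not part of any library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAAAATTG ATGATGTACT GAACAAGACA GGATCGGGCG AATTGCTTTC GCGCGACGAG"
print(translate(clean_sequence(first_line)))  # MKIDDVLNKTGSGELLSRDE
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` is one way to compare against the GC content listed in the record.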