Gene Clim_0743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0743 
Symbol 
ID6356024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp814982 
End bp816559 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content49% 
IMG OID642668368 
Productprotein of unknown function DUF92 transmembrane 
Protein accessionYP_001942803 
Protein GI189346274 
COG category[I] Lipid transport and metabolism
[S] Function unknown 
COG ID[COG0170] Dolichol kinase
[COG1836] Predicted membrane protein 
TIGRFAM ID[TIGR00297] conserved hypothetical protein TIGR00297 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0893181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAATC TACAAGCACC GTTTTCACAG GATCTCCCAG CCTTTTTTAT GACAATCGCG 
CTTCTGGTGT CGGTTATTCT GCTTGGAGAG TTGCTCATCA GAAGCTTCGG AGTCAGTACC
GTCGTTGTTC GAAAGATCAT ACACCTCGGG ATGGGTATAG TTGTTTTTTT TGTGCCGGAT
TATTTCGAGT CGAACTTCTA TCCGGTGCTT GCCGCTCTGT TTTTTCTTCT GTTTAACGGT
GCGAATGTTT TCGGAGGGTG GCTGCGCTCC CTGCACACTG AAACCGTTGA GCAGGAGGGC
GGGCTGAAGG TGAACAGTTA CGGTTCGATG CTGTTTCCGT TAGCTTTTAT TCTGCTCTGT
CTGCTGCTGT GGAGTGATCA CAAGTGGATC CTGCAGACGG CGATGCTTGT AATGGGTGTC
GGAGACTCGT TCGCGGCGCT TGTCGGCAGC AGTGTCGGCC GGCGTCACAT AGAGCATCTG
ACGGAAAGTC CGAAAACGAT CGAGGGATCG GCAACCATGT TCGTTATTTC CTTTTTGCTT
TTTCTTGTCT GTTTTTCTCT TTTCAGTGCC GAAATATCCG GCGCACTCGC ATCGAGACCA
TTCTGGATTC TGGTTCTTTT CGCCCTGCTT CTCGCTCTTG TTGTAACGGC TGTCGAGGCA
CTGCTCTCCT ATGGCCTGGA CAACTTGTTC ATTCCCCTTT CGATTGCCTA TGTTCTCTAT
GTTCTTGAGA TGAATCATAC CGTTCAGCTC GAGAGTTTTC TGATCGGCGG TGCTTTTGCC
CTGTTCCTTG CGATTTTTTC TGTCAAGGTG AAATTTCTTA ACAACAGCGG CGCGACGGCA
ACCTTTCTGC TCGGCACCAC GATTTTTGGA TTTGGCGGAA TTACATGGAC GGTTCCGATG
TTAACGTTTT ACCTGCTTTC ATCGGTGCTT TCGAAATTAG GCAAAAAAAG AAAAGCTAAA
TTTGATCTGG TTTTTGAAAA AGGTTCCCAG AGAGATTCAG GACAGGTGTA TGCAAATGGA
GGCATTGCCT GGATTCTCAT GATAATTTTT TCTCTTACCG GAGATCCGGC CGTGTTTTTT
GCCTATCTGG GTACGCTTGC TGCTGTACAG GCAGATACCT GGGCTACGGA GATAGGTACC
ATGTGGCCAA ATCCGAAAGC ATGGCTCGTT ACGTCGTTCA GGGAAGTTCC TGTTGGAACC
TCAGGTGGGG TATCGGTGCC CGGGACTTCA GGAGCGTTTA TCGGTTCTTT GTTTATCTGC
GCGAGTGCGC TCCTCGTCAA CAACGGATGG CTGTATGAGT TCGGGGTGGT ACAATCCATG
ATGCTGATAG GAGTATCCGG TCTTGTTGCA AGTCTTGTTG ACAGTTTTTT CGGTGCGACC
GTTCAGGCTC AGTACTACGA TCCCATACGT GAGAAAGTCA CCGAAAGAAC ACACAGTATT
GCAGAAGACG GCTCTATGGT CGAGAACCGG CTTCTGAAAG GTGTTGCGTT TGTGAATAAC
GATCTGGTTA ACACCCTGTG CGCACTCTCC GGCTCCGCGC TAGCCTATGT CGTTATTGAA
AACCTCAAGA TTTTTTAA
 
Protein sequence
MLNLQAPFSQ DLPAFFMTIA LLVSVILLGE LLIRSFGVST VVVRKIIHLG MGIVVFFVPD 
YFESNFYPVL AALFFLLFNG ANVFGGWLRS LHTETVEQEG GLKVNSYGSM LFPLAFILLC
LLLWSDHKWI LQTAMLVMGV GDSFAALVGS SVGRRHIEHL TESPKTIEGS ATMFVISFLL
FLVCFSLFSA EISGALASRP FWILVLFALL LALVVTAVEA LLSYGLDNLF IPLSIAYVLY
VLEMNHTVQL ESFLIGGAFA LFLAIFSVKV KFLNNSGATA TFLLGTTIFG FGGITWTVPM
LTFYLLSSVL SKLGKKRKAK FDLVFEKGSQ RDSGQVYANG GIAWILMIIF SLTGDPAVFF
AYLGTLAAVQ ADTWATEIGT MWPNPKAWLV TSFREVPVGT SGGVSVPGTS GAFIGSLFIC
ASALLVNNGW LYEFGVVQSM MLIGVSGLVA SLVDSFFGAT VQAQYYDPIR EKVTERTHSI
AEDGSMVENR LLKGVAFVNN DLVNTLCALS GSALAYVVIE NLKIF