Gene Clim_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0032 
Symbol 
ID6355555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp36820 
End bp38220 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content54% 
IMG OID642667657 
Productaromatic hydrocarbon degradation membrane protein 
Protein accessionYP_001942119 
Protein GI189345590 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTCC TGGCTGTTCT TTCAGCGGCG TCAACCGCTT TTGCCACCAA CGGCATGAAT 
CTCGAAGGGT ACGGGGCTAA ATCTCATGCT CTTGGAGGGA CAAGCACGGC CTACGATACC
GGAAATTCAG GGGTTACGAA CAACCCGGCA ACTCTTGGCC TGAGGGAGGA GGGGTCTTCG
GAAATCGGTA TCGGTATCCG CGGCCTTCAC CCGGACGTCA ATCTCGGGTT TAATGGCGTG
ACGACAACCG AATCAAAAGG GGATTCCTAT TACATGCCCT CGCTCTCTTA TATGCGCAAA
GACGGAAAGA TTACCTGGGG ATTTGCCGTT CTCGCCCAGG GAGGGATGGG AACGGAATAT
GGCGAAAACT CTTCGCTGTT CAGCTATGGC ATGCCCATGT CGAAGCAGGG AATGGTTCCG
CTGAGCGGGC AGGATATCCG TTCCGAGGTG GGAGTGGGAC GTCTGATGTT TCCCGTGGCG
TATAACCTTA CGGAAAACAC CGTCATCGGA GCATCGCTCG ATTTCCTCTG GGCAAGCATG
GATCTGCGTA TGGATATGGA CGGAGCGCAT TTCGGCGATA TGGCCATGCA GGGTATTGGC
GGCAAGGTAA GCGGATCGAT GTTCGGTACT CTCGGGGGAA TGATCGGTTC CTCAGTCCGG
GATATCGACT ACGTCCGCTA CGATTTTTCG AACGACAACG CCTTTCTCGG GGAGGCGATA
GGTTACGGAA CCGGTTTCAA GGTCGGCATT ACCCACCGGT TCGGCAAGTT CCTTACGGTG
GGAGGAAGCT ATCACTCGCA GACCCGGATT TCAGATCTCG AAACCTCCAA AGCGGTACTT
TCGTTTGCCG GGAAGGATGC GATGAATAAT TCTTTTACCC GGTCGGTGAA CGGCACTATC
AAGGTCCGCG ATTTCGAGTG GCCGGCCACC TTTGCCGCAG GAGTCGCCCT GTATCCCTCT
GAACGCTGGA TGATTACCGC CGACATCAAG CATATCGACT GGTCGTCGGT AATGGAGAAG
TTTTCGACAT CCTTTACCGC CGATAACTCT CTTTCCAACG GGCCGTTTGC AGGGCAGACG
CTTGATGTGG AGATGCTGCA GAACTGGAAG GATCAGACCG TCATTTCGGT CGGCGTTCAG
TACCGGGCAA CCGACAGGCT TGCGCTGAGG ACTGGAGCCA GCTTCGCGTC GAATCCGGTT
CCCGATATGT ATCTCAATCC CATGTTTCCG GCGATAACCG AAAACCATTA TACGGCAGGA
TTCGGTTACC GGCTTTCCGA CAGGTCTTCT GTTTCGGCGG CTCTTGCATG GGCTCCGGAA
GTAAGCGAAA CCTCTGATGA AGGACTTGAG ATCGGTCACA GCCAGCTTAA CTGGTCACTG
AACTATTCCC ACGAACTTTA G
 
Protein sequence
MAFLAVLSAA STAFATNGMN LEGYGAKSHA LGGTSTAYDT GNSGVTNNPA TLGLREEGSS 
EIGIGIRGLH PDVNLGFNGV TTTESKGDSY YMPSLSYMRK DGKITWGFAV LAQGGMGTEY
GENSSLFSYG MPMSKQGMVP LSGQDIRSEV GVGRLMFPVA YNLTENTVIG ASLDFLWASM
DLRMDMDGAH FGDMAMQGIG GKVSGSMFGT LGGMIGSSVR DIDYVRYDFS NDNAFLGEAI
GYGTGFKVGI THRFGKFLTV GGSYHSQTRI SDLETSKAVL SFAGKDAMNN SFTRSVNGTI
KVRDFEWPAT FAAGVALYPS ERWMITADIK HIDWSSVMEK FSTSFTADNS LSNGPFAGQT
LDVEMLQNWK DQTVISVGVQ YRATDRLALR TGASFASNPV PDMYLNPMFP AITENHYTAG
FGYRLSDRSS VSAALAWAPE VSETSDEGLE IGHSQLNWSL NYSHEL