Gene Clim_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0035 
Symbol 
ID6355558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp40764 
End bp43625 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content57% 
IMG OID642667660 
Productmolydopterin dinucleotide-binding region 
Protein accessionYP_001942122 
Protein GI189345593 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.813982 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATA CCCATAAACC AACTGTAATA GAAAGCCTTG CTGAAAAGCT GCACCTTATT 
CCTGATCTGC ACGCAGAGAG CGGCGGGGCG AAAGCGCTTC GTGCGGCTGA AGAGGGCTCC
GAGGTCAGCT CCCCCCCGCC CGAGCAATGG GATAACTGGG TCGAGTATGA CGCCAAAAGC
TGGCCCGAGC GCAAGACGAA AGAGTACATG CTGGTTCCGA CCGCATGTTT CAACTGCGAA
GCGGGCTGCG GACTTCTTGC CTATGTCGAT AAAGAGACCA TGGAGATCCG CAAGCTGACC
GGGAATCCCT ATCATCCGGC AAGCCGCGGC AGAAACTGCG CGAAGGGCCC GGCAACGCTC
AACCAGATTC AGGACACCGA CCGCGTGCTC TATCCCATGA AGCGCGCCGG AAAGCGGGGC
GACGGTCAAT GGGAAAGGGT CAGCTGGGAC AGCGTGCTTG ACGATATAGC AGGACGGATG
CGCAAGGCCA TCATGGAAGG TCGGAACAAC GAAATTTCTT ACCATGTCGG GCGTCCCGGC
CATGACGGTT TCATGGAATG GATTCTTCGT GCATGGAATG TTGACGGCCA CAACAGTCAC
ACCAACGTCT GTTCCTCGGG AGCGCGGTTC GGTTACGCCA TATGGGAAGG TTTCGACCGT
CCCTCCCCCG ATCATGCCAA CGCTAAATTC ATTCTTCTGG TCAGCGCCCA TCTTGAATCG
GGTCACTATT TCAATCCGCA TTCGCAGCGC ATTATCGAGG CTCGCATGAA AGGGGCCAAG
CTTGCCGTGC TCGATCCCCG GCTTTCCAAC ACCGCAAGCA TGTCGGACTA CTGGATGCCG
AGCTATCCCG GCACCGAACC GGCCGTGCTG CTTGCCATGG CGAAGGTGAT TATCGATGAA
GGTCTCTACA ACAGAACCTA CCTTGAAAAC TGGGTGAACT GGCAGGAGTA CCTCCAGGCT
GAATATCCAG GAACTCCTGT GACCTTCGAG AACTTCATAG AAGGGCTGAA AAAGGAGTAT
GCGCACTATA CCCCCGAATA CGCTTCGAAA GAGAGCGGCG TCGATGCCGC CATGATCGTC
GAGATCGCCC GAAAGATAGG AGAGGCCGGC TCTCAGTTTT CTACCCATGT CTGGCGAAGC
GCCAGCAGCG GCAATCTCGG CGGCTGGGCA GTGTCGCGTA CGCTGCACTT CCTGAACGTA
CTGACAGGCA GCGTCGGTAC GCCGGGCGGC ACCTCTCCCA GCGCATGGAA CAAGTTCAAG
CCGCAAGTGC ATGCGGAACC CAAACCGCAG ACCTTCTGGA ACCCGCTCCA TCTGCCAAAC
GACTATCCGC TCGCGCATTT CGAGATGAGC TTCCTGCTTC CCCATTTCCT CAAGGAAGGG
CGAGGAAAAC TCGACGTCTA TTTCACCAGG GTGTTCAATC CGGTCTGGAC CTATCCCGAC
GGCTTTTCAT GGATAGAGGC GCTCGAGGAC GAATCGAAGA TCGGACTGCA CGCCGCGCTT
ACTCCAACCT GGAGCGAGAC CGCTTACTTT GCCGATTACG TCCTGCCGAT GGGCCACTCG
GCCGAACGGC ACGACCTGCT CAGCTACGAA ACCCACGCAG GGAAATGGAT CGCTTTCCGC
CAGCCTGTGC TGCGCACCGC CCTGAGAAGA ATGGGCAAGC CTGTCAGATA TACCTGGGAG
GCCAATCCCG GCGAAGTCTG GGAAGAGGAT GAATTCTGGA TCGAACTGAC CTGGCGCATC
GATCCTGACG GAAGCCTCGG CATCCGCCAG TACTGCATGT CGCCCTACAG GCCCGGCGAG
AAAATAACCA TTGACGAGTA CTACCGATAC ATCTTCGAGC ATACGGCAGG TCTGCCTGAA
AAGGCAGCAG AAGAGGGGCT TTCAGCGTTC GACTACATGC AGAAATACGG AGCGTTCGAG
GTTGAAAGCA ACGTTTACAA TGTGCATGAA AAAGCGGTGC CGCCCTCCGA TCTCGACGGC
GCTTCGGTGC AGCCTCAGAA CGGACTGATC GTGAAAAACG GCAAGGCGGT CGGCGTGGAA
GTCGCAGGCC GCTCCTGCGC CGGTTTTCCG ACCCCTTCGA AGAAGCAGGA GTTCTATTCA
GGCACCATGA TCGACTGGAA GTGGCCGGAG TATCGCCTGC CCGGCTACAT CAAGAGCCAT
ATCCATGAGG AGACCATGAA CCATAAGAAC GGAGAGTTTG TGCTGGTGCC GACCTTCCGT
CTTCCGGTGC TGATTCACTC CCGTTCCGGC AACGCCAAGT GGCTTGCTGA AATCGCTCAC
CGCAATCCGG TCTGGATCAA CGTTGACGAC GGCGCGGCTC TCGGCATAGC CAACGGCGAC
CTTATCAGGG TCAACACCGA TATAGGCTTT TTCGTGAACC GCGCATGGGT GACTGAAGGT
ATCCGCCCGG GCGTGGTTGC CTGTTCGCAT CACATCGGAC GGTGGCGTCG CGAGCAGGAT
CCAGAAGCCA ACCGCTGGGC GGCCAACAGG GTCAATATTT CCAAAGAAGG GAAAGGCAAG
TGGAAAATGC GCGTTGAAGA GAACATCCAG CCCTACGAGA GCAGCGATGC CGACTCCTCG
AGGATTTTCT GGTCGGATGG CGGCGTTCAT CAGAACATCA CCTTTCCGGT ACATCCCGAT
CCGATAAGCG GCATGCACTG CTGGCACCAG AAGGTTCGGA TCGAGAAGGC GCACGAGGGA
GACCAGTATG GCGATGTTTT TGTCGATACC GACCGTTCTT TCCGGATTTA CAAGGAGTGG
CTCGCCATGA CGCGTCCCGC GCCCGGACCA GGAGGCCTTC GCCGTCCGCT CTGGCTTAAC
CGTCCGTTCA GACCCGATGA GAAGACCTAC TATCTGAAAT AG
 
Protein sequence
MSYTHKPTVI ESLAEKLHLI PDLHAESGGA KALRAAEEGS EVSSPPPEQW DNWVEYDAKS 
WPERKTKEYM LVPTACFNCE AGCGLLAYVD KETMEIRKLT GNPYHPASRG RNCAKGPATL
NQIQDTDRVL YPMKRAGKRG DGQWERVSWD SVLDDIAGRM RKAIMEGRNN EISYHVGRPG
HDGFMEWILR AWNVDGHNSH TNVCSSGARF GYAIWEGFDR PSPDHANAKF ILLVSAHLES
GHYFNPHSQR IIEARMKGAK LAVLDPRLSN TASMSDYWMP SYPGTEPAVL LAMAKVIIDE
GLYNRTYLEN WVNWQEYLQA EYPGTPVTFE NFIEGLKKEY AHYTPEYASK ESGVDAAMIV
EIARKIGEAG SQFSTHVWRS ASSGNLGGWA VSRTLHFLNV LTGSVGTPGG TSPSAWNKFK
PQVHAEPKPQ TFWNPLHLPN DYPLAHFEMS FLLPHFLKEG RGKLDVYFTR VFNPVWTYPD
GFSWIEALED ESKIGLHAAL TPTWSETAYF ADYVLPMGHS AERHDLLSYE THAGKWIAFR
QPVLRTALRR MGKPVRYTWE ANPGEVWEED EFWIELTWRI DPDGSLGIRQ YCMSPYRPGE
KITIDEYYRY IFEHTAGLPE KAAEEGLSAF DYMQKYGAFE VESNVYNVHE KAVPPSDLDG
ASVQPQNGLI VKNGKAVGVE VAGRSCAGFP TPSKKQEFYS GTMIDWKWPE YRLPGYIKSH
IHEETMNHKN GEFVLVPTFR LPVLIHSRSG NAKWLAEIAH RNPVWINVDD GAALGIANGD
LIRVNTDIGF FVNRAWVTEG IRPGVVACSH HIGRWRREQD PEANRWAANR VNISKEGKGK
WKMRVEENIQ PYESSDADSS RIFWSDGGVH QNITFPVHPD PISGMHCWHQ KVRIEKAHEG
DQYGDVFVDT DRSFRIYKEW LAMTRPAPGP GGLRRPLWLN RPFRPDEKTY YLK