Gene Clim_1200 details

Gene Information

Locus tag: Clim_1200
Symbol:
ID: 6353717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1293100
End bp: 1295070
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 53%
IMG OID: 642668816
Product: 4Fe-4S ferredoxin iron-sulfur binding domain protein
Protein accession: YP_001943246
Protein GI: 189346717
COG category: [C] Energy production and conversion
COG ID: [COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins
TIGRFAM ID:
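
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; both are assumptions, not statements from the record. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1293100, 1295070)
print(length)                     # 1971
print(protein_length_aa(length))  # 656
```

Both results match the Gene Length and Protein Length fields.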


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.708049
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA TAGGCGTTTT TATCTGTCAT TGCGGCGAAA ACATCGGCGC AAAAATCGAC 
TGTACCCGAC TCACTTCGGC TATGAACGAC CATCCCGGCG TATCGGTTTC TGTCGAATAC
AAGTATTTCT GTTCAGATCC TGGTCAGGAA AACGTCAAAA AAGCCATAAG GGAACACAAT
CTTACAGGAG TAGTGGTTGC CGCATGTTCG CCGAGAATGC ATGAGGCCAC GTTCCGCAAA
GCCTGTGCCG AAGCGGGCCT TAACCCTTAT CTTTGCGAGA TTGCCAACAT CCGGGAGCAG
TGCTCCTGGG TGCATACCGA TCAGGATATG GCTACTGAAA AGGCTATCGA AATCACCCGT
TCGCTCATCG AAAAGGTCAA GTTGAACAAC GAACTGCAGC CTATAGAAGT CCCGGTAACG
AAGCGAGCCC TCGTTATTGG CGGAGGTATC GCCGGTATTC AGGCCGCACT CGACATCGCA
AATGCCGGTC AGGAGGTGGT GCTTGTCGAA CGGGAGCCAT CACTTGGCGG CCATATGGCA
CAGCTTTCCG AAACCTTCCC TACGCTTGAC TGCTCGCAGT GCATCATGAC GCCGCGAATG
GTTGAGGCGG CCCAGCATCC GAAAATCCGC CTGCTGACCT ATTCGGAAAT CGAACAGGTC
GAAGGGTTCA TAGGCAATTT CAAGGTCAGA ATCCGCCAGA AATCCCGATA TGTGGATATG
AAAGCCTGTA CCGGATGTGG AGACTGCATC CAGAAGTGCC CGCAGAAGAA AATCAGCGAT
GAATTCGATT GCGCTCTTGG CAAAAGGCCG GCGATCTATA CACCTTTTGC CCAGGCGGTG
CCGAACATAC CGGTTATAGA CAAGGAACAC TGCACCTTTT TCAAAAACGG CAAGTGCAAG
GTGTGCCAGA AGGTCTGCGA GACGAACGCC ATAAATTTCG AAATGCAGGA CGAGTTCCTC
GACCTTGAAA TCGGAGCCAT TGTCGTGGCA ACCGGTTTTC AGATTCAGAA TACCGCCATG
TACGGCGAAT ACGGTTACGG CAAATATGCC GATGTGATCA CCGGCCTGCA GTTTGAACGC
CTTGCCTCTG CCAGTGGTCC GACCGCAGGA AAAATTCTGC GGCCATCCGA CGGCAAGGAA
CCTCAAACCA TTGTTTTTAT CCAGTGTGCC GGATCGCGGG ATCCCTCGAA AGGCGTAAAA
TACTGCTCGA AAATCTGCTG TATGTACACA GCCAAGCACG CCATGCTCTA TGCCCATAAA
GTTCACGGCG GAAAAGCCCA TGTTTTCTAT ATGGATATAA GGGCGGCCGG AAAAGGGTAC
GACGAGTTTA CCCGCCGGGC GATTGAAGAG GATGAAGCCG CCTATATGAG AGGCCGGGTC
AGCAAAGTCT GGCTGGAAAG CGGAAAGCTG ATGGTGCGGG GCGTCGATAC CCTGCTCGGT
AAACCCGTTG AAATTGCAGC CGATATGGTT GTGCTCGCTA CGGCCATAAC CCCGCAGCCG
GATGCAAGGG AGTTCGCAAA GGTTGTCGGT ATCGGATGCG ATGAATACGG CTTTTATAAT
GAAGCCCACC TCAAACTCCG TCCGGTTGAA ACCGCAACGG CCGGAATTTT TCTGGCCGGC
GCATGCCAGT CGCCTAAAGA TATTCCTGAC TCCGTGTCCC AGGCATCGGC CTGCGCCAGC
AAGGTAATAG GCCTCTTCAG CCGCGATCAG CTCGAACGCG AACCGGTCAT AGCGATCAAC
AACGAATCTA CCTGTTCCGG CTGCTGGGGC TGCGCTCTGG CCTGTCCGTA CAGTGCCATC
GAGAAAAAAG ATATTCTCAG CCGTTCAGGA GAGCTCATCA AGCAGGTCGC CTTTATCAAT
CCGGGTCTTT GCCAGGGATG CGGCACCTGC GTAACCTTCT GCCGATCGAA CAGCATTGAT
CTGGCAGGAT TTACCGAAAA ACAGATATTC GCCGAAGTCA TGGGGCTATA G
 
Protein sequence
MAKIGVFICH CGENIGAKID CTRLTSAMND HPGVSVSVEY KYFCSDPGQE NVKKAIREHN 
LTGVVVAACS PRMHEATFRK ACAEAGLNPY LCEIANIREQ CSWVHTDQDM ATEKAIEITR
SLIEKVKLNN ELQPIEVPVT KRALVIGGGI AGIQAALDIA NAGQEVVLVE REPSLGGHMA
QLSETFPTLD CSQCIMTPRM VEAAQHPKIR LLTYSEIEQV EGFIGNFKVR IRQKSRYVDM
KACTGCGDCI QKCPQKKISD EFDCALGKRP AIYTPFAQAV PNIPVIDKEH CTFFKNGKCK
VCQKVCETNA INFEMQDEFL DLEIGAIVVA TGFQIQNTAM YGEYGYGKYA DVITGLQFER
LASASGPTAG KILRPSDGKE PQTIVFIQCA GSRDPSKGVK YCSKICCMYT AKHAMLYAHK
VHGGKAHVFY MDIRAAGKGY DEFTRRAIEE DEAAYMRGRV SKVWLESGKL MVRGVDTLLG
KPVEIAADMV VLATAITPQP DAREFAKVVG IGCDEYGFYN EAHLKLRPVE TATAGIFLAG
ACQSPKDIPD SVSQASACAS KVIGLFSRDQ LEREPVIAIN NESTCSGCWG CALACPYSAI
EKKDILSRSG ELIKQVAFIN PGLCQGCGTC VTFCRSNSID LAGFTEKQIF AEVMGL
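
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that layout, compute GC content, and translate the gene sequence for comparison with the protein sequence. It uses the standard amino-acid assignments, which translation table 11 shares with table 1; alternative start codons are not handled here. The helper names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the amino acids of
# translation table 11 (same assignments as the standard table).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as printed above.
first_line = "ATGGCAAAAA TAGGCGTTTT TATCTGTCAT TGCGGCGAAA ACATCGGCGC AAAAATCGAC"
print(translate(clean(first_line)))  # MAKIGVFICHCGENIGAKID
```

The output matches the first two blocks of the protein sequence. Passing the whole gene sequence through `clean` and `translate` allows the same comparison over the full length.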