Gene Cag_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1029 
Symbol 
ID3747757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1398353 
End bp1400323 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content47% 
IMG OID637773558 
Productheterodisulfide reductase, subunit A 
Protein accessionYP_379334 
Protein GI78188996 
COG category[C] Energy production and conversion 
COG ID[COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0503554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAAA TTGGGGTTTT CGTTTGTCAC TGCGGTGAAA ATATCGCGGG CAAAATTGAT 
TGCAATAAAC TAAACGATGC CATTAAAGAG CATCCTGGTG TTGAAATTGC TTTCGATTAC
AAATATTTTT GCTCCGACCC AGGGCAGGAA AGCGTTAAAA AAGCTATTCG CGACAACCAT
CTTACGGGTA TTGTTGTGGC AGCTTGTTCA CCGCGTATGC ACGAAGCCAC CTTCCGTAAA
GCGTGTGCTG AAGCGGGCTT AAACCCCTAC CTTTGCGAAA TAGCCAACAT TCGTGAGCAG
TGCTCGTGGG TGCATAGCGA TGGCGATATG GCAACCCAAA AAGCCATTGA TATTACTCGC
TCAATGGTTG AAAAGGTGAA ATTAAACAAC ACCCTTCAGC CTATTGAAGT GCCCGTTACA
CGCAAAGCGT TGGTAATTGG TGGCGGTATT GCTGGCATTC AAGCGGCGCT TGATATTGCA
AATGCAGGGC AGGAGGTGGT GCTTGTTGAA CGCGAAGCCT CGCTTGGTGG GCACATGGCA
CAGCTTTCGG AAACCTTTCC AACGCTTGAT TGCTCGCAGT GCATTATGAC CCCCCGCATG
GTGGAAACCG CACAGCATCC AAAAATTAAG TTATTGACCT ATGCCGAAGT TGATAGCGTT
GAGGGCTACA TTGGCAACTT TAAGGTAAAA GTGCGCATGA AGGCACGCTA CGTTATTAAA
GATCAATGTA CAGGTTGCGG CGACTGCATT TTAAAATGTC CTCAAAAGAA AATCCCCAGC
GAATTTGATT GCGGACTTGG TAACCGTCCA GCCATTTACA CCCCCTTTGC TCAAGCTGTA
CCAAACATTC CTGTTATTGA TCGCGATCGC TGCACCTACT TCAAAAATGG CAAATGCCAG
CTTTGCGTAA AAGCGTGTCA ACTGAACGCC ATTGATTTTC AACAGCAAGA TGAATTTCTT
GATATTGAAG TTGGCGCTAT TGTGGTAGCT ACGGGCTTCC AAATTCAAAA CAACGCCGTA
TATGGCGAAT ATGGCTACGG CAAATACAAA GATGTAATTA ACGGCTTGCA ATTTGAGCGC
CTTGCTTCTG CCAGCGGTCC AACATCGGGT AAAATTTTGC GCCCATCGGA TGGCACAGAG
CCAGAAACGG TCGTCTTTAT TCAATGCGCA GGTTCACGCG ATCCATCAAA AGGGGTGAAA
TATTGCTCCA AAATTTGCTG CATGTACACC GCCAAACATG CCATGCTTTA TGCCCACAAA
GTGCATCACG GTAAAACCCA TATTTTTTAT ATGGATATTC GTGCCGCAGG TAAAGGGTAC
GATGAATTTA CTCGACGTGC CATTGAGGAG GATGAAGCCT CATACTTGCG CGGTCGCGTA
AGCAAGGTAT GGGAAGAGAA TGGCAAATTG ATGGTACGTG GCGTTGATAC CTTGCTTGGT
AAACCCGTTG AAATTGCGGC TGATATGGTG GTGCTTGCCA CCGCAATTGT GCCTCAGCCC
GATGCAAAAG AGTTTGCTAA AACCATTGGT ATTGGCTACG ACGAATATGG CTTCTACAAT
GAACTGCACC TCAAGTTGCG CCCTGTAGAA TCCTCCACAG CAGGCATTTT TCTTGCAGGC
GCCTGCCAAT CACCAAAAGA TATTCCCGAC TCCGTTGCTC AAGCGTCGGC ATCGGCAAGT
AAAGTGTTAG CGCTCTTTAG CCGCGAAAAG CTTGAACGCG AACCCGTTGT AGCCTCCAAC
AACGAATCAA CATGTGCGGG CTGCTGGGGA TGCGTACTTG CCTGCCCTTA CAACGCTATT
GAGAAAAAAG ATATTTGCGA TCGTCAAGGC AATGTTATCA AGCGCGTTGC ATCGGTTAAT
CCCGGACTCT GCCAAGGATG TGGCACTTGC GTTACCTTCT GCCGCTCGCA TAGCCTCGAT
TTAGCTGGAT TTACCGAAAA GCAAATTTTT GCTGAAGTTA TGGGCTTATA A
 
Protein sequence
MAKIGVFVCH CGENIAGKID CNKLNDAIKE HPGVEIAFDY KYFCSDPGQE SVKKAIRDNH 
LTGIVVAACS PRMHEATFRK ACAEAGLNPY LCEIANIREQ CSWVHSDGDM ATQKAIDITR
SMVEKVKLNN TLQPIEVPVT RKALVIGGGI AGIQAALDIA NAGQEVVLVE REASLGGHMA
QLSETFPTLD CSQCIMTPRM VETAQHPKIK LLTYAEVDSV EGYIGNFKVK VRMKARYVIK
DQCTGCGDCI LKCPQKKIPS EFDCGLGNRP AIYTPFAQAV PNIPVIDRDR CTYFKNGKCQ
LCVKACQLNA IDFQQQDEFL DIEVGAIVVA TGFQIQNNAV YGEYGYGKYK DVINGLQFER
LASASGPTSG KILRPSDGTE PETVVFIQCA GSRDPSKGVK YCSKICCMYT AKHAMLYAHK
VHHGKTHIFY MDIRAAGKGY DEFTRRAIEE DEASYLRGRV SKVWEENGKL MVRGVDTLLG
KPVEIAADMV VLATAIVPQP DAKEFAKTIG IGYDEYGFYN ELHLKLRPVE SSTAGIFLAG
ACQSPKDIPD SVAQASASAS KVLALFSREK LEREPVVASN NESTCAGCWG CVLACPYNAI
EKKDICDRQG NVIKRVASVN PGLCQGCGTC VTFCRSHSLD LAGFTEKQIF AEVMGL