Gene Cagg_3385 details

Gene Information

Locus tag: Cagg_3385
Symbol:
ID: 7267125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4101641
End bp: 4103101
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 58%
IMG OID: 643568194
Product: Polysulphide reductase NrfD
Protein accession: YP_002464665
Protein GI: 219850232
COG category: [C] Energy production and conversion
COG ID: [COG5557] Polysulphide reductase
TIGRFAM ID:
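
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4101641
end_bp = 4103101

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1461 486
```

Both results agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields.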


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.253171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAGG CGCAACCACT CCGAACCCGG CCCGTCGATG ACGGTGAAGC GTACTTGCTG 
CCGGGGGAAA CCTACACGTC GATCACGCAA AAGATCGGCG ATGTCCCGTT AACGCCGCCG
CTGAAGACCC CAAAGGGCTG GTTGGCCGGC TTTAGCGTGG CGTTCTTCTT GTTAATGATC
TTCTTCGTGT CGGTGACGTG GCTGTTCATC CGCGGCGTCG GTATCTGGGG TATTAATATC
CCGGTCGGCT GGGGTATGGA CATCATCAAC TTCGTGTGGT GGATCGGTAT CGGCCACGCC
GGTACGCTTA TCTCGGCCAT TCTGTTGCTG CTTAATCAGG GTTGGCGCAA CTCGATCAAC
CGCTTCGCCG AAGCGATGAC GCTGTTTGCG GTGGCGTGCG CCGGTCTCTA CCCGATCCTG
CACCTCGGTC GACCATGGCT GTTCTACTGG TTGATCCCGT ACCCGAACAC GCACGGCATG
TGGCCACAGT TCCGCAGCGC ACTGGCGTGG GACGTATTTG CGATCTCGAC CTACGCTACG
GTGTCGCTGG TGTTCTGGCT GGTCGGTCTG ATCCCCGACT TTGCGACGCT CCGTGATCGG
GCCAAGAACA TCTGGGTCAA GCGCCTCTAC GGCATTGCGG CCCTTGGTTG GCGTGGATCG
GCCCGCCACT GGCACCGCTA TGAGATAGCC TCGATCTTGC TGGCCGGTCT CTCGACCCCA
CTGGTGGTCT CAGTTCACTC GATCATTTCA CTCGACTTCG CCATTTCGCA AGTCCCCGGC
TGGCAAGTCA CGGTCTTCCC GCCCTACTTC GTGGCCGGCG CCGTGTTTGC CGGGTTTGCG
ATGGTGCTCC TCCTGATGAT CCCGGTACGC ACCTTCTACG GCTTTGAGAG CTACATCACC
ATCCATCACC TCGATGTGAT GGCGAAAGTG ATGCTCGCTA CCGGTATGAT CGTCGTTTAC
GGCTACTTTA TGGAGGTCTT CGCCTCGCTT TACAGCGGCA ATGAATTCGA GGAATACCTC
CTCTACAACC GCCTCTTCGG GCCAAGCTCG TGGGCCTACT GGGGCCTGCT CTTCTGCAAC
GCGGTAGCCA TTCAACCCTT GTGGTTTAAG AAGGTGCGTC AGAATGTCCC AGCCTTGTTG
ATCATCTCAC TAATCGTCAG TGTTGGTATG TGGCTGGAGC GCTACGTGAT TATTGTCATC
TCGCTCGAGC GTGACTTCTT GCCTTCGTCG TGGGACATCT ATATTCCGAC GATTTGGGAC
TGGTCGCTCT ACCTTGGTAC CTTTGGTCTC TTCTTTACCC TGCTCTTCCT CTTCATCCGC
GTCTTACCGA TGATCAACAT CTTTGAGATG CGGCTGTTCC TCCATCAAGA GACAGAGAGG
GCGAAGCAGC GGGCAGAACA CGGTGCACAC AGCCATGGCC ACGATCACAG CCCGGCCCAC
GGTGTAGCCT CGGCAGACTA G
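
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage like the one in the GC content field. The helper names `parse_blocks` and `gc_percent` are hypothetical, and only the first line of the sequence is used for the demonstration.

```python
def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGCACAGG CGCAACCACT CCGAACCCGG CCCGTCGATG ACGGTGAAGC GTACTTGCTG"
seq = parse_blocks(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `parse_blocks` instead of the first line would give the value for the whole gene.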
 
Protein sequence
MAQAQPLRTR PVDDGEAYLL PGETYTSITQ KIGDVPLTPP LKTPKGWLAG FSVAFFLLMI 
FFVSVTWLFI RGVGIWGINI PVGWGMDIIN FVWWIGIGHA GTLISAILLL LNQGWRNSIN
RFAEAMTLFA VACAGLYPIL HLGRPWLFYW LIPYPNTHGM WPQFRSALAW DVFAISTYAT
VSLVFWLVGL IPDFATLRDR AKNIWVKRLY GIAALGWRGS ARHWHRYEIA SILLAGLSTP
LVVSVHSIIS LDFAISQVPG WQVTVFPPYF VAGAVFAGFA MVLLLMIPVR TFYGFESYIT
IHHLDVMAKV MLATGMIVVY GYFMEVFASL YSGNEFEEYL LYNRLFGPSS WAYWGLLFCN
AVAIQPLWFK KVRQNVPALL IISLIVSVGM WLERYVIIVI SLERDFLPSS WDIYIPTIWD
WSLYLGTFGL FFTLLFLFIR VLPMINIFEM RLFLHQETER AKQRAEHGAH SHGHDHSPAH
GVASAD
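
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. This is a minimal sketch, assuming the gene is read in frame from its first base; `translate` and `CODON_TABLE` are illustrative names. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not handle specially (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# These codon assignments are shared by NCBI translation tables 1 and 11.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGCACAGG CGCAACCACT CCGAACCCGG CCCGTCGATG ACGGTGAAGC GTACTTGCTG"
print(translate(first_line))  # MAQAQPLRTRPVDDGEAYLL
```

The output matches the first 20 residues of the protein sequence listed above.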