Gene Cagg_0377 details


Gene Information

Locus tag: Cagg_0377
Symbol:
ID: 7268478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 468339
End bp: 470924
Gene Length: 2586 bp
Protein Length: 861 aa
Translation table: 11
GC content: 51%
IMG OID: 643565245
Product: arsenite oxidase, large subunit
Protein accession: YP_002461759
Protein GI: 219847326
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR02693] arsenite oxidase, large subunit
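
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 468339
end_bp = 470924

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.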


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.103982
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTG TCCCACGCTT CGATCAATTA CCAATCCCAC CTGCCAATGC GGCGGAGTAC 
AACACAGTTT GCCAGTTTTG CAACGTTGGT TGTGGCTATA AAGTGTACGT CTGGCCGGTC
GACGAAAGCG GCGATGTCGC TGCGACTACT AATGCCTTTA AGCTCGACCT TTCAAAGCCA
CAACCGGCAT TGGCAGGTCA AAGCTATACC GAAACAATGC ACTCGATCAC CGTGGGTAAG
GATGGGCGTC AGTATCATGT GGTGATCGTT CCGGCAATGA ATAGTCCGAT TAATCAGGGC
AATTATTCGT ATCGCGGCGG TGCTAATGCG CTCACCGTGT GGAGCCTTGA TCGCGGTACG
CAAGATCGCT TAACATATCC GTTGCTGCGA GTTGGCGATC AGTTCCAAGC CATTACATGG
CAGGATGCAC TTGCCCTGAT CGCCGGGGTG ATCAAAGGTA TCCGTGACCG TGACAAGAAT
GATGATAATA TCGCCGTGAA ATGCTACGAC CATGGTGGTT CTGGGGCCGG TTTTGAAGAT
AATTATGGTG CCGGTAAGTT GTTTTTTGAT GCATTGTCGG TAAAGCATAT TGCGATCCAC
AATCGGCCTG CCTACAATTC TGAAGTATGG GGAAGCCGTG AGCGTGGTGT GCATGAGCTG
AATTACGACT ATAGCGATGC CCGTCTCGCC GATACCGTGG TACTCTGGGG TGCTAATTCG
TATGAGACGG CGACGATTTT ATATACTCAG CATATTCTTG CTAACATCCA AGGTGCGACC
GTCGCCGAAA AGCGTAAGGC CTTTGATCAG GGTGAACCGG CAGAGCCAGG CTATCTGATC
GTTATCGACC CTCGTAAGAC TTCATCCTAC ACCGTGGCTG AAACGGTCGC CTCAAATCAG
GTGCTGCTGC TCCAACCAAA TTTTGGCACC GATTATATCC TCGCTCATGC AATTGCCCGC
GTAGTTTGGG AGCGGAGCTA CTACGATCTT GATTACCTCA AAGCGCGGAC TGACATGAAA
CTGTTTGAGG AGTATAAGCA AAAGAACCTC AAACTGGATA AGAAATACGC CGACTTTATG
GCTGAGGCTG AGCGCATCAC CGGTGTCCCG AAAGCTAAGA TCGAGCAGGC GGCAGACTGG
ATTGCAAAGC CAAAAGCTGG TAAGTTTAAG CGCCGCACGC TCACGATCTA CGAGAAGGGC
ATCATTTGGA ACATGAAAAA CTACGACCAA GTGGCGGCCT TAGTGCAACT TGCGGTTCTG
ACGCATAACA TCGGTCGGCC CGGTACCGGC TGTGGTCGCC AAGGTGGGCA TCAAGAAGGG
TATGTACGAC CGCCGGCGCC AACACCGGGA TCGATCTACA ACGGTGGCCC GCCTGTGAAT
GTCGATAAGT TTTTAATCGG CGGTAAAGGG AAATTTTATT GGGTCATTGC GAATGATCCC
TACCTCTCGA CACCGAATAA CCAGATTTTC CGTAAGCGTA TTCACGAGCG CACCACGAAA
CTGACTAAAG CGCTGGGTGA GAGTGGCGAG GCCGCTACTA TTCAGGGTCG AATCGATGCG
ATCCTCAAGG CACTCTATAG CGATCCTGAC GCGCTCTTTA TGGTGGTGCA AGATATTTAC
ATGACCGAAA CAGCACGGGA TGCACATCTC ATTCTGCCGG CTGCCGGCTG GGGTGAGGCA
AATGATACTT CTATCAATTG CAATAGCCGT CTGTTACGGT TGTACGAGAA GTTTATGGAT
CCACCCGGTG AGGCGAAGCC GGATTGGGAG ATCTTTAAGC TGGTGGGTGA GGAGATTGCC
AAACTCTACC GCGCTGCAAA GCAGAACGAT GTGGCCGCGA AATTTGAGTT TGGCAAGAAT
TGGAAGACCG ACGAAGATGT CTTCTTGGCA GGGGCGCAGG AGTTTAAAGA CAATCAGGTA
AGTGAAGAGG ATGAGGCTAC GTTAGAGGCC GAGAATTACA AAGGCGTGAC CTACGCCTTC
CTCAAGCAAA AGGGGCAAGA AGGCATTCGC ACGCCGGTAC GCCGCGATCC CAAGACGAAG
AATTTAGTCG GGACACTGCG CCGCTATACG TCTAAGTTTG GGACAGCCGA TGGGAAGTTC
AAGTGGTACG CTACCGATAA TTGGGAGGGG TATCCCGCCG AAGTGGCGAA GTATCTCGAT
GGCACCAAGG CGAAGGAGTA TCCCTTCTGG GTTACTACCG GACGGATTCA GCACCTGTGG
CAATCAACGT ATCACGACCG GCATTTGCCG GAGCGGGAAA TCGCTAATCC GCTACCGTAT
GCCGAGATCA ATCCCGATGA TGCGAAGAAG CTAAATATTC AGTCGGGCGA CTTGATCGAG
ATTTACAACG AGGAGGGGAA TGCGATCTAC ATGGCCTACG TAACCGACGC AGTAAAGCCG
GGAACGATCT TTATGGTCAT GTATCATTGG CGCGGTACGT CAAACTCGTT GGTGAGCGGT
TACACCGATT CGAAGACCAC TATTCCGTGG TATAAGGGTA CACGGGCGAA TATTCGCAAG
GTGGGTAATC CTCCTACGTT CATTCAACTG ACTGCCAGTA CCCTCCAGCA GAATAAGTTT
AACTAA
 
Protein sequence
MAIVPRFDQL PIPPANAAEY NTVCQFCNVG CGYKVYVWPV DESGDVAATT NAFKLDLSKP 
QPALAGQSYT ETMHSITVGK DGRQYHVVIV PAMNSPINQG NYSYRGGANA LTVWSLDRGT
QDRLTYPLLR VGDQFQAITW QDALALIAGV IKGIRDRDKN DDNIAVKCYD HGGSGAGFED
NYGAGKLFFD ALSVKHIAIH NRPAYNSEVW GSRERGVHEL NYDYSDARLA DTVVLWGANS
YETATILYTQ HILANIQGAT VAEKRKAFDQ GEPAEPGYLI VIDPRKTSSY TVAETVASNQ
VLLLQPNFGT DYILAHAIAR VVWERSYYDL DYLKARTDMK LFEEYKQKNL KLDKKYADFM
AEAERITGVP KAKIEQAADW IAKPKAGKFK RRTLTIYEKG IIWNMKNYDQ VAALVQLAVL
THNIGRPGTG CGRQGGHQEG YVRPPAPTPG SIYNGGPPVN VDKFLIGGKG KFYWVIANDP
YLSTPNNQIF RKRIHERTTK LTKALGESGE AATIQGRIDA ILKALYSDPD ALFMVVQDIY
MTETARDAHL ILPAAGWGEA NDTSINCNSR LLRLYEKFMD PPGEAKPDWE IFKLVGEEIA
KLYRAAKQND VAAKFEFGKN WKTDEDVFLA GAQEFKDNQV SEEDEATLEA ENYKGVTYAF
LKQKGQEGIR TPVRRDPKTK NLVGTLRRYT SKFGTADGKF KWYATDNWEG YPAEVAKYLD
GTKAKEYPFW VTTGRIQHLW QSTYHDRHLP EREIANPLPY AEINPDDAKK LNIQSGDLIE
IYNEEGNAIY MAYVTDAVKP GTIFMVMYHW RGTSNSLVSG YTDSKTTIPW YKGTRANIRK
VGNPPTFIQL TASTLQQNKF N
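
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they might be processed, the code below strips the block spacing, computes a GC fraction, and translates DNA to protein. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input, so the GC value it yields describes that fragment, not the whole gene.

```python
# Codon table in NCBI order: bases T, C, A, G for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq_text):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGGCAATTG TCCCACGCTT CGATCAATTA CCAATCCCAC CTGCCAATGC GGCGGAGTAC"
dna = clean(first_line)
peptide = translate(dna)

print(len(dna), round(gc_fraction(dna), 3), peptide)
```

The translated fragment can be compared against the first two blocks of the protein sequence listed above. Running the same functions on the full gene sequence would give a trailing `*` for the final stop codon.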