Gene Cagg_2966 details

Gene Information

Locus tag: Cagg_2966
Symbol:
ID: 7266497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3637114
End bp: 3638793
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 55%
IMG OID: 643567788
Product: Cytochrome-c oxidase
Protein accession: YP_002464262
Protein GI: 219849829
COG category: [C] Energy production and conversion
COG ID: [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1
TIGRFAM ID:
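
The length fields above agree with the coordinates if the start and end positions are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for a CDS, assuming an unspliced, in-frame gene."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3637114, 3638793)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

Both values match the Gene Length and Protein Length fields of this record.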


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000678538
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCAGCA TCGCTGCACG CGCCCGACCG GTAGCTATCA GCACCGACCT CAGTGTCGAG 
CGCAAGCTTG CCGGGATCAA TATCTTTATT GCATTTGCCG CTCTTGCCGT TGGCGGTCTG
ATGGGAGTTC TCCAAATCCT GCGCTACAAC GGCTTGGACC TCTACACACC GGCCAAGCCA
ATCCTGCCTG GTGGCTATTA TCAGGGTTTG ACCGTCCACG GTGTGTTGAA CGTGCTCGTT
TTTACCACCT TTTATATTAT CGGCTTCCTG ACCTATATTT TCACCAAATC CCTGGGTCGG
CCCTTAGCCA GTAGCCGACT AGCGTGGGCC ACGTTGTGGG TGATGGTCAG CGGGTTAGTG
TTGGCGGCCA TCCCACTGCT GACCAACAAC GCAACGGTCA TGTTCACCTT TTACCCACCG
CTCAAAGCCG ACTCCCTCTT CTACATTGGG CTGACGCTCG TAGTTGCCGG CACGTGGTTG
CTGCTGGTAA ATATGATCCT CACGCGCAAT GCGTGGAAAG CCGACAACCC CGGTCAGATC
ACGCCACTAC CCGCGTTTAT GGCCATCGTC ACCATGCTAA TGTGGACGAT TTCAACCATC
GGTGTGGCGA GTGAGATGAT CTTTCTGGTG ATTCCGTGGT CGCTCGGACT GGTCGGCGGC
ACCGATCCGG TACTAGCACG CACGCTGTTC TGGTTCACCG GTCACCCCAT CGTCTACTTT
TGGCTCTTAC CGGCGTACAT TTCGTGGTAC ACTCTCGTAC CGGAACAAGC CGGTGGGAAG
CTGTTTAGCG ACCCGATGGC GCGCGTTTCA TTCATTCTCT TCCTGATCCT GTCATTACCG
ATTGGCATGC ACCACCAAGT AGCCGATCCG GGCATTTCCG AGATGTGGAA ACTGGTGCAT
GCTATCTTCA CTTTTGGTGT CTTCTTCCCC AGCTTGCTCA CCTTCTTTAA CGTGGTGGCA
TCACTCGAAT ACGCCGGACG GCGCAACGGC GGTGAGGGCT GGCTGGCGTG GATCTTTAAC
TTGCCGTGGG GCAAACCGGA GATTTCGGCC CAGATCGTGG CAATGATGCT CTTTGCCTTC
GGTGGGATTA GTGGGCTGGT CAATGCTTCG TACAACGTCA ACCTGGTGGT ACACAATACG
TCTTGGATTT CGGGTCACTT CCACCTAACG GTTGGTACGG CAGTAGCGCT GAGCTTTATG
GGCATTACCT ATTGGTTGGT ACCCCACCTC ACCGGACGGC AACTGTGGAG CCGTAGCCTG
GGCGTAGCCC AAGCATGGCT GTGGTTTGTC GGGATGGGAC TGTTCTCACA CTTTATGCAC
GAGCTGGGCT TACGCGGCAT GCCACGACGA ACCGACATCG GTGGTGCGCC CTATGTGCAG
GATGAATGGC GCTTCTTCCT CTTCTTTGCT ATGATCGGCG GGTTGATTAT GTTTGTGAGC
GCCATCTTCT ACTACTTTAA CATGATTATG ACCCTCACCC GTGGGAAGAA GCTGGCCGAA
GCACCCGATT TCAACTTCGC CAAGACGCTT TCTGGCCCTG AAGACGCACC CAAAATCCTT
GATCGGTTGT TGGTGTGGGT CACCGTTGCG GCGATTATTC TGATTATCAA CTACCTCCCA
ACGATCCTTG ACATCTTGAA TACGGCCACG TTTGACGTAC CAGGTCGACG GGTCTGGTAA
 
Protein sequence
MASIAARARP VAISTDLSVE RKLAGINIFI AFAALAVGGL MGVLQILRYN GLDLYTPAKP 
ILPGGYYQGL TVHGVLNVLV FTTFYIIGFL TYIFTKSLGR PLASSRLAWA TLWVMVSGLV
LAAIPLLTNN ATVMFTFYPP LKADSLFYIG LTLVVAGTWL LLVNMILTRN AWKADNPGQI
TPLPAFMAIV TMLMWTISTI GVASEMIFLV IPWSLGLVGG TDPVLARTLF WFTGHPIVYF
WLLPAYISWY TLVPEQAGGK LFSDPMARVS FILFLILSLP IGMHHQVADP GISEMWKLVH
AIFTFGVFFP SLLTFFNVVA SLEYAGRRNG GEGWLAWIFN LPWGKPEISA QIVAMMLFAF
GGISGLVNAS YNVNLVVHNT SWISGHFHLT VGTAVALSFM GITYWLVPHL TGRQLWSRSL
GVAQAWLWFV GMGLFSHFMH ELGLRGMPRR TDIGGAPYVQ DEWRFFLFFA MIGGLIMFVS
AIFYYFNMIM TLTRGKKLAE APDFNFAKTL SGPEDAPKIL DRLLVWVTVA AIILIINYLP
TILDILNTAT FDVPGRRVW
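
Both sequences are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that in Python and to run two simple checks on a nucleotide sequence: GC percentage and the presence of a terminal stop codon. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11. Only the last line of the gene sequence is used as sample input; paste in the full sequence to check the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11

# Last line of the gene sequence as printed in this record.
LAST_LINE = "ACGATCCTTG ACATCTTGAA TACGGCCACG TTTGACGTAC CAGGTCGACG GGTCTGGTAA"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


fragment = clean_sequence(LAST_LINE)
print(len(fragment))             # 60
print(ends_with_stop(fragment))  # True
```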