Gene Cagg_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0947 
Symbol 
ID7268020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1175392 
End bp1177443 
Gene Length2052 bp 
Protein Length683 aa 
Translation table11 
GC content61% 
IMG OID643565795 
Producthypothetical protein 
Protein accessionYP_002462301 
Protein GI219847868 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.119116 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA TAACGGTTGG TTCGCTCTTT GGTCTGGCGA CAGTCGTGTT GTTGATCGGC 
ACACTGGCCT CGCCTGCGCT GGTCTCAAAC GCGTTGGCGA CGCCGCCGGC CAAAGAGGTT
GCTATCCTTG TACATGGCTG GCAAGGGTGG AATCTGCTCA ACCCTAGTGG ACGTTCGTGC
GGGGAGAGCC CCATCCGCGC TGATCGCAAC CTCGCCCAAC AAGAGTTTGC CGATATTGGC
GCGTATCTAG TCGATGCCGG CTACGAGGTC TACCTAGCCA GTTGGGAGAC ACGTCCCGGC
TACACAATGC GGGCCGAAGA GGCTGCCGCC AAATGTCTAG CGCCCCAAAT TGCGGCAGTC
GCCGGAGGTG ATCGGGATGG GAAGGTACTG CTGGTTGCCC ACTCAATGGG TGGCGTCGTT
AGCCGAGCGT ACATTGAAGA TACGCAAAAT CCGCCGGTTG AGGCGTTGAT TACTTTGGGT
ACACCTCACG TTGGTGTCAA CTTTGCCTCG TTGATCAAAG TCATTCTCCT TCTCAACCCC
GGTACGAGAC TGTTGGTCGA GCAGCTCTGC GCCAGCGACC CCGGTCTCTG CCAATTGGGC
AGTGACCAGA TGTTGCTGTT TAACACCGTC TATCACCCGA CCGGCGTGCC GTACCTGTTT
GTTGGCGGCT ATGGCGGCCC CGTCTACATG GCCTTCCTCA ACTTCACCGA AGGCCGTAAC
GACGGGATTG TCGGTTATCG GAGCGGAGTC GGCTATCTGT ACAATCCGGT GTCAGATATA
CCGTTCATCG GGCGACCGTT GCGCAATGAT ACGCTGATCG TCAGTGGCGG CGATATTACC
CGCCGCTTCA CCAACGCGGC CCACGTCTCG TTCTTTGAAA CCGGCACGAA ACGTTGGTTC
TTCGCCGATC CCGCCACCAA GAGCTGTGTG GAAGGCTTCG TTGCCGGGAT CGAGAGCGGC
ACAACCGCCT ATCGGAACAC CTGCCTCTCG CACTCTCGGC CTCCGTCACT CTTGGCTGCC
GAGCAGTCGC CGGTCAGTTT CACGCCAGTG GTTGCCACAA CACTTGCGGC TGGTGAGACC
TTTACCGCGC CGATCGTGCT CGATGGCAAT GCCGGCGAAA TCTTGCTCGG CTGGAACGAT
GGCGATCTGG CGCTGACCCT GATCGCGCCC GATGGCCGTG CGATCACGCC GGCCAATCTC
GCCCAAGAAC TGCCCGGTTC GGTGTATCTG ATCGATCCGG TGAATGGTCT GATCACGTAC
CGGCTGACCA ATCCGCCGGC TGGTGAATGG ACGGCGGTGG TAACGGCTGG GCCGGCGACA
ACGACCGCTG AGGTTACGCT CGTGGCTGCG ATGCAGTCAC CGTTGCAACT CTTCCTTGAT
CTGCCATCGT CAGTCGCGAT CGGTGAACCG TTCACTCTCA CGGCGCGGCT GGTCCAAGCG
GCGACGCTGG CTGAGGCGAC CGCCGTTTCA GCGAGCCTGC TGACGCCGCA GGGCGTGCAG
ACGGTTGACC TAGTGCGCAT GGCTGCCGGC GAGTATCGCG GCCAACTGAT TGCACCCGCT
GCGGCAGGGC CGTATGTGAT CTCGGTGAGC GCGAGCGGCG CCACGTTTAG CCGCCAACTC
GAAGCGCTGC TCACGGTGCG TACTCCCGGT CTCGAACGGC ATGGTAGTAC GGCGACGAAC
ACTCCTGATT GGGATAGGAA CGGCAAGTAC GAGCGGTTGC AGATCCACGC GACATACCAA
GTCGCTGCTG CTGACCAGTA TGCGGTGATG GCGACCCTGC AAGACGCCGA TGGGCGCGCG
ATGATGATCA CGCGGACGAC GGTTGAGTGG GCTGCCGGTG CGAATCCGCT GCTGGTTGAG
TTCAACGGCG GTGAGATCGC GTCGACTGGG GTGAATGGGC CGTACCGGGT CGTCACCCAG
ATCGTGCGGG TGAGCGATGG CGCGTTGATG GCCGATGAAC AACCGCTGAT CGATGGTCTG
GACTACCTTG CCTCGGATTT TGAAACTGGG CCGTCGTCGT TGCAGGTCTT CATCCCGATG
CTCCAACGGT AA
 
Protein sequence
MKRITVGSLF GLATVVLLIG TLASPALVSN ALATPPAKEV AILVHGWQGW NLLNPSGRSC 
GESPIRADRN LAQQEFADIG AYLVDAGYEV YLASWETRPG YTMRAEEAAA KCLAPQIAAV
AGGDRDGKVL LVAHSMGGVV SRAYIEDTQN PPVEALITLG TPHVGVNFAS LIKVILLLNP
GTRLLVEQLC ASDPGLCQLG SDQMLLFNTV YHPTGVPYLF VGGYGGPVYM AFLNFTEGRN
DGIVGYRSGV GYLYNPVSDI PFIGRPLRND TLIVSGGDIT RRFTNAAHVS FFETGTKRWF
FADPATKSCV EGFVAGIESG TTAYRNTCLS HSRPPSLLAA EQSPVSFTPV VATTLAAGET
FTAPIVLDGN AGEILLGWND GDLALTLIAP DGRAITPANL AQELPGSVYL IDPVNGLITY
RLTNPPAGEW TAVVTAGPAT TTAEVTLVAA MQSPLQLFLD LPSSVAIGEP FTLTARLVQA
ATLAEATAVS ASLLTPQGVQ TVDLVRMAAG EYRGQLIAPA AAGPYVISVS ASGATFSRQL
EALLTVRTPG LERHGSTATN TPDWDRNGKY ERLQIHATYQ VAAADQYAVM ATLQDADGRA
MMITRTTVEW AAGANPLLVE FNGGEIASTG VNGPYRVVTQ IVRVSDGALM ADEQPLIDGL
DYLASDFETG PSSLQVFIPM LQR