Gene Cagg_0649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0649 
Symbol 
ID7268668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp799740 
End bp801677 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content49% 
IMG OID643565509 
Producthypothetical protein 
Protein accessionYP_002462021 
Protein GI219847588 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0357751 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACCTT TTCTTGAACG CATAAATGAA ATTCTTTTTG CTGCGGTTTT GATTGTCAGT 
TTTTCACTGC TGTCATACAT TATTTTACAG AACTGGCGTA GTGACATCGT GCGTGCGCTC
AGTGTTTTGT TAGCAGGTGT AATCATTGCG TACAGTGGTG ATCTTTTGTT AGCTCGTGCC
CAGCGTCCGG CCACCATTGA GTTTTTAGGT CGAGCACAAT GGCTAGGAAT AGTCCTGGTA
CCGGCAGGGT ATATCCACTT TGCTAATGCT TTGTTAGCTT TTGGTAATGC TAACGCCATC
GCCCAACGCT GGCGACGTAG CGTTTATTTT GCCTATTTGA GTAGTTTTAT CATCTTTTTG
CTCGTGGTTC TTGGTACAAA TCTGGTTATT CACACCGGAA TACCAAGTGG GCCAATTGCG
CAATTTCAAG CAGGGCCGCT TTTTTGGGTC TACGTTATCT TCTTTATCAT TGCGATGGGG
ACGGCCTTGC TGGCAATTCT GATGGTACGG CGCGCGGCGT TAACGCCTTC ACAGCGGCGA
CGATTGGGCT ACTTAGGGGC GACGTTCGCT GCACCCGGTA TTGGTGTGTT CCCTTTTTTG
CTGGTGGCTT CACCAAGTAT GCTACCGGTG AGCGTTATCC TTATGCTCCA AGCGTTAGCG
AGCCTGATCG TCATTGCAAT GATTACGGTG ATGACGTATT CGGTAGCATT TCAAGGTGTG
CTTATTCCTG AGCGCTTGAT CAAACAAGAT TTCGTGCGCT GGTGGTTGTA CGGGCCGTTC
GTTGGCATTG CGACGATTCT GTTTATTCAG GCTGTGCCGG TAATGGCACA AATGTTAGGC
TTGCCGGCGG AAACCCTGAT CACGTTTGGT GTTATGGTGA TGACCGTCTT GATGCCGATC
TTTGTTACGC AGGTAAAACC GTATCTCGAT GCCTTAATTT ATCGCCAAGA TCATGCTGAA
ATCGATTACT TACGTAATTT ACCGCGCAGT GTGTTTACGC GGGCCGATTT ACGAACATTA
CTCGAGAATT CGTTGGTTGC GATTTGTACT CCTTTGCAAG TGAAGACCGG TTTTGTTATT
GCGCCAGGAG AAGATGGATT TAGTATCAAG GCAATCTGGG GATCGCGCCG CGAGGTTCGT
CGGTTAGTCA GTGAACATCC GATTGGCGAT CTTATTCCAC GCTTAGAGGC GATGCCATAC
GATGCGAGCG CGAGCCTGGA TAGTGGCTCG TTCTTGGTGG TCGGTTCGTT TTGCCTATTA
CCATTGCGTA GTCCTGATGG AATGTTTCTA GGGGCGATTG GACTTGAAGC AACCACCGAT
CAATTGCGAC GTAATGGTGG TACCTCACCC GAGATGCGGC GCATGGTGAC CGGATTAGCT
CATCAGATTG AGCTAGCGCT AACAACGGCT CAGATGCAAC GTCAGATTTT TGATGCGTTA
CGCGGTTTAG CCCCTGAAAT GCAATCGTTG CAGCGGTTGA GTTCACGGTT AGAACAGACG
ACCCCGCTGA CCCTGGCTAC ACTCGATGAA GATGTCGTAT TACATCCAGA GTTTTCACAG
TTGGTAAAAG AAGCCTTAAC CCAGTTTTGG GGTGGGCCGA AGCTGGCTGA AAGCCCGTTG
ATCGGTTTGC GCAGTGTGCG GCGCGTGTTG GCCGAGCAGG GCGGAAGTCC GACCCAAGCG
TTGCAGACGG TGTTGCGACA AGCGATCGCG AATCTACGGC CTGATGATCA GATAGATCCC
TCAGCCCAGG AATGGCTATT GTATAATTTA CTCGAAGGCC GGTTTTTACG TCGTCAAACG
GTACGAGACG TGGCCCATCG CTTGGCGATG AGTGAGTCGG ATTTTTATCG CAAACAACGA
GCGGCGATTG AGGAAGTTGC TCGTCAAATT CTTTTGATGG AAGAGCATGA GTATGAAAAT
TCTACTGGCC GAAGATGA
 
Protein sequence
MVPFLERINE ILFAAVLIVS FSLLSYIILQ NWRSDIVRAL SVLLAGVIIA YSGDLLLARA 
QRPATIEFLG RAQWLGIVLV PAGYIHFANA LLAFGNANAI AQRWRRSVYF AYLSSFIIFL
LVVLGTNLVI HTGIPSGPIA QFQAGPLFWV YVIFFIIAMG TALLAILMVR RAALTPSQRR
RLGYLGATFA APGIGVFPFL LVASPSMLPV SVILMLQALA SLIVIAMITV MTYSVAFQGV
LIPERLIKQD FVRWWLYGPF VGIATILFIQ AVPVMAQMLG LPAETLITFG VMVMTVLMPI
FVTQVKPYLD ALIYRQDHAE IDYLRNLPRS VFTRADLRTL LENSLVAICT PLQVKTGFVI
APGEDGFSIK AIWGSRREVR RLVSEHPIGD LIPRLEAMPY DASASLDSGS FLVVGSFCLL
PLRSPDGMFL GAIGLEATTD QLRRNGGTSP EMRRMVTGLA HQIELALTTA QMQRQIFDAL
RGLAPEMQSL QRLSSRLEQT TPLTLATLDE DVVLHPEFSQ LVKEALTQFW GGPKLAESPL
IGLRSVRRVL AEQGGSPTQA LQTVLRQAIA NLRPDDQIDP SAQEWLLYNL LEGRFLRRQT
VRDVAHRLAM SESDFYRKQR AAIEEVARQI LLMEEHEYEN STGRR