Gene Cagg_1505 details

Gene Information

Locus tag: Cagg_1505
Symbol:
ID: 7267282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1844891
End bp: 1846459
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 59%
IMG OID: 643566349
Product: tail sheath protein
Protein accession: YP_002462845
Protein GI: 219848412
COG category: [R] General function prediction only
COG ID: [COG3497] Phage tail sheath protein FI
TIGRFAM ID:
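
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the Start bp and End bp values are 1-based inclusive positions (an assumption, though it is consistent with the reported gene length) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1844891, 1846459)
print(length)                  # 1569
print(protein_length(length))  # 522
```

Both results agree with the Gene Length and Protein Length fields of this record.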


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGTCA CTTTGGCGTA TCCGGGTGTC TACATTGAGG AGGTTCCGAG CGGTGTGCGC 
ACGATCACCG GGGTGGCGAC GTCGATTACC GCTTTTGTCG GGCGGGCGCG GCACGGGCCG
GTCAATGATC CGGTGCGAAT TCAGAACTTC GGCGATTACG AACGGATCTT CGGTGGGCTT
TGGGAAGAGA GCACGATGAG TTATGCGGTG CAGCACTTCT TCCTCAACGG CGGCACTGAT
TCCTTGATTG TGCGCCTGAT CAATGGCGCG ACTGTGGCTC GTTTCAATCT TTCGCCCGCG
AGCGGTACCG ATAATCTGAC GCTGGATGCC AGCAACGAAG GCGCGTGGGG CAATAATTTG
CGCATCAGTA TTGATCATGC GACGCGCGAA CCGTTGCAAC CCGATGAGTT TAATCTAACC
GTGGTCGAGA TTATTCCGGG GACGAATCCG GTGCAGGCGG TGCGGCGCGA GACGTTTCGC
AATGTCTCGA TCAATCGGAC TGTGCCGCGT TATGTGGGGA CGGTGCTGGC GCAGGAGTCG
CTGCTGTTGC GTGTGCCCGA CCCCTATGCC GGCACATTTC CGACAGTGCG ACCGGCCGAG
GTTGGCACAG GAGGCGATCT CAGCACCTTC GCTACACCAT CAAGCCCGGG TTCCGACGGA
GAAGCGCTGG TTAGTTCGCA GTATGAAGGT AGCTTTGATA ATAAAACCGG CATCTACGCC
TTGCGCAAAG CCGATATGTT CAACCTGCTC TGCCTCCCGC CGTTCACCCG CGAGACCGAT
GTTAGTGAGA CGGTGTGGAC GAAGGCATTA GCGTTCTGTC GTGATGAGCG AGCGTTTTTG
CTGATTGATG CGCCATCGGG CTGGCGGAGC ATTGAAGCAG CGGTCAGCGG AATTGGTACG
TTTAGTGCAG CGGTTGCGCG TGACGATCAT GCGGCGATCT ATTTTCCGCG AGTGCGCATT
CCTGATCCAT TGCGTGAGAA TCGGTTGGAG GAGTTTGCGC CTTGCGGTGC GGTGGCCGGC
GTCTTTGCTC GCACCGATGC CCAACGCGGG GTGTGGAAGG CTCCGGCCGG ACAGGATGCG
ACGCTGTTTG GCGTGCGAGC GCTGGCGGTC AATTTGACTG ATGGGGAACA GGGCCAACTC
AACCCGCTGG GCGTCAATTG TCTGCGCACC TTCCCGGTGA GCGGCAATGT GGTCTGGGGA
GCGCGCACCT ATCGCGGCGC TGACCAGTTG GCTTCGGAAT GGAAGTACAT CCCGGTGCGC
CGGTTGGCAT TGTTTATCGA AGAGACGCTC TACCGCAGTC TGCAATGGGT GGTGTTTGAG
CCGAACGATG AGCCGCTGTG GGCGCAGATC CGGCTGAACG TAGGCGCGTT TATGCAGACT
CTCTTCCGGC AAGGCGCGTT TCAGGGCCGG TCGCCGCGCG AAGCCTATTT GGTGAAGTGT
GACCGCGAGA CGACGACCCA GACCGACATC AATATGGGGG TAGTAAACAT TGTGGTTGGC
TTTGCGCCGC TCAAGCCGGC TGAGTTTGTG ATTATCAAGA TTCAGCAGTT GGCCGGACAG
ATCGAATAG
 
Protein sequence
MPVTLAYPGV YIEEVPSGVR TITGVATSIT AFVGRARHGP VNDPVRIQNF GDYERIFGGL 
WEESTMSYAV QHFFLNGGTD SLIVRLINGA TVARFNLSPA SGTDNLTLDA SNEGAWGNNL
RISIDHATRE PLQPDEFNLT VVEIIPGTNP VQAVRRETFR NVSINRTVPR YVGTVLAQES
LLLRVPDPYA GTFPTVRPAE VGTGGDLSTF ATPSSPGSDG EALVSSQYEG SFDNKTGIYA
LRKADMFNLL CLPPFTRETD VSETVWTKAL AFCRDERAFL LIDAPSGWRS IEAAVSGIGT
FSAAVARDDH AAIYFPRVRI PDPLRENRLE EFAPCGAVAG VFARTDAQRG VWKAPAGQDA
TLFGVRALAV NLTDGEQGQL NPLGVNCLRT FPVSGNVVWG ARTYRGADQL ASEWKYIPVR
RLALFIEETL YRSLQWVVFE PNDEPLWAQI RLNVGAFMQT LFRQGAFQGR SPREAYLVKC
DRETTTQTDI NMGVVNIVVG FAPLKPAEFV IIKIQQLAGQ IE
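
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The sketch below is one possible way to strip the whitespace, compute GC content, and translate the gene sequence. It assumes the amino acid assignments of translation table 11 (which are the same as those of the standard genetic code) and stops at the first stop codon. All names in it are hypothetical helpers, not functions from an existing library.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGCCGGTCA CTTTGGCGTA TCCGGGTGTC TACATTGAGG AGGTTCCGAG CGGTGTGCGC"
print(translate(first_line))  # MPVTLAYPGVYIEEVPSGVR
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the complete gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.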