Gene Cagg_2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2831 
Symbol 
ID7267537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3477921 
End bp3479963 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content60% 
IMG OID643567652 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_002464129 
Protein GI219849696 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTG ACACTTGGTG GATTATCGGG ATTCTAGTAC TGGCGGCGCT GATTCGTATC 
GCGCTTTGGT TGCAGCCACT TCACTTACCG GCTAACGATG AGGTCGAGTA TCTCACCGTG
GCCCGCGATC TGTTAAGCGG GAAGGGATGG AGTTTCTACG AACGTTACCA CTGGTTACGG
GCACCGCTCT ACCCGCTCTG GCTAGCCGGA TCGCTCTGGC TGGCCGGTGG TAACGTGTGG
TTAGCCGCAC TTCCCAACAT TGCGCTGAGT GTGCTCAACG TCTACCTCGC CTACCGTCTC
TCCCAGGCAA TTGGTGATAC CCTCAACCCA GTCGTTCATC GCCTGACCGC TTTCATCACC
GCGATCCTGC TCACGAATAC CACCTTCGCC GCGCTATACA TGAGCGAAAC CCTCTTCACC
ACTCTCTTCA GCGCAGCGCT GTGGCTCCTC TTAGCGTGGC GACAGCGCGG TGCGCGCTGG
CCCGACCGGC GCATCTTCGT CGCCGGTGGG TTGTGGGGTC TCGCCCTCCT GACCCGCTCA
ATGCCACTCT ACTTCACACT CGTCGTCGTC GGATGGATTG GGGTGGTCGC TGCCGGGCAT
TGGCGTACTC TGCTGCGTCG CCCGGCCACC ATCCTGATCG GGGTTGTCTT TGCCATCACC
GCCCTTGCCG TAGTCGCGCC GTGGACAATT CGCAACTGCC TCAGTTACCG TAGTTGCATT
CTGATCGAGA CCGGTCTCTC GTACAACCTG TGGGCATTCA ACGAGCCACG CGAGGATATG
GCAACTATCT TTCGGGTCCT CGAAAATATT CCCGACCCTG CGGAACGGGC AGCCTATGCT
ACGGCCCGCG GTCTCGAGCG TTTACGCGAA GACCCGGCAA TTATGGTTCG CAAATTGTGG
CCGAATTGGC TGGCGATTTG GCGTGTCAAA GCTATTCAAG ACCGCTTTTT GCTTACAGAC
TACCGCGCCG ATCCACCACC GTTGCTCTTC CTCGCCGCCC TGATGTTCGA TGATCTGCTC
TACGGATGTA TCGCGGTGGG GGGAGTTGCG GCGATGGTCT ATGCGGCAAC CCGTAAACGA
GCACCGGCCA TCTTGTTCGT GCTTTGGCTT AGCTATTTCA TTGCGGTATC GTTGGTCACC
CACGGTGAAG GAAGGTACCG CCACTTCATC TTTTGCGCGC TCATTCCCTA CGCGGCGCGG
GCGTGGCTGG AACTGCCCAA CCTGCGCCGC TTGTCCCGGC CTGCATTGAC TGGAGGCATG
GTTGCTGCCG CAATCGTCGG TCTATCGGCC CTCATCGCCT ATCATTGGGA ATACGCCTGG
TACGGTGGCG GGCGTAGCGT ATGGCGTCTC CAAGGCGATC TCGCTCGCGC CACCGGTGAC
CTATCACGGG CGATAGCGGC GTATCGGCAA GCATTAGCAG TGCAACCAAC TCCCGACGGC
TATCTGGTCC TTGGGCATGC CTTACGCGCG CAGGGTGACA GTGACGGCGC CGTCGCTGCC
TATCGCGCCG CCGTGCGCAG CAACATCCTC TATCCGTTGC CCTACGTCTA TCTCGGCGAC
ATCTTGCGTG AACAGGGTGA TGAGAGCGCA GCGCGGCAAA ACTACCGTCC GCAATACGTC
AGCGAACAGA CCTTACTCGA TCTCGCATGG CGCGAGACCA GCCCGGTGGC GCAACCGGTG
ATTGATGTCG GCGGCGGGCT GGATTTCGGC TACCTGCATG GCTTCTATCC AGCCGAAGCG
TTAGCCGGTT CATCAGCACG CTGGACGGGA GCGGTCGCGC GCATCCGGCT CCCGGCGACT
GCGCGGATGG TGCGGCTACG GGTAGCGGCC CTACGGCCCG ATGCGCCGGT GGAGGGTTAC
GTCTGTGTCG AAGGCGACTG TCAGGCGTTT CGGCTCGGTA TTGCATGGCG CTGGCTGATC
GTTCCCTTGC CGCCGGTGAC AATGGGCGAA GGGTTGCGCG AAATTGAGCT ACGAGTAGTG
CCGTGGGCTG CTCCTGACGG GCGTGCGGTG GGAATGGTGG TAGATGTGGT GGAGACACGG
TAA
 
Protein sequence
MKRDTWWIIG ILVLAALIRI ALWLQPLHLP ANDEVEYLTV ARDLLSGKGW SFYERYHWLR 
APLYPLWLAG SLWLAGGNVW LAALPNIALS VLNVYLAYRL SQAIGDTLNP VVHRLTAFIT
AILLTNTTFA ALYMSETLFT TLFSAALWLL LAWRQRGARW PDRRIFVAGG LWGLALLTRS
MPLYFTLVVV GWIGVVAAGH WRTLLRRPAT ILIGVVFAIT ALAVVAPWTI RNCLSYRSCI
LIETGLSYNL WAFNEPREDM ATIFRVLENI PDPAERAAYA TARGLERLRE DPAIMVRKLW
PNWLAIWRVK AIQDRFLLTD YRADPPPLLF LAALMFDDLL YGCIAVGGVA AMVYAATRKR
APAILFVLWL SYFIAVSLVT HGEGRYRHFI FCALIPYAAR AWLELPNLRR LSRPALTGGM
VAAAIVGLSA LIAYHWEYAW YGGGRSVWRL QGDLARATGD LSRAIAAYRQ ALAVQPTPDG
YLVLGHALRA QGDSDGAVAA YRAAVRSNIL YPLPYVYLGD ILREQGDESA ARQNYRPQYV
SEQTLLDLAW RETSPVAQPV IDVGGGLDFG YLHGFYPAEA LAGSSARWTG AVARIRLPAT
ARMVRLRVAA LRPDAPVEGY VCVEGDCQAF RLGIAWRWLI VPLPPVTMGE GLREIELRVV
PWAAPDGRAV GMVVDVVETR