Gene Cagg_3475 details


Gene Information

Locus tag: Cagg_3475
Symbol:
ID: 7269701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4236054
End bp: 4239476
Gene Length: 3423 bp
Protein Length: 1140 aa
Translation table: 11
GC content: 57%
IMG OID: 643568284
Product: NHL repeat protein
Protein accession: YP_002464751
Protein GI: 219850318
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4745] Predicted membrane-bound mannosyltransferase
TIGRFAM ID: [TIGR03663] conserved hypothetical protein TIGR03663
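
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function name check_gene_record is hypothetical, and it assumes that coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon that
    is excluded from the protein length.
    """
    # Inclusive span covered on the replicon
    span_bp = end_bp - start_bp + 1
    # Number of complete codons and any leftover bases
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_bp": span_bp,
        "span_matches_length": span_bp == gene_length_bp,
        "whole_codons": remainder == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record
result = check_gene_record(4236054, 4239476, 3423, 1140)
print(result)
```

With the values from this record, the span is 4239476 - 4236054 + 1 = 3423 bp, which is 1141 codons: 1140 amino acids plus the terminal TGA stop codon.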


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGCAATG AGAAGGTTCA TGTAGGTCTG ATCGAGACGC TGATCGGTAT TGGTATGGCC 
GTACAAACAC TGACAACGGA AACTGTGCTC GACCGACGAT TACGGGCCGG CTGGCTAAGG
TGGGAGACGG CACTCTACAC CTTGATCGTG ATTGCCAGCG TCCTGGCTCA TCTGTGGGGG
CTGGAGCGTA TGGCTCTGCA TCACGATGAG TCCATCCATG CGTGGTCGAG TTGGCGGCTC
TATAGTGGGG CGGGGTCGTT TTCGTGCTGG AATGGACTGG ACGAGAACGG GAATGCACGG
GGCGGCTTGT TTCACGAGAC ATATTGCTAC GATCCGGTTT ACCACGGCCC GTCGCTCTAC
TTTCTCACGG CTCTGATCTA CTTTCTGTTT GGTGATGGCG ATGCGCAAGC CCGTTTACCG
ATGGCGCTAG CCGGGATCGG TCTGGTGATG TCGGCGTGGT GGTTACGTCC ATACCTTGGC
CGTGCAGGAG CGTTGATTGC AGCAGTGTTG CTCGGTTTTT CACCGTCGTT GCTCTACTAT
ACCCGTTTTG CCCGTCACGA TGGCTTGATG GTGCTGTGGG AATTGTGGAT GGTCATCGGG
GCGCTGCGCT GGATCGATAG TGGGCAGCGA CAGTGGCTCT ATCTGACGGC GGTAGGGTTG
GCGTTGGCGA TTGCCACCCA TGAGCTGTAC TATATTTTGC TCTTTATTTT TGGTGTGTTT
GTGCTGATGC GATTGTTGGC GGAAAGCCGG TTTGCTCGGT ATCAAAACAT TGTGCTGCCG
GTGATCATCG GTATCTGTGT GGTGTTGATG ATTGTCAACC CGCCGCTGCC GTTTGGTAGA
GGGTTATACA TCGGTGAGAA AGCCTTCTTG GTTGCTTCAG CCCTGACATT AGCATGGCTT
TGCCAGCGCT TGTGGCCGCC GGAGCCGATC CTGATACCAC GCCTGCAACA CCTCTGGCGC
AGCGAACGTT CGGTGCTCTG GACGGCGTTG GCGGTATTGG GTGGGATTTA TCTGGTTCTA
TTCACGAGTT TCTTCACCTA CCTGCCCGGT GCGATTGATG GTATCAGTGC CGGTTTGATT
TACTGGCTGG GTAGTCAGCA AGAGTATGCG CGTGGCGACC AGCCGTGGTA CTACTACCTC
ATCCTATTGC CGCTCTATGA GCCGCTGGCC GTACTAAGCG GGATTGGGGT TGTGGTGGCG
ATGATCGTGG CAGTTGTGCG ACGCTGGCGG GCCGGTCGAA CGGTACCACC GCCGGTCGCT
GATGCGGCAA CGTCGGATAT GGACGATGTT GATGTGGCTG CCCGATTGAA TGTCGCAAAG
CCATGGCCGC TGTATCCCTT GTTGGTCGTA TTTTGGTTTT TTACCGCGAT CATTATCTTC
TCGTGGGCCG GTGAAAAAAT GCCGTGGTTG GTAACGCATA TGGCGTTGCC GGGCAATCTG
TTGGCGGCGT GGGTCATCAG CCGTCTCAGT GACATGATCC AACGCGAGCA GACGTCGGCG
CGAATCTGGC TTGTTCCATT ACTCGTGATT TTGCTGCTGG TGGCGGTAGG GGTGACCTTT
TGGCGGTTGG GAAGTGGTGG TACAACGGCA TTCTTGCAGG CAATTGTCCC CTTGGCGATT
GTGTTTGGTT TGATCTATGC CCTGTTGACG CTGATCGGTC AGTTGGGGAT ACGACGGACG
AGTGCCGCGA TTGGGTTGAC GGTTGCAGCA CTCTTGGCGA TGTATACGAT ACGGGCTACG
TGGCTGGTTG TCTACGATCA TCCCGATGTG CCGGTCGAGC CGTTGATCTA CACCCAAACC
GCGCCCGATG TGCCACGCTA CGCTGCCGAT ATTCGTGATT TGGCGATCAA TTTGACCCGT
GGCAATCGTA GTCCGCAAGA TACCACCGGT GGCTTGACGA TGCCGATCAT TCTTGATGGT
GGTGATCGCA CTGCCGAGGG ATCGCTGGCG TGGCCGTTGC AGTGGTATCT GCGCGATTTT
AAGAGTATCG TGTGGGTGAA CGGCAGTGAA CTGGGACGGG TGCCGTCGGT TGAGCAATTG
ACCGTGACGA TGCCTGATGG GAGCCGTGAT TTGGCGCCGA TGGTCATGCT CTACCGCCCG
TATGTTACCG ACCGGCTGCG CAATATCTTG CGCGAGTCGT ATGTGCAGCC CTACGGTACA
GCGGGGGTGT TTAACTGGTG GTTCCCTGAA GGCAACAAGT GCGCTCCGCA AAGCCCCGGC
TACAAGAAGT TTTACTACAG TACGTGGTCG GCAGCGGCAG CCCAGGCCGC TTGTGGTCGC
GATCTGAGCG GTGAGTTGCA CGGGCCGTTT GATGTGTTGT TGTGGCCATT GCAACGCGAG
AACTGGCCGG CGTTAGGCCG CTATCTTCTC TTCCGTCAGT TGCCTGACCC ACTCGTGCCC
GGTGCACGTG AGGTGGAGGT CTGGTTGCGA CGCGATTTAG CCGGTGGCGT TGGGCAGACA
ATGACGGCGA CTACTACGCC GACGTTGCGG TTGGCTGCGC TAGCCGAGAT TCGGCTGTCG
AGCGGTGGGA ATGGTGCTAC CGGGATTGCC TTCGACCGGC AGGGAAATAT CTATGTCGCC
GATACGCTCA ACCATCGGAT TGAAGTCTTT GCGGCGGACG GTACGCCGAT ACGCACGATT
GGTACGCAAG GCAATGCGCT CGATCAGTTC TATGAGCCGC GGGGGTTGGC CTTCGATGCG
CAAGGGAACC TCTACGTGGC CGATACGTGG AACGCACGGA TTGTGAAGTA CAGTCCCGAT
TTGCGGCCAA TGACGAGTTG GGGTGGTGGC GATCTCGATC TCGGCGATGG GCGACGAGCG
ACAATCACCG AAGGTGATCC GGCGCGGAAC GCTGCGGCAC CGCTTGGTTT CTTTGGACCA
CGCGGGGTAG CGGTTGATGC CGCCGGTAAC GTCTATATTG CCGACACGGG TAATAAGCGG
ATCGTTGTGA CCGATAGTAA TGGGACGTTC CTGTACCAAT TTGGTGGTGC AGGCAGTGCG
CCCGGCCAGT TCAATGAGCC GACCAGCCTG GCGTTTGATG CTGCCGGTAA CCTGTATGTG
GCCGATACGT GGAATGGGCG GGTCCAAGTG TTTACCCGTA CCGCCGATGG TCGGATCGAT
CCGACTCCGC TGACAACGTG GCCGGTAGCG GGTTGGCAGC CCAATACCTA TGACGATCCG
ATGTTGGCGG TTAGTCCTGA TGGTATGGTT TACGTAGCAG TACCGGCGCG CCAGTATATC
CTTGTGGCGA GCACCGGTGG CGAGGCGCTG TTGCAATGGA CCGGCTTTGG CAGGGATGGC
GTCCCGATCA CGAGTCCGAG TGGGTTGGCA GTTGCGACTA ATGGAAGTAT TTGGGTGGTG
GACCGACTAG GTGGACGAGC AGCGCGCTTC GCGTTACCGG CACTGGCACC GACGCAACCA
TGA
 
Protein sequence
MGNEKVHVGL IETLIGIGMA VQTLTTETVL DRRLRAGWLR WETALYTLIV IASVLAHLWG 
LERMALHHDE SIHAWSSWRL YSGAGSFSCW NGLDENGNAR GGLFHETYCY DPVYHGPSLY
FLTALIYFLF GDGDAQARLP MALAGIGLVM SAWWLRPYLG RAGALIAAVL LGFSPSLLYY
TRFARHDGLM VLWELWMVIG ALRWIDSGQR QWLYLTAVGL ALAIATHELY YILLFIFGVF
VLMRLLAESR FARYQNIVLP VIIGICVVLM IVNPPLPFGR GLYIGEKAFL VASALTLAWL
CQRLWPPEPI LIPRLQHLWR SERSVLWTAL AVLGGIYLVL FTSFFTYLPG AIDGISAGLI
YWLGSQQEYA RGDQPWYYYL ILLPLYEPLA VLSGIGVVVA MIVAVVRRWR AGRTVPPPVA
DAATSDMDDV DVAARLNVAK PWPLYPLLVV FWFFTAIIIF SWAGEKMPWL VTHMALPGNL
LAAWVISRLS DMIQREQTSA RIWLVPLLVI LLLVAVGVTF WRLGSGGTTA FLQAIVPLAI
VFGLIYALLT LIGQLGIRRT SAAIGLTVAA LLAMYTIRAT WLVVYDHPDV PVEPLIYTQT
APDVPRYAAD IRDLAINLTR GNRSPQDTTG GLTMPIILDG GDRTAEGSLA WPLQWYLRDF
KSIVWVNGSE LGRVPSVEQL TVTMPDGSRD LAPMVMLYRP YVTDRLRNIL RESYVQPYGT
AGVFNWWFPE GNKCAPQSPG YKKFYYSTWS AAAAQAACGR DLSGELHGPF DVLLWPLQRE
NWPALGRYLL FRQLPDPLVP GAREVEVWLR RDLAGGVGQT MTATTTPTLR LAALAEIRLS
SGGNGATGIA FDRQGNIYVA DTLNHRIEVF AADGTPIRTI GTQGNALDQF YEPRGLAFDA
QGNLYVADTW NARIVKYSPD LRPMTSWGGG DLDLGDGRRA TITEGDPARN AAAPLGFFGP
RGVAVDAAGN VYIADTGNKR IVVTDSNGTF LYQFGGAGSA PGQFNEPTSL AFDAAGNLYV
ADTWNGRVQV FTRTADGRID PTPLTTWPVA GWQPNTYDDP MLAVSPDGMV YVAVPARQYI
LVASTGGEAL LQWTGFGRDG VPITSPSGLA VATNGSIWVV DRLGGRAARF ALPALAPTQP