Gene Cagg_0631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0631 
Symbol 
ID7266103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp776147 
End bp777910 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content60% 
IMG OID643565492 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_002462004 
Protein GI219847571 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAAGA TCTCACGTCG GCGGTTTCTC AAAGGGACGG TTGCTCTCGG TGCTGGCGCA 
TTGCTTGCGA TCTACAGTGA TGGTAGCTTC CGGCTAGCAT TGGCACAGGA AAACCCTGCG
TTCCGCATGC GGATTCTGCA CACCAATGAC CACCATGCCC GGATTGAGCC TGTGTTCAGC
GGTAACAATC CGGTTCACGG CGGTGTCTCG CGCCGTAAAG CGTTAATTGA CAAGATCCGT
CGCGAGACGG CACTGCCGAC CTTACTGGTT GATGCCGGTG ATGTATTTCA AGGGACGCTC
TACTTTAACC AATACAACGG CATGGCCGAC CTCGAGTTCT ATAACGCAAT GGGCTATGAG
GCGATGGCCG TCGGTAATCA CGAATTTGAC AAAGGGCCGC AGGCATTAGT CGATTTTATT
ACGCGTGCCA AATTCCCGGT GTTAAGTGCT AACATCTCGG TTGCCGCCGG CAACCCACTG
GCCGGTCTGA TCAAGCCGCG CACCATCATT GAGAAAGATG GTAAGAAGAT TGGGATTTTC
AGCCTTACGC CTGAAGATAC CGGTGTGCTG TCGAATGCCG GCCCCGGCAT TAGCTTCACA
TCGGCGATTG AAGCGGCACG GCAGCAGGTT GCCGCGCTGA AGGCGGAAGG TGTCTTCACG
ATCATCGCTC TGACCCACGT CGGGATTAAT GTTGATCGCC AGATTGCACG CGAAGTTGGT
GGAATGAGTC TGATTATTGG CGGCCACTCA CACACGCCGA TGGCACCGAT GAACAATGTG
CGCACGCCGC CGTACCCCGA ACTCATCGCC GGGCCGGATG GCAAGCCGGT GGTGGTCGTT
ACCGATTGGG AGTGGGGGCG CTGGCTAGGT GACATCACCG TAGCCTTTAA TGCCGCCGGC
ACGGTAATCG ACTTACAGGG CAACCCGACT GAGGTGCTGC CGTCGTTGCC GGCGGATCAG
GGGTTCGAGA ACCGGATTGC GGTCTTCAAG GGGCCAATCG AGCAGTTGCG TGCGCGGGTG
GTTGGTTCGG CAGCGGTCGA TCTCGATGGC AGCCGGACCA ACATCCGCTC ACGCGAGACC
AATCTCGGCA ATCTCGTGGC CGAGGCGATG CTGGCGAAGG CGCGTAATTC AGGAGCCACT
ATCGCCATTA CCAACGGTGG CGGTATTCGA GCGTCGATCC CTGCCGGTCC GGTAACTGTC
GGCCAGATTT TAGAGGTCTT GCCGTTCGGT AATACGCTGG CGCTCGTAAC ACTCACCGGG
TCACAGGTCA TCGAAGCGCT TAACAATGGT GTAAGCCAGG TTGAGAGCGG TGCCGGTCGG
TTCCCGCAAG TGGCCGGGCT ACGCTTCACC TACGATCCGT CACTGCCAGC AGCCAGCCGG
GTGACGAGTG TGACCGTCGG TGGCGCGCCG ATCGATCAAA ACGCCAGCTA CGTCGTCGTC
ACCAACAACT TTATGCTGAC CGGCGGCGAC GGCTACAGCG TCTTTATCCG CGGGCGCAAT
CAGGTTGACA CCGGCTTCAT TCTGGCCGAC GTGGTAGAGG AATACATCGC CGCCAATTCA
CCGGTCAATC CGGCGGTCGA TGGGCGCATT GCTATCGGTG CAGCACCGGC AACGACACCG
GCGCAACCGG AGACGCCGGC GCAGCCGGTG CCGGCAACGT TGCCTAACAC GGGTGGCGCG
CTGACGCCAC TGGCGTGGCT GGCCGGGTTG GGTGCGGCGG CGCTGGCCGG TGGTGCCGCG
TTGCAGCGTA GTGAGAAGGA GTAA
 
Protein sequence
MEKISRRRFL KGTVALGAGA LLAIYSDGSF RLALAQENPA FRMRILHTND HHARIEPVFS 
GNNPVHGGVS RRKALIDKIR RETALPTLLV DAGDVFQGTL YFNQYNGMAD LEFYNAMGYE
AMAVGNHEFD KGPQALVDFI TRAKFPVLSA NISVAAGNPL AGLIKPRTII EKDGKKIGIF
SLTPEDTGVL SNAGPGISFT SAIEAARQQV AALKAEGVFT IIALTHVGIN VDRQIAREVG
GMSLIIGGHS HTPMAPMNNV RTPPYPELIA GPDGKPVVVV TDWEWGRWLG DITVAFNAAG
TVIDLQGNPT EVLPSLPADQ GFENRIAVFK GPIEQLRARV VGSAAVDLDG SRTNIRSRET
NLGNLVAEAM LAKARNSGAT IAITNGGGIR ASIPAGPVTV GQILEVLPFG NTLALVTLTG
SQVIEALNNG VSQVESGAGR FPQVAGLRFT YDPSLPAASR VTSVTVGGAP IDQNASYVVV
TNNFMLTGGD GYSVFIRGRN QVDTGFILAD VVEEYIAANS PVNPAVDGRI AIGAAPATTP
AQPETPAQPV PATLPNTGGA LTPLAWLAGL GAAALAGGAA LQRSEKE