Gene Cagg_0161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0161 
Symbol 
ID7269076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp212979 
End bp215015 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content56% 
IMG OID643565033 
Producthypothetical protein 
Protein accessionYP_002461548 
Protein GI219847115 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0126763 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTGG GTCTCGGCTT ACGACTATGG TTTTTAGCCG TCAACCGGAT CGACCCGCGC 
TTTTCCGCCG CCGATGACGG CGATTATTAT GTGCGTGCGT TGCAATTTGC CGTTACCGGT
GAATACCACG ACAACTCGTG GTTGGTACGC CCACCCGGCC ACATCTTCTT CTTTGCGGCT
ATGCTGAAGA TCGGTCTTTG GTTGGGTGAT CCGGCAATTG GCATCTCCTT AATCCGGGCC
GTGCAGGTTG GATTATCGCT TGCCCTGATC CCACTTGGCT ATGACATAGC GCAGCGTCTA
TTCGACCGGC GTACCGGTGT GATCTTTGCC ACAATTCTCG CCGTTTGGAT GCCGATGGTT
GAATTGCCGG CTCTGATACT GAGTGAGCCG CTCTTCTTCA GCATGCTGGT GATCCACGCA
TGGATGCTCG TGCGTTGGCG TGACGAACGA CGATCAGGTT GGTTGATCGG GGCCGGTATC
ACGCTGGCGT TGGCTGCTCT GGCCCGCTCT CCCGGTCTGT ACGGTGTACC GTTTGCGGTA
CTGTTTATCG CTCTGAGCGC ATGGCACGCT GCGCATCAAC CGCGTCTCCG GCGTGTGATA
CCGGCATTGC TGAGCTTCTT GTTGCCATTC GCAATAACCA TTGCGCCGTG GACGATCCGT
AATTATCTGC TGTACCACGA CCTCATCGTG GTTGACACCC TCGGCCCGGT CAATCTGTGG
ATTGCGATGA GCGATGCTGT GCATGAAGGG CGTGGTGAAG GCGAGGCCAA AGCGATATTA
CTACAAATTC CACAAAGTGA GCGGCAACGA TTTGTCAGTG CTGAACTGAG ACGAATTTTA
CAAACGGAAC CTTGGCGATT TACTCGCAAT TTCTGGCCAC ATTTTCAACA TATCTGGAAG
GCTCAATTTA TTGAAGATTT TTTTGTTAAA GCGAGCTTCT TCACTCGTCC GTTACGCTCG
GTGTGGCCGC TCGGGCTTAT CGGCGATCTG ATATGGTTCG CGTTTATCGT CGCTGCACCG
TTTACGCTGC TATCACGTCT GCGCGAAGGA GCCTTCCGCA TTATTGCTCT CGGTTGGATT
GGGTATACCT GCCTTATGGT GATGCTCACT CACGTCGAGC CGCGTTATCT CTTGCCCATC
TGGCTCTGGT TAGCGCTGTA TGGCGCAGCA GCAGTGGCGC GAATCGGTCA ACAGCCGTTG
CGCTTCGATC GATATAGCCG GGCGGGGTTG GCGGTAAGCT TGGCTCTGAT CGCGCTGATC
ATCGGCTACC GCGATTACCC GCAGGTCATT CGGAACGGGA TCGCACGCGA ACAGGCGTGG
ATGACTGCCC AACAAGCCAT CGCCCGCAAT GATCCTTCGG CGGTTGAGCA GGCATATCAG
GCGATGTTAA CTGCCGATCC CGATTTTGCC GATGGACGTA CCGATTTTGC CCGTTGGCTT
CTTGCCCAAG GTCGCTACGA TGAGACATGG CAGGTGATCG GTGATTACCA AACCCACCGT
GGTAATTTAA TCCGTGGGGC ATTAGCTCGT GCCCAAGGTG ATGCCGAAAC GGCAAGGTTG
CTTCTGCGCA ATACTGAAGA GTTGGGCGGT GAGGATGTTC AGCGGCTAGC CTTGGAGTGG
CTTTCACCAT CGCCCACGTC TGTTCTTACC CTCGGCAACG ATCTCGACCT CGGTTACGTG
ATGGGATTTG CCCTCGGTGA ACGGGTAGGT GATACGACGT TTCGCTGGTT ACAACGCGAG
GGTGTCATTC GTTTGCCGGT ACCGACCGCG CTGACCGGCA CCGAGATCAT CGCGCTACGG
CTCGCTGCCC CCCAACCAAC GCCCCTCACA GTAATGGTCG GTTCCCAGGC ATACCAGATC
AACGTTGTAC CGGGCGGTTG GCGGGTGTAT CTTCTCCCCC TCCCGGCAAC CACCCGAGGT
ACCGATGAAG TGGTCATCAC GCTACAGGCG CCGACGTTTG TACCGTACCG TCAATTTGCC
GATAATGCCG ATGCACGCCC GCTAAGCGTG ATGGTAAACC AAATAGCCAT ACGATAA
 
Protein sequence
MLLGLGLRLW FLAVNRIDPR FSAADDGDYY VRALQFAVTG EYHDNSWLVR PPGHIFFFAA 
MLKIGLWLGD PAIGISLIRA VQVGLSLALI PLGYDIAQRL FDRRTGVIFA TILAVWMPMV
ELPALILSEP LFFSMLVIHA WMLVRWRDER RSGWLIGAGI TLALAALARS PGLYGVPFAV
LFIALSAWHA AHQPRLRRVI PALLSFLLPF AITIAPWTIR NYLLYHDLIV VDTLGPVNLW
IAMSDAVHEG RGEGEAKAIL LQIPQSERQR FVSAELRRIL QTEPWRFTRN FWPHFQHIWK
AQFIEDFFVK ASFFTRPLRS VWPLGLIGDL IWFAFIVAAP FTLLSRLREG AFRIIALGWI
GYTCLMVMLT HVEPRYLLPI WLWLALYGAA AVARIGQQPL RFDRYSRAGL AVSLALIALI
IGYRDYPQVI RNGIAREQAW MTAQQAIARN DPSAVEQAYQ AMLTADPDFA DGRTDFARWL
LAQGRYDETW QVIGDYQTHR GNLIRGALAR AQGDAETARL LLRNTEELGG EDVQRLALEW
LSPSPTSVLT LGNDLDLGYV MGFALGERVG DTTFRWLQRE GVIRLPVPTA LTGTEIIALR
LAAPQPTPLT VMVGSQAYQI NVVPGGWRVY LLPLPATTRG TDEVVITLQA PTFVPYRQFA
DNADARPLSV MVNQIAIR