Gene Cagg_3737 details


Gene Information

Locus tag: Cagg_3737
Symbol:
ID: 7267810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4550192
End bp: 4553182
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 11
GC content: 56%
IMG OID: 643568544
Product: hypothetical protein
Protein accession: YP_002465009
Protein GI: 219850576
COG category:
COG ID:
TIGRFAM ID:
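
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state this) and that the gene length includes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4550192
end_bp = 4553182
gene_length_bp = 2991
protein_length_aa = 996

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus one stop codon.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                      # True
print(gene_length_bp % 3 == 0)                        # True
print(expected_protein_length == protein_length_aa)   # True
```

Under these assumptions the three fields agree: the span is 2991 bp, which is 997 codons, giving 996 residues once the stop codon is set aside.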


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.16903
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAACAC TCGTGTTTGA ACTGATTTTG GTTTTACTTG TTGCTAGTCT CCTCGCGTTC 
TGGAATGGAT GGGGACTGGC ACGTTTGTTG CTCCCGGCGG CCGTTGCGCC CTGGCGAGCG
TTATTGTCAC CACTCCTTGG CTATGCGCTG ACGATCCTCG TTGGTTACTG GGTGGTACGG
TTTATCGGTG GTTTAGGATG GGCGTTGGGA TTGATCGCTC TCCTGAGCGG ATGGTTCAAT
TGGCTCGCAT GGCGATGGTA TGGTCCACCT CAGATCATCA TGGCACTACA CCATCATTGG
CCGGGATTGG TAGTAGCCGG TATTGCGGTT GCGTTTGGAG TTGCACCACT ATTGAGTTAT
GGCTATGCTG CGCCCATCGG TGGCGGCTGG GATATTGAGA ATTACTGGCC TACTGCGCGC
TATCTGGTCC GTGGTCCGGT GAGTGCCATC GCAACGGCAC CACCAAACCC CTTGCGCGAT
ATCAATGCTG ATCCGCCGCG AATTGGCTTG ACCCTTGGTT TTAGTATTTG GCAAGGTAGT
GTTGATCTCC TGAGTGGGAG TGAACCCCTC GTTAGTTTCG CACCATTGCT TGCGTGGCTA
CGTGCGCTTG GCGTCGTCGG GATATATGTG CTCTTACAGG CGGTGTTTAC CCTGCGGCGT
GGGCCGGCAG CGTTTGCGGC TCTTCTGGCG GCACTTAACG GTCTGTTGCT CTGGACGAGC
TACTTCAATT TCGGGATGCA ATTAGCAGCA TGGCCGCTGC TGCCGTTGGT GCTAACGCTT
GGTTTAGCAA CTGTGCGTGC CGATGCCGAA CGTTGGGCAA CCCAATCGTT GGTCGGTCGA
GTTGCCCACT ATTGCGCTGC TGCCATGAGT TTGGCGGCTA TACCGGTTGC CTACTATCCA
GCCCTTGGCC CGCTGGGATT GATGGCAGTG GGGATTGGAA TCGTTGTGCT TGGGCAGACA
CGCGACCGTT GGCGGCTGAT TAAACGGTCG CTGGTACTTC TTCTATTGAC GCTGGCACTG
GCTGCACCGA CGATACCCGA TTATTTTGCC GGCTTCAACT ACCGTTACAG CCTGCCACTT
ACGACGCTTG GTCTCTTTCG TTTTATTCCA CTGAGCGACA TTGTTGGTTT TACCATCTTC
CGCCTACGTG AGGTGTCCGA ACCACTAACA GTACCCGCCT GGGTAGCAGC AGCGATGCTC
ATTGTTCTCG TCGGATATGG CTTGATCACC AGCCCCCAAC GTTGGTACTG GGTTGGGATG
TTGGCCGGCG CAAGTAGCTT TTTACTCTAC CTGCGTTTTG GGGCTGTATA CCACTACGGC
TATCTCAAAG CTGCGGCGTA TGTGGGATGG ATGGCCGGAG CACTAGCAGC AAGTGGTTTG
CAAGCTATGC TCGATCATCT GCGTAGTCGT TCGGGGTGGC AACGAGTGTT GGTTACCGGT
AGTGTGACCG TCTTGATCGG TAGTCCGGTG GCATTGACGG CTATTCGTGT AGTCGCCGAT
CATTGGGGTA AACCTGCCTT ATTTGCCGAT CAATTACCGG TATTACGCGA GCTGCGTCAG
CTTGTTCCTG TCGGTAGCAC GGTGCGTCTT ACCGGTGATC CGCGGGTCGA GGGGGTGACG
AGCGCGCTCG CAGCGTACCT TCTCGATCAC ACACGGGTGT GGGGCAATGT TCGTACCGGT
TATGCTTCAT CGTCCGCCGG TGAATCCGAC GCTATTGCCG AATACGCGCT CTTACAACGG
GATGAAGATC CGACCTTGTG GGGCTACACC GATCCGCCGA TCTGGAGCGG TGGATCATAT
CGCCTGTATC GGCGTCCGTC TGAGACAGTG GCACATCTTA TCTGGCAGCA CATAGTAGAC
AAGGAGGCCG TGACCCTGCC GATCATCGGC GAGCGTCTGG CTTCTGAGGA GACAGTTTTC
GTTGCGGAGC GTGAGCCACG CTGGATCGCG TTACAAACCG CTAGTTTCAC GCCGGCTGAG
CTTATCATCA ATGGTGAGCG ATATGGGGTA CCTGCCGGTC GTTCGCGTGT GGTTATTGGT
CCGCTGCGAA AGGATCAAAA GCTCACCCTC CAAGCCGTCG ATCGGCCGGT TCAGATTCAG
ACGGTGAGCT TGCTCACGAC GCGATCAAAG AGTCAAGTGT CCAGGTTCGC GCATAGTGTC
ACGATCAAGG CTGCCAGTGT TGCAGATGGG TCGACGGTGG CAACAAGTAT TGACATGCTC
AACGCGGACA GTGGTCCGGT GGTGGTTGCA CTTGAATTGT GGGAGCGCCG GCAAGGAATA
CTGTTCGGCC GCTATGGATT GCGTGTTATG CCATCAGCAG AGGTACAACA GGCCACGGTA
ACTCTCGATC TTGCGACCGG TGCAGCGCGT GCGCAAGATG CTGCCGGTGC TCCGATAGTA
CTGGGCGTGG ACCAAGGTGC ATCTTTACCC GGCACCTATA TCGCTCGGCT GTGGGTTGGT
ACTGACCAAC GCGCATTACT GACACCGGTT GATCTATTTA CCTTCACGAT CGATCGGCAA
GGGGCAGTAA CAGTTGATTG GACGGCGCAA ACTTCCTTGT TGACGGCGCA AATTGAGCGA
CCATTACAAC CGCTGAGCGT TCAGTTTGGT GATGATATGC TGCTCCGCGG TTACGATCTA
AGCACGACAC AGGCTACACC CGGTGAGACG ATAGCGTTAA CACTCTGGTG GCAGGCATTG
CGGGGTAATC TCGATGAACG GAGCATCATG GTTCACGTGC GCAACGGCCA AGACGAGCGT
ATCATCGACG CCGATGGTCC ACCGGCGGGT GGTGGCCGGC CAACGAGTGT GTGGCAAGGC
GGTGAGTTGA TTATCGATGA GCGACACATA ACGATTCCTA CCGATGTGCT ACCGGGTCAC
TATTGGTTGG TTATCGGAAG TTACCGATGG CCCTCGCTCG CTCCAATCCC ACGGGTAGGT
GCCGATGAGG CGGTGTGGCG TATTCCGATT GAGATTGTGG CACGGCGTTG A
 
Protein sequence
MITLVFELIL VLLVASLLAF WNGWGLARLL LPAAVAPWRA LLSPLLGYAL TILVGYWVVR 
FIGGLGWALG LIALLSGWFN WLAWRWYGPP QIIMALHHHW PGLVVAGIAV AFGVAPLLSY
GYAAPIGGGW DIENYWPTAR YLVRGPVSAI ATAPPNPLRD INADPPRIGL TLGFSIWQGS
VDLLSGSEPL VSFAPLLAWL RALGVVGIYV LLQAVFTLRR GPAAFAALLA ALNGLLLWTS
YFNFGMQLAA WPLLPLVLTL GLATVRADAE RWATQSLVGR VAHYCAAAMS LAAIPVAYYP
ALGPLGLMAV GIGIVVLGQT RDRWRLIKRS LVLLLLTLAL AAPTIPDYFA GFNYRYSLPL
TTLGLFRFIP LSDIVGFTIF RLREVSEPLT VPAWVAAAML IVLVGYGLIT SPQRWYWVGM
LAGASSFLLY LRFGAVYHYG YLKAAAYVGW MAGALAASGL QAMLDHLRSR SGWQRVLVTG
SVTVLIGSPV ALTAIRVVAD HWGKPALFAD QLPVLRELRQ LVPVGSTVRL TGDPRVEGVT
SALAAYLLDH TRVWGNVRTG YASSSAGESD AIAEYALLQR DEDPTLWGYT DPPIWSGGSY
RLYRRPSETV AHLIWQHIVD KEAVTLPIIG ERLASEETVF VAEREPRWIA LQTASFTPAE
LIINGERYGV PAGRSRVVIG PLRKDQKLTL QAVDRPVQIQ TVSLLTTRSK SQVSRFAHSV
TIKAASVADG STVATSIDML NADSGPVVVA LELWERRQGI LFGRYGLRVM PSAEVQQATV
TLDLATGAAR AQDAAGAPIV LGVDQGASLP GTYIARLWVG TDQRALLTPV DLFTFTIDRQ
GAVTVDWTAQ TSLLTAQIER PLQPLSVQFG DDMLLRGYDL STTQATPGET IALTLWWQAL
RGNLDERSIM VHVRNGQDER IIDADGPPAG GGRPTSVWQG GELIIDERHI TIPTDVLPGH
YWLVIGSYRW PSLAPIPRVG ADEAVWRIPI EIVARR
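
The two listings above are printed in blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the blocking, compute a GC fraction, and translate the gene sequence in frame 1. It is an illustration, not the database's own method: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the NCBI ordering of bases (T, C, A, G). Translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons; since this gene begins with ATG, no special handling of the first codon is needed here.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listing, copied verbatim.
first_line = "ATGATAACAC TCGTGTTTGA ACTGATTTTG GTTTTACTTG TTGCTAGTCT CCTCGCGTTC"
dna = clean_sequence(first_line)
print(len(dna), translate(dna))  # 60 MITLVFELILVLLVASLLAF
```

The 20 residues produced from the first 60 bases match the start of the protein sequence listing. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and protein length fields to be checked in the same way.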