Gene Cagg_3577 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3577 
Symbol 
ID7269721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4348878 
End bp4351946 
Gene Length3069 bp 
Protein Length1022 aa 
Translation table11 
GC content55% 
IMG OID643568385 
ProductFG-GAP repeat protein 
Protein accessionYP_002464851 
Protein GI219850418 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00287898 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGAACC GAACACGTTC TTTGTTCGTG CTCACGGCCA TGGTAGCGGT GATACTGACC 
ACAATGCCGG TTAAGGCTAC CTTTGGCTGC CAAGGCGAGC TTGATGAGAA GTCGCGGTTG
GGGTTGATGC CACCGGCATG CGGCCCGGCT CTCAACGAAC CGGTAACGAC CACGACGGCT
GAGGCGACAA TGTTTGCCTC CCCGCTTGAG GTGTCGGTGT CCAGTTGGCC GCAAGTGGTG
GCAGCGGTAA ACCTTGATAC CGACTCGGCT GCCGAGTTGC CGTTGTTGAC CGATCAGTAT
TTTGCACCAG AGACCGATCG GTCGTTGTTT GTCTACGATC TACAAGATAG CACATTGCGC
GCGGTGCAGC AGGTGGCAGC CGGTTCTACA CCGTCGGCTA TGGCAGTCAC CGCCGGTCAG
TCTGGCCCTC TCGTGGCCGC AGCGTTAGCC GGAAGCAATG CGTTGGCGGT GTATACCGGC
ACAGTGCCTT TGCGTGGTCC GCATATGCTG CGCCAGACCG GTTCGCCTGA TGCTGTGGCC
TTTGCCGATG TAAATGGCGA TTTGTGGCCC GACCTTGCAG CGGTTAGTCC AGAAATTGAT
ACGATCACGA TCCGTAATCC GCATCAAGCC GGTATGCCGA TCCTGTTGCA GGTACCCTTC
GCCACCGATG GGTTTAGTGC GTTACGGGTT GGTGATCTCG ATAATAATGG GTTTGATGAT
CTGGTCGTGT TGCGTGGCGC CGGTTATACG CGCGATTCGG TGGTGATTTT GTTGCAAGGG
AGTAGTGGTT TTCCCCATCA ATTAACGCTG AGTCCGGAAA CAGGTGGTTT TTTACCGCAT
AGTCTGGCAG TTGGCGATCT GAATGGCGAT GGCCGTGATG ATATTGCCGT GGTAGCCGGT
GGCAATAGTC CAAACGCTTT TCTTAGTGTG TTTCTGCAAA CAACGAGTGG TTTCACCGCT
TTGCCACCGT TGCCGACATT CCACATCCCC GGTGCCGTGT CAATTGCCGA CGTTACCCAC
GACGGGCGGA ATGATATAGT GGTGTTTCAC CATGCATGGC GTACATTGAG TGTCTATGCT
CAACAGTCTG ATGGTCGTTT TGCCGAGCCG ATGACGAAAA CGGTACCGTA TAGTGGATCG
CGTCGGCCTG ACTCGTTAAG TGTGGCCGAT GTGAATGGTG ATGGTGGTTT AGATGTGGCC
CTGGTGGGTC GCCAACCCGG TCTGACAGTG CTGCTCAATA CGGCCGGTGC GCCGGTAGCG
ACGATCACGT CACCGGACGC AACGACACGC TTGCCGGCCG GTCCACTGCT TGTTCGGGGC
ACGATGAGTA GCGGTACTGT GCAGGTTGAG GTGCGGATTA AGGGGGTCAC TGATTGGCAA
CCGGCAACGT TGAGTGCAAG CGAGTGGCAG GCTACGCTAA CCTTGCCTGA TGCAGTACGG
CCATATACGA TTGAAGCACG AGCAATTGAT AGTAGTGGCC GAGTGCAAGC ACCGCCGGCA
CAGCTACGGA TACAGGTCGC GCCATTACTG ATCGGTTATG CAGTGGCCGA TAACGGCGAA
TCGCCTGTCC CCGGTGATCG CCTGATGCGG TTTGATCCGG AGACCGGTGC AGCGACGCTG
ATCGGCTCGA CCGGTACCGA TCATATTCAC GCCATCACGT TTTTGCCCGG TACTAATACG
TTGTTGGCGG CCAACCTTGA TCGTCTCGGT ACGATCGATC TCACCACCGG CCGCTTTACC
CCATTCCCGC GCCCGTTTGG TGTCGGACGC AACGGGAGCC GGACGCGCGT CTTCACTAAT
GCCCATGGGT TGACCCCCGA TCCGCGCGAT GGTACGCTCT TTGCCGCAAT GCGGTTGGGA
AATGGGCAGC CTGATCTGTT GTTCAAACTC AATCCGGTTA CCGGTACGTA TATACCTGAT
GCGTTTGGGC CGGGTAAGGA TTTTGTGGAG GTAACCGGTG TTGGTGTTGG TGATGATATT
GACGATCTGG CCTTTGACCC CGCTACCAAC ACGCTATACG CCATTGCAAA CGTTGATGGT
CGCCGAGATC AACTTGTGAC GCTCGATCCC GTTACCGGTG TTGCAACAGT GATCGGTAGT
CTTGGGGTTG AGAATATGGA AGGGTTAGCT CTCGCGCCAA ATGGTGTACT GTACGGTAGT
ACCGGTAGCT CACGGCCGGC AACGCGCGAT CGACTATGGA CGATCAATCG TACTAACGGA
ACGGCCAGTT TGGTCGGGCC GTTTGGGATC GAAACTGACT ACGAGGCAAT TGCTTTCGTA
CCGCCGCGAG TGACCGTTCC TACCCTTGCC ACAGTGCCGC GGTTGAGTAG TCTCATGGTA
CATAATGCTC AATTGACGAC TTCTGACTTA TCGGTCACCA TTCAAGGTGG GATAGCTCAC
GGTGATTCGT CGAATCTCGT ACTCATCACC GAATATGCGT TTGATCCGGT ACAGAGCGAT
TGGCAACCGC ATACCGGTAG TGTGCAACTG GCAACGACCT ATATTCCGGC AGATACGCTG
GCTAACGGTG TGTCGTGGCA ATTAGTACGC ACTGTTGGAG CGCATTATAT TCTCGCAGGA
GTCGTCAATG ATAACGGTCA ATCACTATCT CTACCTGAGC GTGCTCTCAT CAACTATCTG
CCGCCACAAT TCGATTTGGC CGCCGGTGAA GTGGCTGTCT TCCGTTATAT CCTGAATACC
GGGGATCAGC TCGATGTACA GCTTGAAACA CTAAATGGTG ATGCTGATAT CGTGATCTGG
TCAGGCGATG ATCCAGAAAC AGCATGGGTG AGCAATCTCG CTGTCGGTAA CGATCATATC
TACCTTACCG CACCCCTTGA TGGCTTGTAT CAGATCGAAA TCCGTGCTCA GATGAGGAGC
ACTGTACGTT TGCACATCTC GTCGATTGCA CGTGAACAAG CTTCACCACG TGCTGATGGG
AGTGCCGGAC GCGATCCCGG TAAGGAGCTG CCGAGTGAGC CGGTTGTTAG TCTGGCAGCG
ATCCCGACAA GCCTGATCAC CGATCCGGGT ACGTCTCATA CTATCTACCT TCCCATTATC
GGGCGCTGA
 
Protein sequence
MSNRTRSLFV LTAMVAVILT TMPVKATFGC QGELDEKSRL GLMPPACGPA LNEPVTTTTA 
EATMFASPLE VSVSSWPQVV AAVNLDTDSA AELPLLTDQY FAPETDRSLF VYDLQDSTLR
AVQQVAAGST PSAMAVTAGQ SGPLVAAALA GSNALAVYTG TVPLRGPHML RQTGSPDAVA
FADVNGDLWP DLAAVSPEID TITIRNPHQA GMPILLQVPF ATDGFSALRV GDLDNNGFDD
LVVLRGAGYT RDSVVILLQG SSGFPHQLTL SPETGGFLPH SLAVGDLNGD GRDDIAVVAG
GNSPNAFLSV FLQTTSGFTA LPPLPTFHIP GAVSIADVTH DGRNDIVVFH HAWRTLSVYA
QQSDGRFAEP MTKTVPYSGS RRPDSLSVAD VNGDGGLDVA LVGRQPGLTV LLNTAGAPVA
TITSPDATTR LPAGPLLVRG TMSSGTVQVE VRIKGVTDWQ PATLSASEWQ ATLTLPDAVR
PYTIEARAID SSGRVQAPPA QLRIQVAPLL IGYAVADNGE SPVPGDRLMR FDPETGAATL
IGSTGTDHIH AITFLPGTNT LLAANLDRLG TIDLTTGRFT PFPRPFGVGR NGSRTRVFTN
AHGLTPDPRD GTLFAAMRLG NGQPDLLFKL NPVTGTYIPD AFGPGKDFVE VTGVGVGDDI
DDLAFDPATN TLYAIANVDG RRDQLVTLDP VTGVATVIGS LGVENMEGLA LAPNGVLYGS
TGSSRPATRD RLWTINRTNG TASLVGPFGI ETDYEAIAFV PPRVTVPTLA TVPRLSSLMV
HNAQLTTSDL SVTIQGGIAH GDSSNLVLIT EYAFDPVQSD WQPHTGSVQL ATTYIPADTL
ANGVSWQLVR TVGAHYILAG VVNDNGQSLS LPERALINYL PPQFDLAAGE VAVFRYILNT
GDQLDVQLET LNGDADIVIW SGDDPETAWV SNLAVGNDHI YLTAPLDGLY QIEIRAQMRS
TVRLHISSIA REQASPRADG SAGRDPGKEL PSEPVVSLAA IPTSLITDPG TSHTIYLPII
GR