Gene Information |
Locus tag | Cagg_3577 |
Symbol | |
ID | 7269721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4348878 |
End bp | 4351946 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643568385 |
Product | FG-GAP repeat protein |
Protein accession | YP_002464851 |
Protein GI | 219850418 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
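The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts a terminal stop codon that is not translated; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4348878, 4351946)
print(length)                     # 3069
print(protein_length_aa(length))  # 1022
```

Under these assumptions the computed values match the listed gene length (3069 bp) and protein length (1022 aa).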
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00287898 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGAACC GAACACGTTC TTTGTTCGTG CTCACGGCCA TGGTAGCGGT GATACTGACC ACAATGCCGG TTAAGGCTAC CTTTGGCTGC CAAGGCGAGC TTGATGAGAA GTCGCGGTTG GGGTTGATGC CACCGGCATG CGGCCCGGCT CTCAACGAAC CGGTAACGAC CACGACGGCT GAGGCGACAA TGTTTGCCTC CCCGCTTGAG GTGTCGGTGT CCAGTTGGCC GCAAGTGGTG GCAGCGGTAA ACCTTGATAC CGACTCGGCT GCCGAGTTGC CGTTGTTGAC CGATCAGTAT TTTGCACCAG AGACCGATCG GTCGTTGTTT GTCTACGATC TACAAGATAG CACATTGCGC GCGGTGCAGC AGGTGGCAGC CGGTTCTACA CCGTCGGCTA TGGCAGTCAC CGCCGGTCAG TCTGGCCCTC TCGTGGCCGC AGCGTTAGCC GGAAGCAATG CGTTGGCGGT GTATACCGGC ACAGTGCCTT TGCGTGGTCC GCATATGCTG CGCCAGACCG GTTCGCCTGA TGCTGTGGCC TTTGCCGATG TAAATGGCGA TTTGTGGCCC GACCTTGCAG CGGTTAGTCC AGAAATTGAT ACGATCACGA TCCGTAATCC GCATCAAGCC GGTATGCCGA TCCTGTTGCA GGTACCCTTC GCCACCGATG GGTTTAGTGC GTTACGGGTT GGTGATCTCG ATAATAATGG GTTTGATGAT CTGGTCGTGT TGCGTGGCGC CGGTTATACG CGCGATTCGG TGGTGATTTT GTTGCAAGGG AGTAGTGGTT TTCCCCATCA ATTAACGCTG AGTCCGGAAA CAGGTGGTTT TTTACCGCAT AGTCTGGCAG TTGGCGATCT GAATGGCGAT GGCCGTGATG ATATTGCCGT GGTAGCCGGT GGCAATAGTC CAAACGCTTT TCTTAGTGTG TTTCTGCAAA CAACGAGTGG TTTCACCGCT TTGCCACCGT TGCCGACATT CCACATCCCC GGTGCCGTGT CAATTGCCGA CGTTACCCAC GACGGGCGGA ATGATATAGT GGTGTTTCAC CATGCATGGC GTACATTGAG TGTCTATGCT CAACAGTCTG ATGGTCGTTT TGCCGAGCCG ATGACGAAAA CGGTACCGTA TAGTGGATCG CGTCGGCCTG ACTCGTTAAG TGTGGCCGAT GTGAATGGTG ATGGTGGTTT AGATGTGGCC CTGGTGGGTC GCCAACCCGG TCTGACAGTG CTGCTCAATA CGGCCGGTGC GCCGGTAGCG ACGATCACGT CACCGGACGC AACGACACGC TTGCCGGCCG GTCCACTGCT TGTTCGGGGC ACGATGAGTA GCGGTACTGT GCAGGTTGAG GTGCGGATTA AGGGGGTCAC TGATTGGCAA CCGGCAACGT TGAGTGCAAG CGAGTGGCAG GCTACGCTAA CCTTGCCTGA TGCAGTACGG CCATATACGA TTGAAGCACG AGCAATTGAT AGTAGTGGCC GAGTGCAAGC ACCGCCGGCA CAGCTACGGA TACAGGTCGC GCCATTACTG ATCGGTTATG CAGTGGCCGA TAACGGCGAA TCGCCTGTCC CCGGTGATCG CCTGATGCGG TTTGATCCGG AGACCGGTGC AGCGACGCTG ATCGGCTCGA CCGGTACCGA TCATATTCAC GCCATCACGT TTTTGCCCGG TACTAATACG TTGTTGGCGG CCAACCTTGA TCGTCTCGGT ACGATCGATC TCACCACCGG CCGCTTTACC CCATTCCCGC GCCCGTTTGG TGTCGGACGC AACGGGAGCC GGACGCGCGT CTTCACTAAT 
GCCCATGGGT TGACCCCCGA TCCGCGCGAT GGTACGCTCT TTGCCGCAAT GCGGTTGGGA AATGGGCAGC CTGATCTGTT GTTCAAACTC AATCCGGTTA CCGGTACGTA TATACCTGAT GCGTTTGGGC CGGGTAAGGA TTTTGTGGAG GTAACCGGTG TTGGTGTTGG TGATGATATT GACGATCTGG CCTTTGACCC CGCTACCAAC ACGCTATACG CCATTGCAAA CGTTGATGGT CGCCGAGATC AACTTGTGAC GCTCGATCCC GTTACCGGTG TTGCAACAGT GATCGGTAGT CTTGGGGTTG AGAATATGGA AGGGTTAGCT CTCGCGCCAA ATGGTGTACT GTACGGTAGT ACCGGTAGCT CACGGCCGGC AACGCGCGAT CGACTATGGA CGATCAATCG TACTAACGGA ACGGCCAGTT TGGTCGGGCC GTTTGGGATC GAAACTGACT ACGAGGCAAT TGCTTTCGTA CCGCCGCGAG TGACCGTTCC TACCCTTGCC ACAGTGCCGC GGTTGAGTAG TCTCATGGTA CATAATGCTC AATTGACGAC TTCTGACTTA TCGGTCACCA TTCAAGGTGG GATAGCTCAC GGTGATTCGT CGAATCTCGT ACTCATCACC GAATATGCGT TTGATCCGGT ACAGAGCGAT TGGCAACCGC ATACCGGTAG TGTGCAACTG GCAACGACCT ATATTCCGGC AGATACGCTG GCTAACGGTG TGTCGTGGCA ATTAGTACGC ACTGTTGGAG CGCATTATAT TCTCGCAGGA GTCGTCAATG ATAACGGTCA ATCACTATCT CTACCTGAGC GTGCTCTCAT CAACTATCTG CCGCCACAAT TCGATTTGGC CGCCGGTGAA GTGGCTGTCT TCCGTTATAT CCTGAATACC GGGGATCAGC TCGATGTACA GCTTGAAACA CTAAATGGTG ATGCTGATAT CGTGATCTGG TCAGGCGATG ATCCAGAAAC AGCATGGGTG AGCAATCTCG CTGTCGGTAA CGATCATATC TACCTTACCG CACCCCTTGA TGGCTTGTAT CAGATCGAAA TCCGTGCTCA GATGAGGAGC ACTGTACGTT TGCACATCTC GTCGATTGCA CGTGAACAAG CTTCACCACG TGCTGATGGG AGTGCCGGAC GCGATCCCGG TAAGGAGCTG CCGAGTGAGC CGGTTGTTAG TCTGGCAGCG ATCCCGACAA GCCTGATCAC CGATCCGGGT ACGTCTCATA CTATCTACCT TCCCATTATC GGGCGCTGA
|
Protein sequence | MSNRTRSLFV LTAMVAVILT TMPVKATFGC QGELDEKSRL GLMPPACGPA LNEPVTTTTA EATMFASPLE VSVSSWPQVV AAVNLDTDSA AELPLLTDQY FAPETDRSLF VYDLQDSTLR AVQQVAAGST PSAMAVTAGQ SGPLVAAALA GSNALAVYTG TVPLRGPHML RQTGSPDAVA FADVNGDLWP DLAAVSPEID TITIRNPHQA GMPILLQVPF ATDGFSALRV GDLDNNGFDD LVVLRGAGYT RDSVVILLQG SSGFPHQLTL SPETGGFLPH SLAVGDLNGD GRDDIAVVAG GNSPNAFLSV FLQTTSGFTA LPPLPTFHIP GAVSIADVTH DGRNDIVVFH HAWRTLSVYA QQSDGRFAEP MTKTVPYSGS RRPDSLSVAD VNGDGGLDVA LVGRQPGLTV LLNTAGAPVA TITSPDATTR LPAGPLLVRG TMSSGTVQVE VRIKGVTDWQ PATLSASEWQ ATLTLPDAVR PYTIEARAID SSGRVQAPPA QLRIQVAPLL IGYAVADNGE SPVPGDRLMR FDPETGAATL IGSTGTDHIH AITFLPGTNT LLAANLDRLG TIDLTTGRFT PFPRPFGVGR NGSRTRVFTN AHGLTPDPRD GTLFAAMRLG NGQPDLLFKL NPVTGTYIPD AFGPGKDFVE VTGVGVGDDI DDLAFDPATN TLYAIANVDG RRDQLVTLDP VTGVATVIGS LGVENMEGLA LAPNGVLYGS TGSSRPATRD RLWTINRTNG TASLVGPFGI ETDYEAIAFV PPRVTVPTLA TVPRLSSLMV HNAQLTTSDL SVTIQGGIAH GDSSNLVLIT EYAFDPVQSD WQPHTGSVQL ATTYIPADTL ANGVSWQLVR TVGAHYILAG VVNDNGQSLS LPERALINYL PPQFDLAAGE VAVFRYILNT GDQLDVQLET LNGDADIVIW SGDDPETAWV SNLAVGNDHI YLTAPLDGLY QIEIRAQMRS TVRLHISSIA REQASPRADG SAGRDPGKEL PSEPVVSLAA IPTSLITDPG TSHTIYLPII GR
|
| |
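The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that step, with a GC-fraction helper and a basic completeness check; the helper names are hypothetical, and the stop codon set is the one assumed for translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons assumed for translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    seq = clean_sequence(raw)
    return len(seq) >= 6 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence listed above.
example = "ATGTCGAACC GAACACGTTC"
print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # 0.5
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record; the short fragment here only illustrates the cleaning step.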