Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1487 |
Symbol | |
ID | 7267264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 1820093 |
End bp | 1823278 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643566331 |
Product | hypothetical protein |
Protein accession | YP_002462827 |
Protein GI | 219848394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGGTG ATTTCACCCG ATGGACATTC GACCGTACCA AACACTTCAG CGGTGTACTC CATCAGCAAG GGCGTGTTGC GCTCGATGCC GATTGGAACG AGCAACTCGC GATCGATCTC GCCGATGAGC GCGCAGCTCG CATCGATCTG ATCGGACGTT GCGGCGGCCC GCAGGGCCAA GCCGGTTTTG ATATCGAGGT TGCCGGCGAC ACGCTCAACA TCACTGCCGG TCGCTACTAT GTCGCCGGTA TTCGCGTCGA AAACGAGCAG ACGGTTGCCA TCACCGCTCA ACCTGATCTG CCCGTCGCCA CCTTGGCGCA GGTGGCCGGC TTAGCCGCCA ATGCAACCCT GCCCGCCGGC GTCTATCTCG CCTATCTCGA TGTCTGGGAG CGGCATGTAA CTGCACTTGA AGATGCTGCT ATCCGCGAAG TTGCGCTCGG TGGCCCTGAC ACCACGACCC GTATGCAGGT AATGGCGCAG GTCAAGCTCT TGCGTGTAGC CGATCCCGGC GCAGCCTTTG ATTGCGCTAC GTCAAACGAA GCATGGGATG CGTTGCTGGC CGGCAGTAGC GGTATGCTCG AAGTGCGCGC CGAACCCGAC CCAACCGTCA ATGATCCCTG TATCGTGCCG GCCCAAGCCG GCTATCGTGG GTTAGAAAAC CAGCTCTATC GGGTTGAAGT GCATCGGATC GTCTCGGCCA CCCGTATTGC GCTCAAGTGG TCGCGGGAAA ACGGTTCGGT GGCGGTTGGC TGGAGCGGGC AAGATCCGCT GGATTCCAAT CGCTTGATTG TCGCCAGTAC TGGCCGCGAC GAGGTATTGG GTCTCGCGCC CAACCAGTGG GTCGAGCTGA CCGATGATGC GCGTGACAAG CGGGGCGAAC CGGGTCTACT GGTCAAAGTA ATACAGGTTG AAGGCAATGT CATCACTATT GATCCAAACG GTCAGACCAT CGTCTACAGT GCATTCGGTC CCAACCCCAA ACTGCGCCGC TGGGATATGC CCGCTGATAC CGGTGAGCTT ATCGTCACCA CCAGCGCAGC AACGTGGACC GACCTCGAAC AAGGGGTGCA GATCCGGCTC AAGAACGGTA CGTTCCGTCC CGGCGACTAC TGGCTCATTC CCGCCCGCAC TGTCACCGGC GATATCGAGT GGCCGCGCTC TGGCAACCAA CCGGTGCCGC AGCTTCCGCA CGGCATCCAC CATGCCTATT GCAAATTAGC CCTGCTTGAC TTTAATGGCA GCGTGTGGGC TAAACGCAGT GATTGCCGTC GACTCTTCCC TCCCTTGACT GAGTTAATCC GTTTCTATGC AGTCGGTGGT GATGGTCAAG AGGCATTGCC CGGCGCCACC CTTGCCCAAC CGCTGCAAGT GGCCGTCACC AACGGCCAGC AACCGGTCAC CAATGCCCGT GTGCGCTTTC GGATCGTGCC CCAAACTGCT GCCGGCCAAC TCACTGCCGG CGGCGCCACC GGCAAGAGCG TTGATGTGAC TGCCGGCCCA AACGGTATCT ACTCCTGCCA GTGGCAACTC GGCCCCGCCG TCCAGAGCCA GCGCGTCGAA GCCTTCTTGG TCGAGATCGA CGGCAAACCG TTTGTCGATA GCGCCGGCGA ACCGTTGCTG CCGCGCGTCT TCTTCAACGC CAACCTCAGC CGCGCCAGCA ACGTGGCTTA CCAACCCGGT ACGTGCTCCG ATCTAGCCAA TGCCAGCACC GTGCAAGAGG CGCTCGACAT CCTTTGCCGG CGGCCCCGTG GCGGTGGCTG TTGTGTAAGC GTCGGCGACG GCGGCGAGTT TCCCAATCTT GAAACGGCCC TCAAAGAGTT GCTCGCTCGC GGCGAGCGTC GCATCTGCCT CTGCTTGCGC TCCGGCCGTC ACGCTGCCGG GAGTATCCAG CTCGATCTCC CGCCCGCCAA CGAACCGTTC GTGCTTGAAA TCAAGGGCTG TGGCCCGGCC ACCAGTGTGC GCTTTGGTGG CCCGCTGCGG TTAAGAGGGT TTGATTCGGT GGTATTGCGC AATCTCTCCA TCGACATCGC TTTCATACCT GAATCCGGCA CGGCAGCCCT CTCGCTTGAT CGTTGCCTAC ACGTGATCAT CGCCGATTGT GCGATCAGTG GCCTCACTCT GCCGGGCCAG CTCAACCAAG AGCAATTCAT TGATGGCAGC GCCCTCGTTG CCATCACCGG CAGTGATGAT GTACGGCTAA GCGGCAATCT GTTCGTGGCT GCCGTGCCGA GTCGCACTTT CTCGCTCCTT CTTGAGTTAT TCGGCGAAGC CCGGCTTGAC GTTTTTGCTA AGCTCTTCGT CGACGACGAA CGGATCATGA CCGTAGAATG GCGTCAATTG GCACTTGATG TTGTAACGGA GATAGTCCAA CTCAATCCCG AACAGCGCCA ACAATTTGCT CGTCAACTCC AAGCACAGGT GCAAGAGCGG GTAGCTGCTC TGACCTTTGT TGAGCTTCTC CAACTCCAGA AACTCCTCTT GGCTCTCAAC CAAGCCGAAT CCAATGCAAG TGCCTTGCTC GACATTCTCT TCGACCTACG TGCCGCCGCG GTCAAATCTC GTCCCGGTAC AGCGCTGATG CTTCACGAAC GACGTTCGTT CAAGGAGCAG AATCTTGGCA AGATCATCGA TCTGGTTGAT GAAGACGATC TGGTGGTGCT CGAACATAAT CGGTTTGATG GCATTGTCAG CCTCTACGGT ATGCCTGCCC CTCTTGAGCT GGTTGTGTCA TTCGCCGATA AACTTCAAAA TCCCGATCAA CAGACGAATA TCAATGAAGC CTATGTGGGT ACCGTTCAGC TTCGTAGCAA CCATCTGGTC AGGCTGAGCA TTAGCTTTGA TTTGTTGAAC GAGATTGTCC GCGATAGTGA TAGGACTGTT GTCTTCGATG TCTGCGCGCG CTTGTTGCTC GACGGCAATG TGATTGAGGG TATCGCCAAT CTCACCCTCA GCCAGCATCT GATTGCCCAA GCCAATTCGT TCACTGCCAT GGCTTCACTG CGCAGCAGCA CAAGTCCTGC CGTCACAAGT GCCGGTACTG TGGCTTCGCC GGTCTTGGGT TGGTGTCTTG CTAATGCCGC AGCCTTCATC GGTAATCAGG GGTCGGCCCA ATCGCAAAAT GGACGTCTGT TTGCTATATC GCCTATCACC GAACGGGTGG CAAACGTATT GATCCAAATC ATATAA
|
Protein sequence | MRGDFTRWTF DRTKHFSGVL HQQGRVALDA DWNEQLAIDL ADERAARIDL IGRCGGPQGQ AGFDIEVAGD TLNITAGRYY VAGIRVENEQ TVAITAQPDL PVATLAQVAG LAANATLPAG VYLAYLDVWE RHVTALEDAA IREVALGGPD TTTRMQVMAQ VKLLRVADPG AAFDCATSNE AWDALLAGSS GMLEVRAEPD PTVNDPCIVP AQAGYRGLEN QLYRVEVHRI VSATRIALKW SRENGSVAVG WSGQDPLDSN RLIVASTGRD EVLGLAPNQW VELTDDARDK RGEPGLLVKV IQVEGNVITI DPNGQTIVYS AFGPNPKLRR WDMPADTGEL IVTTSAATWT DLEQGVQIRL KNGTFRPGDY WLIPARTVTG DIEWPRSGNQ PVPQLPHGIH HAYCKLALLD FNGSVWAKRS DCRRLFPPLT ELIRFYAVGG DGQEALPGAT LAQPLQVAVT NGQQPVTNAR VRFRIVPQTA AGQLTAGGAT GKSVDVTAGP NGIYSCQWQL GPAVQSQRVE AFLVEIDGKP FVDSAGEPLL PRVFFNANLS RASNVAYQPG TCSDLANAST VQEALDILCR RPRGGGCCVS VGDGGEFPNL ETALKELLAR GERRICLCLR SGRHAAGSIQ LDLPPANEPF VLEIKGCGPA TSVRFGGPLR LRGFDSVVLR NLSIDIAFIP ESGTAALSLD RCLHVIIADC AISGLTLPGQ LNQEQFIDGS ALVAITGSDD VRLSGNLFVA AVPSRTFSLL LELFGEARLD VFAKLFVDDE RIMTVEWRQL ALDVVTEIVQ LNPEQRQQFA RQLQAQVQER VAALTFVELL QLQKLLLALN QAESNASALL DILFDLRAAA VKSRPGTALM LHERRSFKEQ NLGKIIDLVD EDDLVVLEHN RFDGIVSLYG MPAPLELVVS FADKLQNPDQ QTNINEAYVG TVQLRSNHLV RLSISFDLLN EIVRDSDRTV VFDVCARLLL DGNVIEGIAN LTLSQHLIAQ ANSFTAMASL RSSTSPAVTS AGTVASPVLG WCLANAAAFI GNQGSAQSQN GRLFAISPIT ERVANVLIQI I
|
| |