Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_1472 |
Symbol | |
ID | 7269306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 1806095 |
End bp | 1808992 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643566314 |
Product | hypothetical protein |
Protein accession | YP_002462813 |
Protein GI | 219848380 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00440571 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGCGC CGGCGTTGCG GACTCAGCAG GAACGAGTCC ATGCCTGCTT GCGCTCCGCT ATTGAGCGAC CGGCGTTGCT CGAAGCTGTC TGCACGATGC TGCGCGAACA ACATGGCGTG ATTGCTCTCG ACGCACCAAT GGGGAGCGGA GCGACGACGT TGCTCGCCCA GTTGGCAGTG CGCGCTGAGT GGCCACTCTG GCTGGCCGAT GATGATGATG GCGGCGGGGC ACTGGCATTC TACGCCCAAA TCGCCGCACT ACGCCGCCCT TCTTTGCCTT TGGTCGATCC TGCTGCATTA ACCGACCCTG CCACTTTTGA ACGGCTTTTG GCCGAAGTCG TCGATCCGAA TCGACCACTG GTGCTACTGG TCAGCGCGCC TAACCGTGAT CGACAGCCCT TGCGCTCACT GCCGTTACCA CTACCGCTCG ATCTGCCTGC CGGTGTGACA CTGCTCGTCC ACGGTTCATT GCCAATCGAG CCTGATGCGC GAATTGTCTT GCCAAAGGCC GATAGTGCCC TTTTCCAAAC CCAAGCGTCG TTGCTCGAAC GCCGTAATTG CCCACCAGGC TGGCGCAAAC CGCTTGTTTT GGCAGCCCGC GGTAATCTGC TCTATCTGAG TTGGGCTGAA CGCTGGCTCC ACCTTGGTTT GCTTGATGTG GCAAATTTGC CACCCGATCT CGATCATCTG TTGCAGCAGT GGTGGCAGTC GCTTAGTCGA ACCGAGCAAC GATTGGCCGT TTTGTTGGCC GCCGCGGGTG AACCGTTGCC GATGACGGTG TTAGCCGAAG TTAGTGCCGA ACACCCACAT CTCATTCTCG ACCGCTGGGA AGAACAGGGT CTCGTCCATA TCAATCTACG CCGACTGAGT GAGGATGATA CGATTTTGCT CGTGCGCTAT GCGCATCGCG CGGTGCGCTT GTTTTTGGCC CGTCACGCTG CGCACGAGAT GAATGCTGCA CACGGAGAGC TGGCCCGTTG GTACGCCGAA CGGCTCAAAC AAAACCCGCT CGATTTGACG AACCGTTATC TAGGACGCCA ATTGGCCCGC CACACAGCGT TATGTCCTCC TGCCCAGCGT CCGGCTCATT TGCCTACGGC CAACCCAACA ACTTGGTTGC GTGAACGAGA ATTGCGTGAA GGAATAGCCG GCGCGCTGCG GGATGCCGGT TGGATGCTGT ACGATGCCGC AGCCGGTTCG CCGTTGGATT TAGCGCGGAT TGCTGCCATC ACCGGTACAC TTGCTACCCG CGCGCGGCAA CTTACCGGCG ATGTTGTGGT TGCTGCCTTT CTCACCGCTG TTCAAACCGG TGGACGTGAA GGGAGTTTGC GCCGGGTAAC GGCGATCGTT GAGCAGTTGC CCGATGGTGT CCCAAAAGCG GCTGTGTTGC GCCAGCTCGG TGAGGCGTGT TATAGCGTTA ATATGCGTAG CGCCGCGATG CGTCTCCTTT CTCGGGCACT TGACCTCGAA GCACAGCCGG TTTCGCGGGC TTGGCGTGAC GTTCGTGATC AAGCTATCGA GGCGCTGGCT ACTGCTTGCT TGATAGCCGG TGATGTTGAT CGGGCTTTAG CCTGTGCGGA GTTGATCGAT CTACTCGAAC GCCGTGCCCA GGTTGAGACG TTGGTGATAC GACGCTTACT TGAAGATGGT CAGTACGACC GGGCATGGCG TTTGTCACGC TCCATTCTGC ACGAAAATCG GGCTGCGTGG GCGCAGGCCG AGGTGGCGGT TGCCTTAGAA CGGATCGGTG ATCCGCGCGG TGCGATGATG TTGGACGAGC TGAAGGTAGA GACTGCACGC GCCTGGGCCG AGATTGAGCT GGCTTGCGAG GTGGCCTTGC GTGATGAAGA GGCTGCGTTG CGCCGAATTA TGGCGTTACC CGGCCAACAT CAGCGTGATC GCGGTTTAGC TCGCCTGGCC CGCGTCTTTG CACACGCCGA AAAGGATGGT GATGCATTGG CAGCGGCTGA GCGGATCAGC AATCGTGAGT TGCGTGTGAC GACGTTGCTC GAGTTGCGCG TGTTGTTGCA AGGGTTGGTT GCTAATCTTG CTACCGAACG AGCAACGCGC GAGATAGATG CCCTCCAAGG CGAGGATCGT CCGATATTGT TGGCTGCTCT TGCTTCGGCC CATGCAGCGA TTGGTCGTAA GGATCGGGCA TTGGCGATAG CCAATCAGTT ACGCGGGGAA GAACTGGAAC GGGCCTTGTC GCGGGTTGCG GTTGCTTGTG TACAGGCAGG TGATTATGCC GGAGCGCAGG CTGTGTTGGC CCAGATGACC GATGACGATG AGCGAGATTG GGCACGCGAT GAGATTGCGC GCACGTTGGC TTCTATCGGT GATTGGGAAT CGGCAATGGC GCAGGCAATG GCGATTGTTG CTGCCGATCA GCGTGCGCGT ACTAGCGCCG ATTTGGCCAT TACTCGCGCG CGTTCCGGTG ATGTACTCAC CGCTGTATCA ATGATTCGTG CGATCGAGGT GCCTGCTGAG CGGGGACGTG CCTTAGTGCT GATCGCACCG TTGTTAGCAA CGACCGATGC CACGCTGGCC GACCAACTGG CCGATGAGCT GCTGATCGGT GAGGTACGTA GCCGGTATCG TGCAGCCCTG GTGGTAGCCC TCGCCGAACG CGGTGAGTTG GCGACTGCGG CTAAGATCGC CCGTCGCATC CGCCGCCGGA ATGAGCGGGT ACGCGCCGAA CTGGCAATTA TTGTGGCCCT TGATCCTACC GATCCCATGA CCTTGGCGCG CTTGGCAACA ACATTGGCAA AGGCCGCGGT GGGGCGTGAA GAGATGTTTC ATGCACTTGA GCTGGTCATC CCTCTCTTGC AACGAATCGG TGGGACCCCG TTGCTGGCCG ATCTGGCGAC GGCGATCGTT GCCGATGATC GGGCGTAG
|
Protein sequence | MVAPALRTQQ ERVHACLRSA IERPALLEAV CTMLREQHGV IALDAPMGSG ATTLLAQLAV RAEWPLWLAD DDDGGGALAF YAQIAALRRP SLPLVDPAAL TDPATFERLL AEVVDPNRPL VLLVSAPNRD RQPLRSLPLP LPLDLPAGVT LLVHGSLPIE PDARIVLPKA DSALFQTQAS LLERRNCPPG WRKPLVLAAR GNLLYLSWAE RWLHLGLLDV ANLPPDLDHL LQQWWQSLSR TEQRLAVLLA AAGEPLPMTV LAEVSAEHPH LILDRWEEQG LVHINLRRLS EDDTILLVRY AHRAVRLFLA RHAAHEMNAA HGELARWYAE RLKQNPLDLT NRYLGRQLAR HTALCPPAQR PAHLPTANPT TWLRERELRE GIAGALRDAG WMLYDAAAGS PLDLARIAAI TGTLATRARQ LTGDVVVAAF LTAVQTGGRE GSLRRVTAIV EQLPDGVPKA AVLRQLGEAC YSVNMRSAAM RLLSRALDLE AQPVSRAWRD VRDQAIEALA TACLIAGDVD RALACAELID LLERRAQVET LVIRRLLEDG QYDRAWRLSR SILHENRAAW AQAEVAVALE RIGDPRGAMM LDELKVETAR AWAEIELACE VALRDEEAAL RRIMALPGQH QRDRGLARLA RVFAHAEKDG DALAAAERIS NRELRVTTLL ELRVLLQGLV ANLATERATR EIDALQGEDR PILLAALASA HAAIGRKDRA LAIANQLRGE ELERALSRVA VACVQAGDYA GAQAVLAQMT DDDERDWARD EIARTLASIG DWESAMAQAM AIVAADQRAR TSADLAITRA RSGDVLTAVS MIRAIEVPAE RGRALVLIAP LLATTDATLA DQLADELLIG EVRSRYRAAL VVALAERGEL ATAAKIARRI RRRNERVRAE LAIIVALDPT DPMTLARLAT TLAKAAVGRE EMFHALELVI PLLQRIGGTP LLADLATAIV ADDRA
|
| |