Gene Information |
Locus tag | Cagg_3357 |
Symbol | |
ID | 7267097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4068950 |
End bp | 4070950 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643568166 |
Product | hypothetical protein |
Protein accession | YP_002464637 |
Protein GI | 219850204 |
COG category | [S] Function unknown |
COG ID | [COG4412] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
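The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. A minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are hypothetical, chosen only for illustration):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon and no splicing."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4068950, 4070950)
print(length)                  # 2001
print(protein_length(length))  # 666
```

Both values agree with the Gene Length (2001 bp) and Protein Length (666 aa) reported in the record.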
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00235574 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000266176 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGACGAT TCCTGCTTAT TATTGCTCTC ACCCTGATCA CCGGCTGTAT GCCATCGGTC GAGAGACCAC CGGAAGACTA CCGACCTACG GCGACACCAC GATCCGCTAC CGGTACCGCC CACGCTCCGC CACCACCACC GACTCCGGTA GCGATGGTTA CTCCCGCACC GAACAACGAC GTGAATGAAA TCGCGCTGAT CGATGCGGCT GCTCGTCTGC CGCGCGATCA AGTTGAGCTA GCCCGCCAAC TCGGCGCGTG TCGCCCGGCA CCAGAGGAAT GCCTGTATGT AGCGCGTACT ATCCCCCCTG ACGTGCAGCT TGGCGAACGC CGATCTTTTT CTGTAACCGA TTTTAGTAAC GATAGCCAAT ACGAAATCAC TGCTGATCTA CGCTACATTG GGCCGGTGGT GTTGATGTAT GTCGAAACCG GCGTACCTTT TGATCAGGGC GCATTGGAAC GTGCTGCGCG TACCTTTGAA CAAGAGATTT ACCCGCGTAC CCGCGAGATC TTCGGCAGTG AGGCACAACC CGGGGTTGAC GGTGACAACC GGATTACCAT TCTGAACGCG GTAGAGCGCA GTCGTCAGAT CCTCGGTTAT TATTCGTCGA GCGACTCATT ACCGAAACAG GTCAACCGCT ATAGCAATGA GCGTGAGATG TTCTTTATGA ACATCGAGCT GATGCCTTTC GATAGCGATA CCTACCTCGA CGTGCTGGCC CACGAATTTC AACACATGAT CCATCAGCAC GAACAGCCGG GCAGTGCCTT GTGGCTCAAC GAGGGAATGT CACAATTGGC CGAAGACCTC AACGGCTTTC AGAGCGAAGG CTTCATTCCG CTCTATCTGC GCAATACCGA CATTCAATTG ACCGGGTGGG GCTTTGCGCC CGGCCAGTCA GGCGTGCATT ACGGCGCTGC TCACCTCTTT ATGCGCTACA TCTATGCCCA ATACGCCGGC AAAGACCAAT TACGCTCGTT GATTCGGGCC AACGCCGGTA ACAATCTCGA AGCGTTCGTC GAGTTAGCCG CCCGTGTTCG ACCCGATATT ACGCACTTTC GGCAGATCAT GGCCGATTGG GCCGTCGCCA ACTTACTTAA CGACCCACGG GTAGGCGATG GTCGTTACAC CTACGATACC GGTACCGAAT TGAGGAATCT CCTGCCGCAC ACAGTACGCC CAACGCCGGT CGAGCGGCGG CATCAAGACG ATATTGTGCA GTTCGGCGTT GATTACCTTG CATTACCGGC GAACGCGCGC TCGATAACCT TCCGTGGCGA CACTACCGTG CGCATTGCCG GACAGATGCC ACAGGGACGC TACGCCTGGT GGAGCAACCG TAGTGATGAT AGCATTGCAA CGCTCACGCG CAAGATTGAT CTACGTGGGG TCAGTTCAGC GACACTCACC TTCGACACGT GGTTCGAGAT CGAAGACGAT TATGACTACG CTTTCGTCAC TGTTTCGACT GACGGCGGGC GGACGTGGGA GACCCTACCC GGCAAGTGGA CGACCGACTA TGATCCACAA GGCGTGAATT ATGGTCACGG TCTGACCGGT GTTTCGGGGA GACCGGAGGC CGACGTTGAA GACGGCTTGC GCGGGCGTTG GGTCAACGAG CGGATGGATT TAACCCAGTT TGTCGGCCAA GAAGTGTTGT TGCGATTTTG GTCGATTAAT GATCAGGGAG TACATGCACC TGGTATCTTG ATTGATAACA TTACCATCCC CGAAATCGGT TTTCGTGATA CCGTCGAAGA GGGGGAGAAT GGCTGGGAAG CAGCAGGATT TGTGCGGGTC 
GATGGCGATC TGCCGCAGCA ATGGGATTTG CACCTGGTGC GCACGGCGGC CAATGGACAG ATCACCGTTG AGGCATTGCC GGTTGATGAA GACGGTATCG CAACGGCAAC GCTGAATGAC GGTGAGCGGG GGGTATTGGT GGTGATTGCC GTCACGCCGC ACACGAGTGA ACGGGTGCAA TACGAGGTTA TCAGCGAATA G
|
Protein sequence | MRRFLLIIAL TLITGCMPSV ERPPEDYRPT ATPRSATGTA HAPPPPPTPV AMVTPAPNND VNEIALIDAA ARLPRDQVEL ARQLGACRPA PEECLYVART IPPDVQLGER RSFSVTDFSN DSQYEITADL RYIGPVVLMY VETGVPFDQG ALERAARTFE QEIYPRTREI FGSEAQPGVD GDNRITILNA VERSRQILGY YSSSDSLPKQ VNRYSNEREM FFMNIELMPF DSDTYLDVLA HEFQHMIHQH EQPGSALWLN EGMSQLAEDL NGFQSEGFIP LYLRNTDIQL TGWGFAPGQS GVHYGAAHLF MRYIYAQYAG KDQLRSLIRA NAGNNLEAFV ELAARVRPDI THFRQIMADW AVANLLNDPR VGDGRYTYDT GTELRNLLPH TVRPTPVERR HQDDIVQFGV DYLALPANAR SITFRGDTTV RIAGQMPQGR YAWWSNRSDD SIATLTRKID LRGVSSATLT FDTWFEIEDD YDYAFVTVST DGGRTWETLP GKWTTDYDPQ GVNYGHGLTG VSGRPEADVE DGLRGRWVNE RMDLTQFVGQ EVLLRFWSIN DQGVHAPGIL IDNITIPEIG FRDTVEEGEN GWEAAGFVRV DGDLPQQWDL HLVRTAANGQ ITVEALPVDE DGIATATLND GERGVLVVIA VTPHTSERVQ YEVISE
|
| |
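The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it assumes the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-"), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base blocks
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGACGAT TCCTGCTTAT TATTGCTCTC"))  # MRRFLLIIAL
```

The output matches the first ten residues of the protein sequence in the record. Translating the final 21 bases (`TACGAGGTTA TCAGCGAATA G`) in the same way gives `YEVISE`, the last six residues, followed by the TAG stop codon.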