Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0636 |
Symbol | |
ID | 4569790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 722346 |
End bp | 725171 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639765234 |
Product | hypothetical protein |
Protein accession | YP_911115 |
Protein GI | 119356471 |
COG category | [R] General function prediction only |
COG ID | [COG1483] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTGA AACCCTGGCG TGAAATCATT ATACCTCACT CGGATGTGCT GAATGGCATC TTTCAGCAAT CCGAGTTTGC CGCAGACTTG ACCGCTGTTT GGACGGGAAG GGCAACTCCG GAATATGGTG ATGCCAATGC TTTTTATGAG CGAACCTTTA TCACAGAAGG TATGGGTAAG CTGCTCATCC AGGTATCACA GCGACTCAAT AGCAAGGGTG GTGAGCCGGT CATTCAGCTT CAAACCTCCT TCGGTGGTGG AAAAACGCAT ACGATGCTTG CTGTTTACCA TCTGGCTACT CGCCACTGTG CACTTGGTGA CCTCTCCGGT ATTCCATCAC TGCTCGATCA AGCGGGGGTA ATAGATGTAC CGCTGGCAAA GGTAGCCGTA TTGGATGGTA CTGCCCACTC TCCAGGTCAA CCTTGGATGC ACGGCGATCA GGCGGTGTGC ACGCTTTGGG GGGAACTCGC TTGGCAACTT GGTGGCAGGG AAGGGTTTGC TTTGGTGCAA CAGACTGATG CCAACGGAAC TTCACCGGGA AAGGATGTGC TTTGCACGTT GCTGACCCGG TTTGCACCGT GCGTTGTACT GCTGGATGAA CTGGTTGTCT ATATCCGTAA TTTTGTTGAG AGCCAGCCGC TGAGTGGTGG TACCTACGAT AGTAATCTTT CCTTCATTCA ATCACTGACT GAAGCAGCCA AACTGGTGCC AAATGCTGTG GTTCTTGCTT CACTACCCGA ATCAAATAGC CAAGCTGGTG GACCGAGGGG TGTTGCTGCA TTGCAGGCGC TGGAATCGGT TTTTAATCGT GTGCAGGCCC TATGGAAGAC CGTGGCGCCA GAGGAGGCTT TTGAAATTGT TCGGCGACGT TTGTTTGAAA CAATTCGGGA TATCAATGGC CGTGATGAAG TTTGTCGTGC CTTTGCTGAC GCCTATATCG CTGAGGGTGT CAAGGTACCA CAGGAAACTC AGGAAGCCCG TTATTATGAT CGGATGGTGC AAAGCTATCC TATCCATCCG GAAGTATTCA CGCAGCTCTA TGAAGAGTGG ACAACCATAG AAGGATTTCA GCGTACACGT GGCGTCCTGA AATTGTTGGC CAAGGTTATC TATCGGTTGT GGCAGGATAA TAATAAGGAT TTGATGATTC TTCCTGCCAG TATTCCTCTT TATGACGGGA GCGCGCGTAA TGAATTGATC TACTATCTTG GACCTGGTTG GGATCCGGTG ATTGATCGGG ATATTGATGG TGAGCGGGCT GAGACCACTT TCATTGAGGC TAATGAGACT CGCTTTGGCT CCGTTCAGGC TGCTCGCCGC GTTGCCCGGA CGGTCTTTCT TGGCAGTGCG CCATCCTCCG TTACCATGAA ACCCGGAATA CGAGGTCTTG ATCGAGCGCG AGTACTGCTG GGGTGTCTCC AGCCGGGGCA AACATCCTCT CTTTATTCCG ATGCTCTTAA CCGACTTGCA AACCAACTGC ACTATCTGAA CTCCTCCGGA GACAGGGCTC AGGAGGCGAC CCGCTATTGG TTCGATATAA GGGCAAATTT ACGGCGGGAG ATGGAGGAAC GGAAGATGCG GTTTGACGAT AAAAACGAGG TGCGTGGTCG TATGGCCGAG GTCCTCAAAA AGCTTGTCGG CAGTGCCTCA TTTTTTGATG GTACCCATAT ATTCACTCCC CACAGTGACG TGCCGGATGA TAGTTTTCTT CGACTTGTTG TGCTTTCACC AGAGCAGTAT TATTCACGGG AAGAGTCGCG ATTTGCTTTT GATGGTGTGC TCGATCATGT CCGGAACAAT GGGGCAAAGC CTCGATATCG CGGTAACCGG CTGATTTTTC TTGCTCCTGA TCATGGTGCT CTTGCCCGGC TCCGCGATTC CATTCGTGTT GCTCTGGCCT GGAACTCCAT TGTGGAAGAT GTGGCAGCAA TGCGTCTGGT GCTTGACAAT CTTCAGGCAC AGCAAGCGAA GAAGGAACTG CAGGGAGCGG AAGATGTGCT GCTTCGTGTT GCACGGGAGT GCTACAAGTG GCTGCTCTGC CCTGCGCAGC ATAACCACAC GGATGCCAAA CCAATTGTGG AGGTTTATCC GCTTAACACG GTTGGCGCAT CGCTTGGCTC AGAGATTGAG CGAGTATGTA TTGACACTGA ACTGGTCATC ACTACATGGT CACCGATACA TCTGCGAGAT GAATTGAAAA AGCTCTATTG GAAGCCTGAT AAGTCGTTTT GTAGCGCCAT GGAGTTTTGG GAAGATACGT TGCGCTATCT CTTTTTACCA CGCTTGAAAA CATGTAGCGT TCTGGAACAG GCGATTATCA AGGGTGCGGG AAGCAAGGAT TTCTATGGTA CGGCATACGG ACTCTATGAA GGGGTGTTCG AGGGCTTCAA ATTTGGCGAT GCCAACGTTC AGCTCGATGA TACGTTGCTC CTGATCGAAT CCGGAGCTGC AAAAAAATAT GAAGAGGAGC ATGCGCCAAA ACCGCCTTTG GTACCTTTGG CAGGCATTTT GGGCGAAGGT ACACCCCCGC TAAATAATCC TGCCGGTTCT TCTCAGGTGG GCCCCGGAAC AATACCACCA ACAACAGGCG TAACGAAGTC GAAAACATTT ATCGGCACCG CTGACGTGAG TGCAGCCACT GCCAGAATTC GTCTACTTGA GATTGCTGAA GAGATAATCA GTGTCCTTGC GAGCGATCCG ACGGCGAAAA TACAAGTCAG TGTTGAGATT AGTGCCGATT TCCCTGAGGG AGTATCCGAT CAGATCAAAC GAGCTGTATC TGAAAATGCT TCAAGTCTGG GATTTAAGAA TAAGACTTGG GAGTAG
|
Protein sequence | MSLKPWREII IPHSDVLNGI FQQSEFAADL TAVWTGRATP EYGDANAFYE RTFITEGMGK LLIQVSQRLN SKGGEPVIQL QTSFGGGKTH TMLAVYHLAT RHCALGDLSG IPSLLDQAGV IDVPLAKVAV LDGTAHSPGQ PWMHGDQAVC TLWGELAWQL GGREGFALVQ QTDANGTSPG KDVLCTLLTR FAPCVVLLDE LVVYIRNFVE SQPLSGGTYD SNLSFIQSLT EAAKLVPNAV VLASLPESNS QAGGPRGVAA LQALESVFNR VQALWKTVAP EEAFEIVRRR LFETIRDING RDEVCRAFAD AYIAEGVKVP QETQEARYYD RMVQSYPIHP EVFTQLYEEW TTIEGFQRTR GVLKLLAKVI YRLWQDNNKD LMILPASIPL YDGSARNELI YYLGPGWDPV IDRDIDGERA ETTFIEANET RFGSVQAARR VARTVFLGSA PSSVTMKPGI RGLDRARVLL GCLQPGQTSS LYSDALNRLA NQLHYLNSSG DRAQEATRYW FDIRANLRRE MEERKMRFDD KNEVRGRMAE VLKKLVGSAS FFDGTHIFTP HSDVPDDSFL RLVVLSPEQY YSREESRFAF DGVLDHVRNN GAKPRYRGNR LIFLAPDHGA LARLRDSIRV ALAWNSIVED VAAMRLVLDN LQAQQAKKEL QGAEDVLLRV ARECYKWLLC PAQHNHTDAK PIVEVYPLNT VGASLGSEIE RVCIDTELVI TTWSPIHLRD ELKKLYWKPD KSFCSAMEFW EDTLRYLFLP RLKTCSVLEQ AIIKGAGSKD FYGTAYGLYE GVFEGFKFGD ANVQLDDTLL LIESGAAKKY EEEHAPKPPL VPLAGILGEG TPPLNNPAGS SQVGPGTIPP TTGVTKSKTF IGTADVSAAT ARIRLLEIAE EIISVLASDP TAKIQVSVEI SADFPEGVSD QIKRAVSENA SSLGFKNKTW E
|
| |