Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0460 |
Symbol | |
ID | 4569385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 508115 |
End bp | 511012 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639765060 |
Product | hypothetical protein |
Protein accession | YP_910942 |
Protein GI | 119356298 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0102271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGTATG CACGTTTTTT CATGACTTTC CTTGTTGCGC TCTGCATGGC TGCGAATACG GCAGGCGTGG CGTATGGCGA ACAAGGGGAG AGGAAGGTCG GCGGGCAGGT GGCTGAAAAC GCCGGTGCTC CTGTTGTACA GAACGTATTG CGGGAGCGAG CCGTAACAAG AGAGTCGGGT TTGCCCGAAT CACATGAAGA TACTGCAGAG AAAAATTTTC TGGATTATGT GCCCGCCGCA GCTTTGGTTG CATTGCTGGT TGGTGTGGGG ATTCCGATTT ACAGGAGATG GGAAAAAAAA CATAAAGACC GCTTACAGGA AAAACGATAC CGCTCATCAC TGCACGAAGA GCTTTGCTGG ATTCGGATGC CTGGACTTCC GGGTGGTATT GAGAGTATTC AGGTCAATCT GGATGATGAC ACCTTTGTTC AGCTTCGTTT TTCCGAGAGT TCGTCCGAAG GAGAGCCATC AATGTCTGAT GATCTTTTGA AGCAGCGGAA TGAGTTGAGT GCCGAGCGTT TGCCTCATGA GATTATCCAG CGGGCGTTCA CCAGTAATCG TCGGTTGCTC CTTGTTCTTG GCGATCCCGG CGCCGGCAAA ACTACACTGC TCCAGTACTA TGCCTTGTGT GCGCTTGATC AGAAGCGATA CAGAAAGCTT GGCTTTACAA AACCGCCAAA AGTATTCTAT CTGCCATTGC GCGAACTGAC CAATCATGGC TCAACACTGC CTGAAAACCT TTCGCAACAC GCTAAAAAGT GGTACCAGCA GGATATTGAC AGCAGAATTA TTGATAAATG GTTGCAGAAT GAAACGGTAA CATCACTTGT TCTCCTTGAT GGCCTTGATG AAATCAGCGA TACCGTAAAA AGGCAGGAGG CCTGCAAGTG GATCGACAGG ATTTTGACAG GCTTCCAGCA TGTTCACATT GTTGTGACCT CACGAAGGAC GGGGTATGGG AGGGAGGATG CTCTGAAGCC GCAGGTTGTG CTGACCTCTG ATCACAAACG CGCCAGTGTG CAGGATTTTT CACGGGAACA GCAGGAGCTG TTTCTGAAAA ACTGGTTCAA TGCCGCTTTT CTGCGTGAAG AGAGACCGGA TAACGTTGAT GAGAAAATCT GGAAAGAGCA GCAGCTTGCA AAAGCGAAAA ATCGAAGCGA CAGGATTATT GCCGTGCTGA AAGAGGAGAA ACATAAGGCG TTACGGCAGC TTGCCGCTGT ACCGATGATT CTTCAGTTAA TGGCACTGCT CTGGAAAAAC AATGAGTTTC TGCCGGGCAG TCGTATGGAA CTTTACCATT CCGTCCTCAA TTATATGCTC GAACTCAGGG ATAAACAGAA GGAGATCGAC TCACAGCTTT CTGCCGAACG TGCTCGCAAG GTGCTTGCCC CGGTAGCACT CTGGATGCAG GAGGAGCTGA AACAGGATGA GGCCGACAAG CAGCTCATGC AGAAGCAGAT GCAGCAATGG CTTGATACGC TTGTCGACAG ACAGTACGTT CCGCCAACTG CCGATGATTT TTGCGATCAT CTGGTGCTTC GTGCCGGTTT GCTTGCTAAG ATAGGTGAGA GCAGATATGT TTTTCGCCAC AAATCCTTCA GGGAATATCT TGCCAGTATC GAGCTTGTCA AGAAAACATT GCGTAGTACC TGCTATATCG ACACGTTGAT TTCAAGCTTT GGCGATCCCT GGTGGAATGA GCCGATGAGG TTCTTTATAG CTCAAACCGA TGCCGTGCAA TTCGATTGTT TTATGGATAA GCTTGTTGCC TCTGTTGGTG AAGAGGAGTT TTCCCCGGAC AAGATGCCGT TGCTCTACAC CCTTATCGAA GAGGCGCCTC AGAAAAAGGT TCAGGCTCTC AGTGAAAGGT TGCTCGCTGA GGCAACGACA GCAAGCTGTC AGAGGGTGAT ACTCGATTGC CTGAACGCCA TCGGTCAACC GATTGCTCTT GAGACGCTGC AGAGTTTTGT GAACCTGAAA CGTGCAAAAA ACAGTTCCGT CGCCTCCCGT GCTGAAGAGG TGATGCTTGC TCTTCTCCCT GCTGCGGGTG TAACGAGCAG TTCTGGGTCT GCTGCGGATG CACATTCAGA AAATTCAGGC TCATATCGGA ACCGGTTTGA ACAATATGCC GAGTATATAA GGATACCTGG AGGCAGTTTT CGGTACTCGG TAACAGGGGG TATTGAAAAC GTATCGGAGC TCCGCGTTGC CAGATATCCC GTAACCAACA AGCGGTATCG AGCCTTTATT TCATATCTGC AATCAAGAAA ACCGGAAGAT GAAGCCCTTA ATAAGTTTTA CCGGAAGGTA CTGGGCGAAA TTGCTCGTGA TCATACATGG GACAGCGGGT TCAGTGACTA TTACCATAAA GGCAGTAAAG ATCTTGCCGG ATTATTTCGT TCCAAACTGG ATGAAGACCG GAAATTCGGC GGTGACGATC ATCCGGTTGT CGGGGTTACC TGGTATGCGG CGAAGGGCTA CTGCCTGTGG CTCTCACTGC TCGAAAGTGA AGGACGTGAG AGGCAGCTTT ACCGGCTGCC CACAGACCTG GAGTGGGAGT TTGCTGCTGC AGGAGAAGAA GGGCGCAATT ATCCATGGGG AAATGAGGAG CCCACACCGC AGCGGGCCAA TTTCAATGAA AATATCGGCG CGACGACACC GGTTGAAAAT TATCCCGAAG GAGCGACTCC CGAGGGGCTG TACGATATGG CCGGCAATGT GTGGGAGTGG ACGAATAGCT GGTATGATGA AGTAAAAAAG GAGTCCTTTT CGTTGCGCGG GGGTTCGTGG CTCAACTTAT CTGCCAATCT GTCTTGCTCT GCCCGGATCT TCGTCAATCC GGTCAGCAGG AACTACGGTT TCGGTTTTCG TGTTGTTCGT CCCAGTCATC TTTTGTGA
|
Protein sequence | MMYARFFMTF LVALCMAANT AGVAYGEQGE RKVGGQVAEN AGAPVVQNVL RERAVTRESG LPESHEDTAE KNFLDYVPAA ALVALLVGVG IPIYRRWEKK HKDRLQEKRY RSSLHEELCW IRMPGLPGGI ESIQVNLDDD TFVQLRFSES SSEGEPSMSD DLLKQRNELS AERLPHEIIQ RAFTSNRRLL LVLGDPGAGK TTLLQYYALC ALDQKRYRKL GFTKPPKVFY LPLRELTNHG STLPENLSQH AKKWYQQDID SRIIDKWLQN ETVTSLVLLD GLDEISDTVK RQEACKWIDR ILTGFQHVHI VVTSRRTGYG REDALKPQVV LTSDHKRASV QDFSREQQEL FLKNWFNAAF LREERPDNVD EKIWKEQQLA KAKNRSDRII AVLKEEKHKA LRQLAAVPMI LQLMALLWKN NEFLPGSRME LYHSVLNYML ELRDKQKEID SQLSAERARK VLAPVALWMQ EELKQDEADK QLMQKQMQQW LDTLVDRQYV PPTADDFCDH LVLRAGLLAK IGESRYVFRH KSFREYLASI ELVKKTLRST CYIDTLISSF GDPWWNEPMR FFIAQTDAVQ FDCFMDKLVA SVGEEEFSPD KMPLLYTLIE EAPQKKVQAL SERLLAEATT ASCQRVILDC LNAIGQPIAL ETLQSFVNLK RAKNSSVASR AEEVMLALLP AAGVTSSSGS AADAHSENSG SYRNRFEQYA EYIRIPGGSF RYSVTGGIEN VSELRVARYP VTNKRYRAFI SYLQSRKPED EALNKFYRKV LGEIARDHTW DSGFSDYYHK GSKDLAGLFR SKLDEDRKFG GDDHPVVGVT WYAAKGYCLW LSLLESEGRE RQLYRLPTDL EWEFAAAGEE GRNYPWGNEE PTPQRANFNE NIGATTPVEN YPEGATPEGL YDMAGNVWEW TNSWYDEVKK ESFSLRGGSW LNLSANLSCS ARIFVNPVSR NYGFGFRVVR PSHLL
|
| |