Gene Information |
Locus tag | Cpha266_0902 |
Symbol | |
ID | 4570516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1031117 |
End bp | 1032877 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639765497 |
Product | hypothetical protein |
Protein accession | YP_911374 |
Protein GI | 119356730 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
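The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short calculation. This is a minimal sketch, not part of the database record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1031117, 1032877)
print(length, protein_length(length))  # 1761 586
```

Under these assumptions the result agrees with the reported Gene Length (1761 bp) and Protein Length (586 aa).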
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.530608 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTTG AAGAAATTAT CAGGATGCTC GATAATCCGT CAACGCTGGG CGATGCTTTG ATTGAAGCGG CGAAGTTGCC ATTCAAAGGT GACGACAGGA TACGGTTCCA GCAAAAACGA CAGCGATTCA TTGATGGATT GAGTGATCAT GAACGGCCTC AATGGGTCGG GGAGATGAAA ACTTTTTTGT CATTGTATGA ATCGACCGGT AGCGTTGATA GCGCAGTCAC ACCACTCAGT GCGTATCGCA ATGATTTCAG TACCGATATA TTTATAAGCT ACGCCCGCGA AGACATGAAA CGGGTTGAGC CAATTGTCAG GGAGCTGGAA AAACACGGCT GGAGTGTTTT CTGGGATCCT GAGATTCCAC CGGGGGAGAC CTGGCGGGGT TATATCAAGA AAAAGCTGGA TGAATCACGT TGCGTGCTCG TTGCCTGGTC TCGTCTTTCA GTCACTTCCG AATGGGTTAT CGCTGAAGCT GATGAAGCAA AAAAACGAGG CATTCTGGTT CCTGTGCTAC TCGATGCTGT TGAGCCTCCA TTCGGGTTCA GCCATATTCA TGCGGCAAAT CTTTCCTGCT GGAAAAATGA TAGTAATAGT CGAGCATTCA AGGAACTCGT GAATGCGGTC ACCCTGAAAA TTTCTTCGTC ATCACCATCT TTCATGAACT CAATATTGGG TGAGACGTCA GTTCCTATCC CTAAGACAGC ACCAGTTATT GTACCATCAA CCTCAATGCT GGAAGCACTT CGTGTAATGG TTCAAAATTG GATTTCAGTA GTTGCGAGTA TCAGAACATG GCAACCAAAA AGCCGGCAAA CAGCAATTGC TATTGTGCTG GTCATGGTGT TTATTTTTGC GGTTTTATCG TATCGTTATT TGCGTGTAAC GGGTCCTTCG TCACCATCTG TTCCGGAAAC TGTTCAGCCT GTGAACGCAT CGCCAGCAAG GCCAGGAAAC TTTGTTCTGA TACGCGGAGG TGAGTTTACA ATGGGGAGCC CGGCGAATGA ATCTGGTCAC GAGAGTGACG AGACTCAGCA TCAGGTCAAA GTGAGTGATT TTTACCTGTG CAAATATGCG GTTACCCTTG CCGAGTTTAA AAAATTCATT GAAGATTCAG GCTATCAGAC TGATGCTGAA AAAGATGGCG GCAGTTATAG TTGGGATGGA ATAAGTTGGG TGAAGAATGC TGGAGTTGAC TGGCGATATG GGGTTTCAGG CAGTGTACGA CCTCAAAGTG AAGAGAACCA TCCTGTGTTA CATGTGAGCT GGAATGACGC TGTGGCCTAT TGCAAGTGGA TATCGAAAAA AACAGGAGAT GCATTTCGTT TGCCAACGGA AGCTGAGTGG GAATATGCGT GTCGAGCAGG AACAACCACA CCGTTCCATA CCGGCGATAA CCTGACAACC GGTCAGGCGA ACTATAACGG AAACTATCCG TATACCAACA ATCAGAAAGG AGTGTATCGG GAGAACACGG TTAAGGTTGA TGAGTTTGCT CCGAACGCGT GGGGGTTATA CCATATGCAT GGCAATGTGT GGGAGTGGTG TGGCGACAGG TATGGGGATA AATATTATGA TGAATGCAAA GCCGAAGGTG TTGTTGAAAA TCCGGTTGGC CCGGAAACCG GTTCGCTCCG TGTGCTTCGT GGAGGTGGCT GGAGCTTCAA TGCGAGGAGC TGTCGGTCGG CTTTTCGCAT CGACGTCGCC CCCGACTACC GCAGCAACTA CGCCGGCTTC CGCCTGGCCT TCGTCCCGTA G
|
Protein sequence | MTVEEIIRML DNPSTLGDAL IEAAKLPFKG DDRIRFQQKR QRFIDGLSDH ERPQWVGEMK TFLSLYESTG SVDSAVTPLS AYRNDFSTDI FISYAREDMK RVEPIVRELE KHGWSVFWDP EIPPGETWRG YIKKKLDESR CVLVAWSRLS VTSEWVIAEA DEAKKRGILV PVLLDAVEPP FGFSHIHAAN LSCWKNDSNS RAFKELVNAV TLKISSSSPS FMNSILGETS VPIPKTAPVI VPSTSMLEAL RVMVQNWISV VASIRTWQPK SRQTAIAIVL VMVFIFAVLS YRYLRVTGPS SPSVPETVQP VNASPARPGN FVLIRGGEFT MGSPANESGH ESDETQHQVK VSDFYLCKYA VTLAEFKKFI EDSGYQTDAE KDGGSYSWDG ISWVKNAGVD WRYGVSGSVR PQSEENHPVL HVSWNDAVAY CKWISKKTGD AFRLPTEAEW EYACRAGTTT PFHTGDNLTT GQANYNGNYP YTNNQKGVYR ENTVKVDEFA PNAWGLYHMH GNVWEWCGDR YGDKYYDECK AEGVVENPVG PETGSLRVLR GGGWSFNARS CRSAFRIDVA PDYRSNYAGF RLAFVP
|
| |
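The sequences above are displayed in space-separated groups of 10 characters, so the spacing has to be removed before any downstream use. The sketch below shows one way to do that and to compute a GC fraction. It is illustrative only: the helper names are hypothetical, and the example runs on just the first two groups of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base groups of the gene sequence shown above.
fragment = "ATGACTGTTG AAGAAATTAT"
seq = clean_sequence(fragment)
print(len(seq), gc_fraction(seq))  # 20 0.25
```

Applying the same two helpers to the full gene sequence would give a value to compare against the reported GC content field.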