Gene Information |
Locus tag | Cphamn1_0347 |
Symbol | |
ID | 6374008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 359303 |
End bp | 360565 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642682866 |
Product | hypothetical protein |
Protein accession | YP_001958796 |
Protein GI | 189499326 |
COG category | [R] General function prediction only |
COG ID | [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif |
TIGRFAM ID | |
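The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of the database record, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Cphamn1_0347
length = gene_length_bp(359303, 360565)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1263 bp) and Protein Length (420 aa).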
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTAC TGGATAAATT AGAGATACTT TCCGGTTCGG CCCGGTACGA TGCTTCCTGT GCTTCGAGCG GCAGCAGCAG AGGCGGCGGA GGTGAGGGTA CGCTCGGCAG CACGTCAAAA GGAGGGATCT GTCATTCATG GTCTGATGAC GGTCGCTGTA TCTCCCTGCT TAAAGTTCTT TATTCCAATG ACTGCAGCTA TGATTGCGTC TATTGCGTCA ACCGCAGATC AAATCCTCAT CCCCGAACGT CGTTTACCGT GCATGAACTG GTTGAACTGA CGATCAGATT CTATCGCAGA AACTATATCG AGGGCCTTTT TCTGAGCTCC GCTGTCATGC AAAGCCCTGA ATCTACAATG GAGAGCATGG TCGAGGTGAT CCGGAAACTT CGGGTAGAGG AGTCCTTTGC CGGTTACATA CACATGAAAG TCATTCCGGG ATGCTCTGAG GATCTGGTTC GAAAGGCAGG ATTCTATGCC GATCGTCTCA GTGTCAATAT CGAGCTCCCT TCCGGTGATT CATTGAAGCT GCTCGCGCCG CAGAAACAGA GAGAGGATAT TCTCAAGCCG ATGGCTTGCC TCGGCGACGC GATCATCGCA AGCCGTAAAG AGAGAAAGAA GAACCGCAAG GCACCCGCTT TTTCTCCCGC AGGCCAGAGC ACGCAGATGA TCATCGGTGC TTCTCCCGAG TCGGATTTCA GGATTCTGAG CCTTTCGCAG GGGCTCTATA AACAAATGCA TCTCAAACGG GTCTACTATT CCGCGTTTGT TCCTGTGAAC AGTGACAACC TTCTGCCGGT TCATGCAAAA CCGCCTTTGC AGCGCGAGCA TCGTCTCTAT CAGGCCGACT GGCTGCTCCG CAACTACGGC TTCACGGCTG ACGAGATCCT TTCGGAAGAG TCTCCTTTCC TTGATGAGCA TCTTGATCCT AAAGCGTCAT GGGCGTTACG CAATCCCGGG TTTTTTCCGG TCGATGTCAA CAGGGATGGT TACTTCGCGT TGCTGAGGGT TCCGGGAATC GGTGTGACCT CCGCAAAGCG GATTGTTGCA GCGCGAAGGT TTGCGGTGAT AACTCCGGAG GGGCTGAAGA ATATCGGGGT CGTCATGAAG CGGGCAAGAT ACTTTATCAC CTGTTCCGGC AGGCCGGTGG AGCGTTTGTT CGACAGACCG GCACTTGTTC GCCGGAAACT GCTCATCGCT GAAACCGGAA AAGACCCCCG CGCACTGAAG CAGCGGCAGC TTGATTTTTT TAGTAACAAG TAA
|
Protein sequence | MDVLDKLEIL SGSARYDASC ASSGSSRGGG GEGTLGSTSK GGICHSWSDD GRCISLLKVL YSNDCSYDCV YCVNRRSNPH PRTSFTVHEL VELTIRFYRR NYIEGLFLSS AVMQSPESTM ESMVEVIRKL RVEESFAGYI HMKVIPGCSE DLVRKAGFYA DRLSVNIELP SGDSLKLLAP QKQREDILKP MACLGDAIIA SRKERKKNRK APAFSPAGQS TQMIIGASPE SDFRILSLSQ GLYKQMHLKR VYYSAFVPVN SDNLLPVHAK PPLQREHRLY QADWLLRNYG FTADEILSEE SPFLDEHLDP KASWALRNPG FFPVDVNRDG YFALLRVPGI GVTSAKRIVA ARRFAVITPE GLKNIGVVMK RARYFITCSG RPVERLFDRP ALVRRKLLIA ETGKDPRALK QRQLDFFSNK
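The GC content field in the record (53%) can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as a string in the space-grouped layout shown above; `gc_fraction` is a hypothetical helper name, not a function of any database tool. The example uses only the first two 10-base groups of the gene sequence, so its result is not the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring the whitespace that groups the bases."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base groups of the gene sequence (not the full gene)
prefix = "ATGGATGTAC TGGATAAATT"
print(f"{gc_fraction(prefix):.2f}")
```

Passing the full gene sequence string to the same function gives a value that can be compared with the GC content listed in the record.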