Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0706 |
Symbol | |
ID | 6374371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 742704 |
End bp | 743873 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642683218 |
Product | protein of unknown function DUF1016 |
Protein accession | YP_001959144 |
Protein GI | 189499674 |
COG category | [S] Function unknown |
COG ID | [COG4804] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATA TGATCCAGCC GGAAGGCAAT TCCCTTTTTG ACCGTGTTGT CTCCATTCTT GAACAGGCAA GGGGAAATGT GCTGCGAGCA GCCAATACCA ACATGGTCCT GGCCTACTGG TTGATTGGCC GGGAAATCGT ACAGGAAATA CAGGGTGGAG AAACGCGGGC CAAATATGGC AAGCAAATCA TAGAGGAGTT GTCCGCTCGT CTGAAAACCC GTTTCGGTAG AGGATTTTCC ACGACCAATC TGCGCTATAT CCGTACCTTT TATACGGTTT ATGCAGATCG TCACCCCGAG ATTCGCCAGA TCCCATCTGG CGAATTGATG TCTGATTCAA AACGCCAGAC CCAATCTGGC GTTTTGGAAG ACATGTCAAT GGCTGTTGAG ACAGGCGCTG CTCTCAGAGG ATTTTCTCCT GTGCTGAGCT GGTCGCACTA TCAGGTGCTG ATGGGGGTAG AAAACGTCAA CGAACGGCTC TTTTATGAGA TCGAAGCTGA AAAAGAGGGC TGGGAGGTCG AGCATCTGAA ACGCCAGATT CATTCGTTTC TCTTTGCCCG CCTGCTGAAA AGCCATGATA AAGCGGCGGT GATGATACTG GCCAGCCAGG GACAGGTGCT GCAAACTGCG GCTGACGCCA TCCGCAATCC CTATATCCTC GATTTTCTGG GACTTCCCGA AGCCGATGTT CTGCATGAAT CGGGGCTTGA GTCGGCGATT ATTCAAAACC TTCAGTCCTT TCTGCTTGAA CTTGGCAAAG GCTTCGCCTT TGTCGGTCGG CAGAAGCGCC TGCAATTTGA CGCAGACTAC TTCTATGTCG ATCTGGTGTT CTACAACTGC ATTTTGAAGT GCTACTTGCT GATTGACCTG AAAATTGGGG AACTCACCCA TCAGGACGTT GGGCAGATGG ACAGCTATGT TCGCATGTTC GACGACAAAT ACCTGACACA AGGGGATAAT CCAACCATTG GCCTGATTCT TTGCGCCAAG AAAAATGAAA CGATTGCCCG GTACTCTGTC TTGAACGAGA ACCGCCAGAT ATTCGCATCA AAATACATGC TTTATTTGCC AACCGAGGAG GAGCTACGAC TGGAAATTGA AAAAGAACGC AGACTGATTG AATCCGCCAT TGAAGATGGT GGCGAGGGAG GTGAGCATGG CGAGAGGTAA
|
Protein sequence | MSDMIQPEGN SLFDRVVSIL EQARGNVLRA ANTNMVLAYW LIGREIVQEI QGGETRAKYG KQIIEELSAR LKTRFGRGFS TTNLRYIRTF YTVYADRHPE IRQIPSGELM SDSKRQTQSG VLEDMSMAVE TGAALRGFSP VLSWSHYQVL MGVENVNERL FYEIEAEKEG WEVEHLKRQI HSFLFARLLK SHDKAAVMIL ASQGQVLQTA ADAIRNPYIL DFLGLPEADV LHESGLESAI IQNLQSFLLE LGKGFAFVGR QKRLQFDADY FYVDLVFYNC ILKCYLLIDL KIGELTHQDV GQMDSYVRMF DDKYLTQGDN PTIGLILCAK KNETIARYSV LNENRQIFAS KYMLYLPTEE ELRLEIEKER RLIESAIEDG GEGGEHGER
|
| |