Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1861 |
Symbol | |
ID | 6375552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2016457 |
End bp | 2017575 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642684357 |
Product | hypothetical protein |
Protein accession | YP_001960259 |
Protein GI | 189500789 |
COG category | [S] Function unknown |
COG ID | [COG2855] Predicted membrane protein |
TIGRFAM ID | [TIGR00698] conserved hypothetical integral membrane protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00940925 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0859323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATA CAAGTAAAAG AATCCATCAC TCTTCTGGAA ATCCAGGAAA GCCGGTCGAT ACGGAAATTC CGACTCAGGA GCATATCGCA GAGGCTGAAC GCTGCGGTTT CCAGTGCCGC GCAAAGGCGG CGAAAGTCCG GTTTGACGAG TTGTTTCCCG GGATTATCGC TTCGATAACG GTCGCAGCTG CGGCAACGTT TCTTTCCGAA CACTATGGGG CGCCAACCAT GCTGTTCGCT CTTCTGCTCG GCATGGCGTT TCGCTTTCTG TCTGAAGGCG GTAAGGCTAT TGCCGGTATT CAGTTCGCAT CAACTACCAT TCTGCGCATA GGCGTTGCTT TTCTGGGAAT GCGTATTACA CTGGACCAGA TTCTTTCACT CGGGGCGGGG CCGATGGCCG TGGTCATCGG ATCGGTGTTG CTTACCATTC TCTTCGGTCT CGGTCTTTCA AAATTGATGG GGAGAGGAAA GCGGTTCGGC GTGCTTACCG GAGGCAGCGT CGGGATCTGT GGCGCTTCTG CGGCACTTGC GATATCGGCT ATCCTTCCAA AAGACGAATA CAGTGAACGC AACACCATCT TTACCGTCAT CAGCGTGACC GCCCTGAGTA CCATTGCCAT GATACTCTAT CCGGTTATTG TTCAACAGTT CGGTTTTGAC AATGAGGCGG CAGGCATTTT TCTCGGAGGT ACCATTCATG ATGTCGCGCA GGTGGTCGGC GCGGGTTACT CTGTTTCCGA AAAGACCGGT GACACGGCTA CATTTATCAA ACTTCTGCGT GTCGCCATGC TCGTTCCGGC AGTTTTTACT CTTTCACTGA TCTTTCATAC CCGCAACAAG GAAGAGGGCA ACGACGCGGG CAGGGTCTTT TTTCCGCCGT TCATTATTTT TTTCATTCTC TTTGTCGGCA TAAACAGTTC CGGTTTTACT CCTGAACCTC TCAGGGCTTT TATTGTTGAT ACATCCCGCT GGTGTCTGGT GACGGCTATT TCTGCCCTTG GTATGAAAAC ATCGTTAAAA GCTCTTTTCG ATGTGGGCTG GAAGCCTGTC TCGATACTTG TTGCAGAGAC CGTGTTTCTT GCAGCGCTTG TGCTTGGCGC CATATACTGG ATGGCCTGA
|
Protein sequence | MNDTSKRIHH SSGNPGKPVD TEIPTQEHIA EAERCGFQCR AKAAKVRFDE LFPGIIASIT VAAAATFLSE HYGAPTMLFA LLLGMAFRFL SEGGKAIAGI QFASTTILRI GVAFLGMRIT LDQILSLGAG PMAVVIGSVL LTILFGLGLS KLMGRGKRFG VLTGGSVGIC GASAALAISA ILPKDEYSER NTIFTVISVT ALSTIAMILY PVIVQQFGFD NEAAGIFLGG TIHDVAQVVG AGYSVSEKTG DTATFIKLLR VAMLVPAVFT LSLIFHTRNK EEGNDAGRVF FPPFIIFFIL FVGINSSGFT PEPLRAFIVD TSRWCLVTAI SALGMKTSLK ALFDVGWKPV SILVAETVFL AALVLGAIYW MA
|
| |