Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0978 |
Symbol | |
ID | 6374647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1052450 |
End bp | 1054618 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642683479 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_001959402 |
Protein GI | 189499932 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.515935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.134973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAGT TTCTGGCTCC CTATCCTGCG GTTCGTCTGC TTGCAATAGC TTCTGCTGGA ATAGCTGCCG GAGTCTGTTG GCGCATCCCG CTCTGGTATT GGGTATCTGC TGCCCTAATC TCTTTCCTGC TGCTTGGAAT CTTTTTGTTT GTCTGCCGGG GGAAGCCTCT CTCCAATGCT GCGGTCACGG CATATTCCCT TTTTGTTTTC TGCGGTTTTA GTCTGTACAC CGGTTCCCTG TACAACAACC TTCCTTCTTC AACCGTGCGT AACTGGCTTG ACAAAGAGGT ACTTCTTTAT GGCAAGGTTG TATCGAGGCC GCGGGTGTAT GACAAAGGGG CAGGATGGAT ACTTCAGACG AGAGAGGTCT TTGCTGATGG TGAGGCTCGT GAGGCGTCAG GAAACGTCAA GGTTTTTCTG CGCATGCGAC ACGGCACGGA ACCGGAAGTC GAGAAAGGGG ACATGATCCG CGTCAAAGGG CGTGTCGGTT TGATTCCTCG TGCGGAAAAT CCAGGTGACT TTGATCCTCG TGAATATTAC CGAAAAAAAA GGGTGCATGC CGAACTGTTC TGTTACGGGC CCTGGTTGAT GCACAACTAC GGGATCGACG AGAGTGATTA TTTTGAGGCT CTGCTCGTAC GGCCTGTCAG GCGGTATCTT TCCGGCACGA TAGCGGCACT TGTTCCTCCC GGACATGAAC AGCAGTTTAT TCAGGGAGTT TTTCTCGGTC AGAAGGAGTT GCTCGACAGG GAGGTGTACC GGACGTTTAA GGCAGCGGGA ACCGCTCATG TGCTTGCCGT TTCAGGGCTG CATGTCGGAC TGATCGTGAT GGTCCTGCTT GTCGTGTTGC AGAGATTGAG AATAACGGTT GCCGGAAAAT GGATCGTGCT TGTTCTGATA GCCTTTGTCC TGCTGGTCTA TTCTTCAGTG ACCGGCAACG CGCCGTCGGT AAGGAGGGCT TCGCTGATGG TTGTCGCTCT GACAGGCAAC TCGGTTCTGT CGAGGCAGTC TTTTCCCCTG AACTCCCTCG CAGTGGCCGA TCTGATACTG TTGTGTATCG ATCCTCTCGA GCTTTTCAAT GCAGGTTTTC TGATGACCAA TGCAGCTGTA GCCGCGATCA TTCTGCTCTA TCCGGTGTTG AGCTCGCCGT CGGAAACATG GAAAGGCCTC GCGGGAGAGG TTTTCAGGCC CGTATGGAAA GCCTTCAGCG TCAGTCTTGC GGCAATTATC GGTGTGTCGC CTGTTATCGC ATGGTTTTTC GGAACCTTTT CTGTCGTCGG GATCATCGCA AACCTGCCGG TTGTGTTTTT GGCAAGCTGC ATGCTTTACG CGATGCTGCC TGCATTTCTT CTGAACTCTG TTCTGCCGGA TCTGGCGCTC TATCCTGCAT CGAGTGCATG GTTTTTTGCC AGGATGACGC TTGGCGTGAC GGAATATTTC GGTGACCTCT CATGGGCGGT AGTGCGGGCG CAGCCCGATA GCGTATGGAT TGCTGTCTAT TATGCGGCTG TTGCCGCAGT ACTTTTCTTT CTTTACCGTA AGAACCCGGC GGGAGTGATG ATTGCCTTTT TGTGCGCGGC AAACTATTTT GTCTGGGCGC CTGTTCTGCA AGCGGAACGT TCGTCGCCTG CTTTTGTCAG GATTGTCGCA GGCAACGATA CCGCTCTGCT TTGTTCGATT GGCGGTTCTT CAATTCTGAT CGATGCGGGA TCCAAACCCT GGCATCGGGA AACAATCAAC CGGCAGATGC GTCGTAACGG GATAAAAAAA ATCGACGCGG CTATACAGTT TGCCTCGCCT GATTCACTTG TAGGGGCAAT CGATGCTCAA AACCACATGC TTTCAGAAGA GCACCATCTC GCACAATCCT CATTCCTGGT CGCAAGATCA GGCGATGATG TACTCAAGTT CTGGGGAGGC GACGGATCAT CCATGCTGGT TGTCAGCGAT CCTGACGCGC TCAGGCGTCT CGACAGTGAG AGGCCGGACA GCGTTGTTGT CAAGGTGAAT CGTTTCGGAA TCGACGACTA TCGCAGACTT GCCGAATGGT TGGACCGTGT GCACCCCCAG GAGTGTGTCG TCTTGTGCTC TCCGCGGCTG AAGGATAGAG GGCGGGGGTT GCTCGCGCAT CTGGCGGGGA AGCGCGGGGA GGTGACTATA GTCGAATGA
|
Protein sequence | MAQFLAPYPA VRLLAIASAG IAAGVCWRIP LWYWVSAALI SFLLLGIFLF VCRGKPLSNA AVTAYSLFVF CGFSLYTGSL YNNLPSSTVR NWLDKEVLLY GKVVSRPRVY DKGAGWILQT REVFADGEAR EASGNVKVFL RMRHGTEPEV EKGDMIRVKG RVGLIPRAEN PGDFDPREYY RKKRVHAELF CYGPWLMHNY GIDESDYFEA LLVRPVRRYL SGTIAALVPP GHEQQFIQGV FLGQKELLDR EVYRTFKAAG TAHVLAVSGL HVGLIVMVLL VVLQRLRITV AGKWIVLVLI AFVLLVYSSV TGNAPSVRRA SLMVVALTGN SVLSRQSFPL NSLAVADLIL LCIDPLELFN AGFLMTNAAV AAIILLYPVL SSPSETWKGL AGEVFRPVWK AFSVSLAAII GVSPVIAWFF GTFSVVGIIA NLPVVFLASC MLYAMLPAFL LNSVLPDLAL YPASSAWFFA RMTLGVTEYF GDLSWAVVRA QPDSVWIAVY YAAVAAVLFF LYRKNPAGVM IAFLCAANYF VWAPVLQAER SSPAFVRIVA GNDTALLCSI GGSSILIDAG SKPWHRETIN RQMRRNGIKK IDAAIQFASP DSLVGAIDAQ NHMLSEEHHL AQSSFLVARS GDDVLKFWGG DGSSMLVVSD PDALRRLDSE RPDSVVVKVN RFGIDDYRRL AEWLDRVHPQ ECVVLCSPRL KDRGRGLLAH LAGKRGEVTI VE
|
| |