Gene Cphamn1_0978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0978 
Symbol 
ID6374647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1052450 
End bp1054618 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content53% 
IMG OID642683479 
ProductComEC/Rec2-related protein 
Protein accessionYP_001959402 
Protein GI189499932 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.515935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.134973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGT TTCTGGCTCC CTATCCTGCG GTTCGTCTGC TTGCAATAGC TTCTGCTGGA 
ATAGCTGCCG GAGTCTGTTG GCGCATCCCG CTCTGGTATT GGGTATCTGC TGCCCTAATC
TCTTTCCTGC TGCTTGGAAT CTTTTTGTTT GTCTGCCGGG GGAAGCCTCT CTCCAATGCT
GCGGTCACGG CATATTCCCT TTTTGTTTTC TGCGGTTTTA GTCTGTACAC CGGTTCCCTG
TACAACAACC TTCCTTCTTC AACCGTGCGT AACTGGCTTG ACAAAGAGGT ACTTCTTTAT
GGCAAGGTTG TATCGAGGCC GCGGGTGTAT GACAAAGGGG CAGGATGGAT ACTTCAGACG
AGAGAGGTCT TTGCTGATGG TGAGGCTCGT GAGGCGTCAG GAAACGTCAA GGTTTTTCTG
CGCATGCGAC ACGGCACGGA ACCGGAAGTC GAGAAAGGGG ACATGATCCG CGTCAAAGGG
CGTGTCGGTT TGATTCCTCG TGCGGAAAAT CCAGGTGACT TTGATCCTCG TGAATATTAC
CGAAAAAAAA GGGTGCATGC CGAACTGTTC TGTTACGGGC CCTGGTTGAT GCACAACTAC
GGGATCGACG AGAGTGATTA TTTTGAGGCT CTGCTCGTAC GGCCTGTCAG GCGGTATCTT
TCCGGCACGA TAGCGGCACT TGTTCCTCCC GGACATGAAC AGCAGTTTAT TCAGGGAGTT
TTTCTCGGTC AGAAGGAGTT GCTCGACAGG GAGGTGTACC GGACGTTTAA GGCAGCGGGA
ACCGCTCATG TGCTTGCCGT TTCAGGGCTG CATGTCGGAC TGATCGTGAT GGTCCTGCTT
GTCGTGTTGC AGAGATTGAG AATAACGGTT GCCGGAAAAT GGATCGTGCT TGTTCTGATA
GCCTTTGTCC TGCTGGTCTA TTCTTCAGTG ACCGGCAACG CGCCGTCGGT AAGGAGGGCT
TCGCTGATGG TTGTCGCTCT GACAGGCAAC TCGGTTCTGT CGAGGCAGTC TTTTCCCCTG
AACTCCCTCG CAGTGGCCGA TCTGATACTG TTGTGTATCG ATCCTCTCGA GCTTTTCAAT
GCAGGTTTTC TGATGACCAA TGCAGCTGTA GCCGCGATCA TTCTGCTCTA TCCGGTGTTG
AGCTCGCCGT CGGAAACATG GAAAGGCCTC GCGGGAGAGG TTTTCAGGCC CGTATGGAAA
GCCTTCAGCG TCAGTCTTGC GGCAATTATC GGTGTGTCGC CTGTTATCGC ATGGTTTTTC
GGAACCTTTT CTGTCGTCGG GATCATCGCA AACCTGCCGG TTGTGTTTTT GGCAAGCTGC
ATGCTTTACG CGATGCTGCC TGCATTTCTT CTGAACTCTG TTCTGCCGGA TCTGGCGCTC
TATCCTGCAT CGAGTGCATG GTTTTTTGCC AGGATGACGC TTGGCGTGAC GGAATATTTC
GGTGACCTCT CATGGGCGGT AGTGCGGGCG CAGCCCGATA GCGTATGGAT TGCTGTCTAT
TATGCGGCTG TTGCCGCAGT ACTTTTCTTT CTTTACCGTA AGAACCCGGC GGGAGTGATG
ATTGCCTTTT TGTGCGCGGC AAACTATTTT GTCTGGGCGC CTGTTCTGCA AGCGGAACGT
TCGTCGCCTG CTTTTGTCAG GATTGTCGCA GGCAACGATA CCGCTCTGCT TTGTTCGATT
GGCGGTTCTT CAATTCTGAT CGATGCGGGA TCCAAACCCT GGCATCGGGA AACAATCAAC
CGGCAGATGC GTCGTAACGG GATAAAAAAA ATCGACGCGG CTATACAGTT TGCCTCGCCT
GATTCACTTG TAGGGGCAAT CGATGCTCAA AACCACATGC TTTCAGAAGA GCACCATCTC
GCACAATCCT CATTCCTGGT CGCAAGATCA GGCGATGATG TACTCAAGTT CTGGGGAGGC
GACGGATCAT CCATGCTGGT TGTCAGCGAT CCTGACGCGC TCAGGCGTCT CGACAGTGAG
AGGCCGGACA GCGTTGTTGT CAAGGTGAAT CGTTTCGGAA TCGACGACTA TCGCAGACTT
GCCGAATGGT TGGACCGTGT GCACCCCCAG GAGTGTGTCG TCTTGTGCTC TCCGCGGCTG
AAGGATAGAG GGCGGGGGTT GCTCGCGCAT CTGGCGGGGA AGCGCGGGGA GGTGACTATA
GTCGAATGA
 
Protein sequence
MAQFLAPYPA VRLLAIASAG IAAGVCWRIP LWYWVSAALI SFLLLGIFLF VCRGKPLSNA 
AVTAYSLFVF CGFSLYTGSL YNNLPSSTVR NWLDKEVLLY GKVVSRPRVY DKGAGWILQT
REVFADGEAR EASGNVKVFL RMRHGTEPEV EKGDMIRVKG RVGLIPRAEN PGDFDPREYY
RKKRVHAELF CYGPWLMHNY GIDESDYFEA LLVRPVRRYL SGTIAALVPP GHEQQFIQGV
FLGQKELLDR EVYRTFKAAG TAHVLAVSGL HVGLIVMVLL VVLQRLRITV AGKWIVLVLI
AFVLLVYSSV TGNAPSVRRA SLMVVALTGN SVLSRQSFPL NSLAVADLIL LCIDPLELFN
AGFLMTNAAV AAIILLYPVL SSPSETWKGL AGEVFRPVWK AFSVSLAAII GVSPVIAWFF
GTFSVVGIIA NLPVVFLASC MLYAMLPAFL LNSVLPDLAL YPASSAWFFA RMTLGVTEYF
GDLSWAVVRA QPDSVWIAVY YAAVAAVLFF LYRKNPAGVM IAFLCAANYF VWAPVLQAER
SSPAFVRIVA GNDTALLCSI GGSSILIDAG SKPWHRETIN RQMRRNGIKK IDAAIQFASP
DSLVGAIDAQ NHMLSEEHHL AQSSFLVARS GDDVLKFWGG DGSSMLVVSD PDALRRLDSE
RPDSVVVKVN RFGIDDYRRL AEWLDRVHPQ ECVVLCSPRL KDRGRGLLAH LAGKRGEVTI
VE