Gene Cphamn1_1625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1625 
Symbol 
ID6375305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1752411 
End bp1754057 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content47% 
IMG OID642684114 
Productprotein of unknown function DUF814 
Protein accessionYP_001960026 
Protein GI189500556 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGCA ACTATTTTAC GCTCTACCAC CTCGCCCGCG AACTTCATGA ACTGGTCGCC 
GGGGGTTATA TTTTCGAGAT CTATTCGCAA CAGAAAAGCG AAATCACCAT AAGCCTGATA
ACCAACGAGG GAAACCATCT GCAACTCATC GTAGTCACCG GTCATCCCAG GCTCTGCATC
TATACCAGAG AAGGGCTGCA AAGAAAGCAG CGAAATACTG CCGGTCTGAT GCCGGAGCTC
AATGAAAAAA AAATTTCGTG CATCACCATA GATCCGTGTG ACAGGATTAT CAGAATTGAA
ACTGAAGACA ACTACGCCAT TGTGCTTCAA CTGTTCAGCG CAAAAACAAA TATCTTTCTG
GAGCATAACG GCAATATTGC AGGCAGTTTC AAAAAAGGGA TAGCTCGTTC AGGTTCCGGC
AGTCAGGAAG CCATCCTTCG ACCGGATATT CTGCGTACTC TCGAAAGGAT GGTTCAAAAC
CGCCGTTACT TCATTGAGTC CTTTTCCGGA ACAGATCGAA AGGACACAGA AATACCCGCG
CAACTACTCC CCGGTTTTGA TCGTGGGCTC ATAAAAGAAC TGCTCGGGCG ATGCGGAAAA
AATCGTTCTC CTGAAACTAT ACACGAACAA CTCTCCACGC TCTTTTACGA ACTCATTGAT
CCCTGCCCGT CAGTACTTTT CACAAATGAA AACGGCCCGC TCTTCTCGAT ACTGCAGCAA
AAGAAAAAAG AGTGCGTAGA ATTTGACAGC GTTATTGAAG GATTGAACTT CTATAGCTCG
AAAACAAGAC AACACCGTAA AACCGTTGAG CTTGTTCATC AGATTGAAGG GAAACTGCTT
CAAAAAAGAA AAAAAATCGA CAGTGAACTA CAGCACTTTC AACCCGAATT GCTTCGGCGA
CAGTTTGATG CATATCAACG ATACGGGCAC CTTCTCATGG CGAATCTCTC CCTGGCTGAC
TGCAGGAAAG AGAGTATAAC AGTTCCCGAT ATTTTTGATC CTTCCGCCCT CCCCGTAACC
ATAGCCCTCA AGCCGGAACT CAACCTGCAG GAAAACGCGG CTCTCTGGTT CCGGAAAGCA
TCCAGAACAC GAGAAAAACT TCAAGGCGGC AGTCGAAGAA TAGCCGCTGT TGCAGAAGAG
AAGCAGGCAC TTGAAAAGCT CATTACAGAA CTCGGCAAAC TGGCAAAACC CTCGGAAGTT
ACACGCTTTG AAAAAAACAA CAGCGCTCTC CTGAAAAAAC TCGGATGTGA AAGCAATTCA
GGAAAAACCG GAAAGAGACT ACCGTTCCGC AGTTTTGAGC TTTCTGAAAA AGCGGCTCTG
TACGTCGGCA AAAACGCTGA AAACAACGAA AAGCTTACCT TTACCTTTGC CAGACCTCAT
GACATCTGGC TGCATGTACG AGGAGCAGCC GGTTCGCACT GTATTCTTCG CGGAACGACG
ATACAAAACA TCTCGGCAAT ACGAACTGCG GCCGAAATTG CCGCGTTTTA TTCTTCCTCC
CGTCATGCAG AACTAGTCCC TGTCGTCTAC ACCGAAAAAA AATATGTTCG ACGCGCAAAA
AATATGCCTC CGGGAAAGGT CGTCGTAGAA AAAGAGCAGG TGATACTGGT ACATCCTTCA
CGTTTTTTCG ACGCTGCAGA AAAGTAA
 
Protein sequence
MLRNYFTLYH LARELHELVA GGYIFEIYSQ QKSEITISLI TNEGNHLQLI VVTGHPRLCI 
YTREGLQRKQ RNTAGLMPEL NEKKISCITI DPCDRIIRIE TEDNYAIVLQ LFSAKTNIFL
EHNGNIAGSF KKGIARSGSG SQEAILRPDI LRTLERMVQN RRYFIESFSG TDRKDTEIPA
QLLPGFDRGL IKELLGRCGK NRSPETIHEQ LSTLFYELID PCPSVLFTNE NGPLFSILQQ
KKKECVEFDS VIEGLNFYSS KTRQHRKTVE LVHQIEGKLL QKRKKIDSEL QHFQPELLRR
QFDAYQRYGH LLMANLSLAD CRKESITVPD IFDPSALPVT IALKPELNLQ ENAALWFRKA
SRTREKLQGG SRRIAAVAEE KQALEKLITE LGKLAKPSEV TRFEKNNSAL LKKLGCESNS
GKTGKRLPFR SFELSEKAAL YVGKNAENNE KLTFTFARPH DIWLHVRGAA GSHCILRGTT
IQNISAIRTA AEIAAFYSSS RHAELVPVVY TEKKYVRRAK NMPPGKVVVE KEQVILVHPS
RFFDAAEK