Gene Cphamn1_0735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0735 
Symbol 
ID6374400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp775055 
End bp778423 
Gene Length3369 bp 
Protein Length1122 aa 
Translation table11 
GC content58% 
IMG OID642683244 
Producthypothetical protein 
Protein accessionYP_001959170 
Protein GI189499700 
COG category[S] Function unknown 
COG ID[COG4913] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01633] putative phage tail component, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00640704 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGAAG CCCTTGAATT CGGATTCATC TCAGACGACC GCCTCGCCGG CTTCCGCCTC 
CAGCGCCTCG AAGTGTTCAA CTGGGGCACC TTCGACGGAC GAGTCTGGAC ACTCCGGCTC
GACGGGCGGA ACAGTCTCCT CACAGGCGAT ATCGGTTCAG GCAAATCAAC ACTTGTCGAT
GCTGTAACCA CGTTACTGGT ACCAGGCCAG CGTATCGCCT ACAACAAGGC AGCCGGAGCC
GAAACGCGGG AACGCTCGCT TCGCTCCTAC GTGCTCGGTT TCTACAAATC CGAGCGGCAG
GAGTCCCTCG GAGGCGGCAC CAAACCGGTT GCCCTGCGCG ACAGCAACGC CTACTCTGTG
GTGCTCGGCG TCTTCCACAA TGAAGGCTAC GACAAAACCG TAACCCTCGC CCAGCTCTTC
TGGATGAAGG ATCCGACAGG GCAACCTGCC CGCCTTTATG CTGCCGCAGA ACGCGACCTC
TCCATCGCCA ACGACTTCTC CGGCTTCGGC ACTGAGATCA CCCCCCTGCG CAAGCACCTG
CGAGCAAAAG GCGTCGAGGT GTTCGACAGC TTTCCGAAAT ACGGCGCCTG GTTCCGCCGT
CGATTGAGTA TCAACAACGA ACAGGCGCTG GAGCTGTTTC ACCAGACAGT TTCTCTCAAA
TCGGTGGGCA ACCTGACCGA GTTCGTGCGC AGCCACATGC TCGAACCTTT CGACGTCCAG
CCGCGCATCG CAGCCCTGAT CCGCCACTTC GAAGACCTCA ACCGTGCACA TGAAGCTGTA
CTCAAGGCGA AACGGCAGAT CGAAAAGCTT ACCCCGCTCG TAGAGGATTG CGACCGCCAC
CGGGAGATCA ACGCCTCCAC CGAAGAGTTG CGTGGTTGCC GCGAAGCACT GCGCCCCTGG
TTCGCATCTC TCAAGCTGCA ACTGCTCGAA CACCGCCTCG CAAGCCTTAA AGAGGAGATG
GCCCGCCATG AGAGCGCGGT CGAACGACTT GAACAGCAAC GACGCGAAGG GCAAATGCGC
GAACGAAACC TCCGTCGCAC CATTGCCGAA AACGGTGGCG ACCGGATCGA AAGCATTGCC
TCCGAAATCA ACACAAAGCA GGAAGAACTC GACCGGAAAA AGCAAAAATC CTCCCGCTAC
GGCGATCTGG TACTCCAACT CGGTGAGGCA CAGGCAACCA ACGTCGAAGA GTTCTTCCGC
CAGCGTGCCG GACACGAAAC CATGCGTGAA GAGATTGAAG AGCGAGAGGT GAGCGTCCAG
AACGACCTCA ACGAAACCGG CGTCTCTGTC GCCAGTATGC AACAGGAGTA CCGGGAACTG
ACAGCCGAAA TCAAGAGTCT CAAGGCCCGT CAAAACAACA TCGATGACAG ACAGATCGCC
ATGCGTCGCG AACTTTGTCA GGCGCTCGAA ATAGACGAGG AAATGATGCC CTTCGCCGGC
GAACTGCTCC AGGTGCGAGA AGAGGAAACT GCTTGGGAAG GGGTCATCGA ACGGCTGCTG
CGCAACTTCT CCCTCTCGCT TCTCGTGCCT GAGCATCACT ACGCACAGGT GGCTGAGTGG
GTTGACCGCA CCCACCTCAA GGGACGCCTC GTCTACTTCC GGGTGCGCCC CCCGGCACGG
CGCGAAACCA TGCCCGACAA TCCGGGCTCA CTGCCCCGCA AGCTCATGAT CAAGACCGAC
TCTCCTCACT TCCACTGGCT CGAACGTGAG ATCGGCCACC GCTTCGACGC AGTCTGCTGC
GACAGCCAGG AGGAATTCCG ACGAGAGAAA AAAGCGATCA CGATGGCCGG ACAGGTCAAA
ATGCCCGGCG AGCGTCACGA AAAAGACGAC CGCCACCGCC TCGACGATCG TAGCCGCTAC
GTACTCGGCT GGAGCAACGC AGCCAAGATA GCCGTGCTCG AAAAAAACGC CCGACAGAAA
CGTGACGAAC TCGGCGAACT CAACCACCGC ATGAAAGTCA TGCAACAGGA GCAATCGACC
CTGAAAGAGC GCATTACGAT CCTCTCTCGC CTCGACGAGT ACCCCGACTT CCACGATCTC
GACTGGCAGC CGGTAGCCGT CGCCATCGCC CGGCTTGAAG CCGAAAAACG CGAGCTTGAG
AGCACTTCCG ACAAGCTCCG CACCCTCACC ATGCAACTCG ATGAGGTAGA AAAAGGACTC
GAGAAGACCG AGCGCCTCCT CGACGAACGC AAGGACAAGC GATCCAGAAC CAAGGAGAAG
ATCAGCAGCG CAAGAGAGCT TGAAGAACAG GCCCTGCTGT TCGTCAACGA AGCAGCATCA
GGGACAACCG ACCGCTTTTC CCGCCTCAAA GCCCTTCAGT CTGAGATGCA GGAGAGCCGC
GTTCTTACCG TTGAATCGTG CGACAACCGC GAGCGCGAAA TGCGCGACTG GCTGCAGGCA
AGAATCGACG CGGAAAACAA AAAGCTCTCG CGCCTGAACG AACAGATCAT CCGTGCGATG
ACAGAGTACA GCGAAAAGTG GAAGCTCGAA ACCCGCGAGG TCGATATCAA TATCGCCTCT
GCTGACGAGT ACCGCTCAAT GCTACAAGAG CTCCGTAAAG ACGACCTGCC GCGCTTCGAA
GAAAAATTCA AGGAACTGCT CAACGAAAAC ACCATCCGCG AGGTAGCCAA CTTCCAGTCA
CAGCTCACCC GTGAGCGCGA AACCATCAGG GAGCGTATCG CCCGGATCAA CGAATCGCTC
ACGAAAATCG ACTACAACCC CGGCCGCTAC ATCAGCCTCG AAGCGCAGAT AAACCTCGAT
GCCGACATCC GCGAGTTCCA GGCTGAGCTT CGAAGCTGCA CTGAGGGCAC ACTGACCGGT
TCGGACAACG CGCAGTATTC CGAGGCAAAG TTCCTGCAAG TACGCAAGAT TATCGACCGG
TTCCGGGGAC GCGAAGAGTT CGCCGACCTC GACCGCCGCT GGACGGTCAA GGTTACCGAC
GTGCGAAACT GGTTCGTCTT CGCGGCCAGC GAAAGATGGC GTGAGGACGA CACCGAGCAT
GAACACTACG CCGATTCGGG AGGCAAATCG GGCGGACAGA AGGAGAAGCT CGCCTACACG
GTGCTCGCCG CCAGTCTCGC CTACCAGTTC GGTCTCGAGT GGGGAGAGGT TCAGTCGCGG
TCATTCCGCT TCGTGGTCAT AGACGAGGCA TTCGGACGCG GATCGGACGA ATCGGCCAAT
TACGGTCTGC AGCTCTTCGA GCAGCTCAAC CTGCAACTGC TCATCGTCAC TCCCCTGCAG
AAGATCCACA TCATTGAACC CTTCGTCTCC AGTGTCGGAT TCGTTCACAA CGAGGACGGC
CGGAACTCCG TGCTGCGCAA CCTCAGCATC GAAGAGTACC GCACCGAAAA ACGAAACATG
CAGAAATGA
 
Protein sequence
MTEALEFGFI SDDRLAGFRL QRLEVFNWGT FDGRVWTLRL DGRNSLLTGD IGSGKSTLVD 
AVTTLLVPGQ RIAYNKAAGA ETRERSLRSY VLGFYKSERQ ESLGGGTKPV ALRDSNAYSV
VLGVFHNEGY DKTVTLAQLF WMKDPTGQPA RLYAAAERDL SIANDFSGFG TEITPLRKHL
RAKGVEVFDS FPKYGAWFRR RLSINNEQAL ELFHQTVSLK SVGNLTEFVR SHMLEPFDVQ
PRIAALIRHF EDLNRAHEAV LKAKRQIEKL TPLVEDCDRH REINASTEEL RGCREALRPW
FASLKLQLLE HRLASLKEEM ARHESAVERL EQQRREGQMR ERNLRRTIAE NGGDRIESIA
SEINTKQEEL DRKKQKSSRY GDLVLQLGEA QATNVEEFFR QRAGHETMRE EIEEREVSVQ
NDLNETGVSV ASMQQEYREL TAEIKSLKAR QNNIDDRQIA MRRELCQALE IDEEMMPFAG
ELLQVREEET AWEGVIERLL RNFSLSLLVP EHHYAQVAEW VDRTHLKGRL VYFRVRPPAR
RETMPDNPGS LPRKLMIKTD SPHFHWLERE IGHRFDAVCC DSQEEFRREK KAITMAGQVK
MPGERHEKDD RHRLDDRSRY VLGWSNAAKI AVLEKNARQK RDELGELNHR MKVMQQEQST
LKERITILSR LDEYPDFHDL DWQPVAVAIA RLEAEKRELE STSDKLRTLT MQLDEVEKGL
EKTERLLDER KDKRSRTKEK ISSARELEEQ ALLFVNEAAS GTTDRFSRLK ALQSEMQESR
VLTVESCDNR EREMRDWLQA RIDAENKKLS RLNEQIIRAM TEYSEKWKLE TREVDINIAS
ADEYRSMLQE LRKDDLPRFE EKFKELLNEN TIREVANFQS QLTRERETIR ERIARINESL
TKIDYNPGRY ISLEAQINLD ADIREFQAEL RSCTEGTLTG SDNAQYSEAK FLQVRKIIDR
FRGREEFADL DRRWTVKVTD VRNWFVFAAS ERWREDDTEH EHYADSGGKS GGQKEKLAYT
VLAASLAYQF GLEWGEVQSR SFRFVVIDEA FGRGSDESAN YGLQLFEQLN LQLLIVTPLQ
KIHIIEPFVS SVGFVHNEDG RNSVLRNLSI EEYRTEKRNM QK