Gene Cphamn1_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2080 
Symbol 
ID6375774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2246626 
End bp2247987 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content47% 
IMG OID642684571 
Productprotein of unknown function DUF849 
Protein accessionYP_001960470 
Protein GI189501000 
COG category[S] Function unknown 
COG ID[COG3246] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.550211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTATC ACATTGAAAA AGCCTCTCCT GATGATTATT CGGCAATACT TGACATCATG 
GAGATCTGGA ACATGCATCA TGTTCCTTCA GAAGAAATGT CCGAACTTGA TATCTCCTGT
TTTTTCGTCG CGCGTGTTTC AGGGCATATC GCCGGGGCTG CCGGTTACAA ACTTCTGTCA
CCCGAAAAAG GGAAAACGAC TCTTCTCGGG ATCAGACCGG AATTTTCAGG TATGGGACTG
GGCAGAGCGC TTCAGGATGC CCGGCTCGAA GCCATGTATG CAAAAGGGAT CAAACGGGTC
GAGACAAACG CCGATCGGCA GGAAACCATC ATATGGTATA AAAAACACTA CGGCTATCAT
CAAACAGGGT CGCTGAAGAA GGTCTGTGAT TTCAGCCTTT CTGACGTCGA TCACTGGACT
ACGCTTGAGA TGAACCTTGA AGCATACATG CTTTCAAGAA AAACGCATCA GGAATATCGA
AGCAACTATA TTGCGAAAAA CGATCCCCAC CCGCTCTCTC CCTACGATCC ATTGATCATC
AATGCCTGCC TGACCGGCAT GATACCCACG AAATTCAGTA ACCCTCATGT TCCCATCTAT
CCTGATGAAA TCATTGAAAA TGCTGTCCGG GTACATGATG CAGGGGCAAG AATCGTTCAT
CTGCACGCGA GAGACGAGAA AGGCGAACCT ACTCCTGATG CGAAATACTA TGAAAAGATT
ATTCAGGGCA TACGGGAGGA AAGACCCGGC ATGGTATGCT GTGTGACAAC TTCGGGAAGA
AACTGGAAAT CTTTCGAACA GCGATCAGAG GTACTGCACT TACATGGGAA GGCAAAACCG
GATATGGCCA GTCTGACACT TGGATCACTG AATTTCTTTA CCGGTCCCAG TGTCAGTTCC
ATTGAAATGA TCGAACGCCT TGCCATCACC ATGTATGAAC GGAATATCAA ACCGGAACTT
GAGGTGTTCG ATACCGGAAT GATCACGCTC GCCAAATATC TTGAAAGAAA CCGGCTGCTG
TCAGGAAAAA AGTATTTCAA TCTTCTGTTC GGCAATATCA ATACCGCTCC GGCGACAATT
TCAAGCCTTG CCTTGATGAC TCAAGCTCTT CCGGATAATT CAATCTGGGC GGGAACCGGC
CTGGGACAGT TTCAGCTCCC AATGAATGCG GCTGCGATTA TCGCAGGGGG ACATGTACGG
GTTGGTATAG AAGATGCGAT CTATTTTGAC TATGGAAAAG AAAGACTTGC CACAAATGAG
CAACTTGTTA AAAGAGTAGC AAGGATCGCT GAAGAAATGC AACGGCCTCT TGCGACCACT
GAGGAAACAA GGAAACTCAT TGGGTTGGAA AACCAGGAAT AA
 
Protein sequence
MHYHIEKASP DDYSAILDIM EIWNMHHVPS EEMSELDISC FFVARVSGHI AGAAGYKLLS 
PEKGKTTLLG IRPEFSGMGL GRALQDARLE AMYAKGIKRV ETNADRQETI IWYKKHYGYH
QTGSLKKVCD FSLSDVDHWT TLEMNLEAYM LSRKTHQEYR SNYIAKNDPH PLSPYDPLII
NACLTGMIPT KFSNPHVPIY PDEIIENAVR VHDAGARIVH LHARDEKGEP TPDAKYYEKI
IQGIREERPG MVCCVTTSGR NWKSFEQRSE VLHLHGKAKP DMASLTLGSL NFFTGPSVSS
IEMIERLAIT MYERNIKPEL EVFDTGMITL AKYLERNRLL SGKKYFNLLF GNINTAPATI
SSLALMTQAL PDNSIWAGTG LGQFQLPMNA AAIIAGGHVR VGIEDAIYFD YGKERLATNE
QLVKRVARIA EEMQRPLATT EETRKLIGLE NQE