Gene Information |
Locus tag | Cphamn1_2080 |
Symbol | |
ID | 6375774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2246626 |
End bp | 2247987 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642684571 |
Product | protein of unknown function DUF849 |
Protein accession | YP_001960470 |
Protein GI | 189501000 |
COG category | [S] Function unknown |
COG ID | [COG3246] Uncharacterized conserved protein |
TIGRFAM ID | |
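The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon, both of which are assumptions that happen to agree with the figures in this record. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2246626, 2247987)
print(length)                  # 1362
print(protein_length(length))  # 453
```

Both values match the Gene Length and Protein Length fields of the record.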
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.550211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTATC ACATTGAAAA AGCCTCTCCT GATGATTATT CGGCAATACT TGACATCATG GAGATCTGGA ACATGCATCA TGTTCCTTCA GAAGAAATGT CCGAACTTGA TATCTCCTGT TTTTTCGTCG CGCGTGTTTC AGGGCATATC GCCGGGGCTG CCGGTTACAA ACTTCTGTCA CCCGAAAAAG GGAAAACGAC TCTTCTCGGG ATCAGACCGG AATTTTCAGG TATGGGACTG GGCAGAGCGC TTCAGGATGC CCGGCTCGAA GCCATGTATG CAAAAGGGAT CAAACGGGTC GAGACAAACG CCGATCGGCA GGAAACCATC ATATGGTATA AAAAACACTA CGGCTATCAT CAAACAGGGT CGCTGAAGAA GGTCTGTGAT TTCAGCCTTT CTGACGTCGA TCACTGGACT ACGCTTGAGA TGAACCTTGA AGCATACATG CTTTCAAGAA AAACGCATCA GGAATATCGA AGCAACTATA TTGCGAAAAA CGATCCCCAC CCGCTCTCTC CCTACGATCC ATTGATCATC AATGCCTGCC TGACCGGCAT GATACCCACG AAATTCAGTA ACCCTCATGT TCCCATCTAT CCTGATGAAA TCATTGAAAA TGCTGTCCGG GTACATGATG CAGGGGCAAG AATCGTTCAT CTGCACGCGA GAGACGAGAA AGGCGAACCT ACTCCTGATG CGAAATACTA TGAAAAGATT ATTCAGGGCA TACGGGAGGA AAGACCCGGC ATGGTATGCT GTGTGACAAC TTCGGGAAGA AACTGGAAAT CTTTCGAACA GCGATCAGAG GTACTGCACT TACATGGGAA GGCAAAACCG GATATGGCCA GTCTGACACT TGGATCACTG AATTTCTTTA CCGGTCCCAG TGTCAGTTCC ATTGAAATGA TCGAACGCCT TGCCATCACC ATGTATGAAC GGAATATCAA ACCGGAACTT GAGGTGTTCG ATACCGGAAT GATCACGCTC GCCAAATATC TTGAAAGAAA CCGGCTGCTG TCAGGAAAAA AGTATTTCAA TCTTCTGTTC GGCAATATCA ATACCGCTCC GGCGACAATT TCAAGCCTTG CCTTGATGAC TCAAGCTCTT CCGGATAATT CAATCTGGGC GGGAACCGGC CTGGGACAGT TTCAGCTCCC AATGAATGCG GCTGCGATTA TCGCAGGGGG ACATGTACGG GTTGGTATAG AAGATGCGAT CTATTTTGAC TATGGAAAAG AAAGACTTGC CACAAATGAG CAACTTGTTA AAAGAGTAGC AAGGATCGCT GAAGAAATGC AACGGCCTCT TGCGACCACT GAGGAAACAA GGAAACTCAT TGGGTTGGAA AACCAGGAAT AA
|
Protein sequence | MHYHIEKASP DDYSAILDIM EIWNMHHVPS EEMSELDISC FFVARVSGHI AGAAGYKLLS PEKGKTTLLG IRPEFSGMGL GRALQDARLE AMYAKGIKRV ETNADRQETI IWYKKHYGYH QTGSLKKVCD FSLSDVDHWT TLEMNLEAYM LSRKTHQEYR SNYIAKNDPH PLSPYDPLII NACLTGMIPT KFSNPHVPIY PDEIIENAVR VHDAGARIVH LHARDEKGEP TPDAKYYEKI IQGIREERPG MVCCVTTSGR NWKSFEQRSE VLHLHGKAKP DMASLTLGSL NFFTGPSVSS IEMIERLAIT MYERNIKPEL EVFDTGMITL AKYLERNRLL SGKKYFNLLF GNINTAPATI SSLALMTQAL PDNSIWAGTG LGQFQLPMNA AAIIAGGHVR VGIEDAIYFD YGKERLATNE QLVKRVARIA EEMQRPLATT EETRKLIGLE NQE
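The gene and protein sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how they could be checked against each other and against the GC content field, the code below strips that formatting, computes GC percentage, and translates a coding sequence. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean`, `gc_percent`, and `translate` are hypothetical and chosen for illustration.

```python
# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGCACTATC ACATTGAAAA AGCCTCTCCT"))  # MHYHIEKASP
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.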