Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0795 |
Symbol | |
ID | 6374462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 851554 |
End bp | 852591 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642683303 |
Product | hypothetical protein |
Protein accession | YP_001959227 |
Protein GI | 189499757 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR00661] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0695397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCC TTTTTGGTGT TCAGGGAACG GGAAACGGTC ATATCAGCAG GAGCAGAGAG CTGGTGACCA GACTGCGTGA GTCCGGCCAT GAGATTGACG TCATAATCAG CGGCCGAAAA GAGGAGGAAC TGAAGGAAAT CGAGGTTTTT TCTCCTTACA GGGTATACAA AGGGCTCACA CTGGTGACCT ATCGCGGCAG AATGAACTAC ATCGAAACGA TGTTCAGACT TGATCTCGGC AAGCTGATGA CCGATGTGTT ATCGCTTGAC GTGAACGGGG TGGATCTCAT TATTACTGAT TTCGAGCCGG TGACGGCCAC TGCGGCCAGG ATGAAAAACA TCCCGTCGCT TGGTTTCGGT CACCAGTACG CTTTCCCGTA CGATGTTCCC ATTGCACGAG GGAACATCTT TGAACGCTAT ACGTTGCTTA ATTTTGCTCC TGCACAGTAT AATGCCGGTC TGCACTGGCA TCACTTCGAT CAACCCATAT TTCCTCCTGT TATTCCCCGG CACCTCTATG AGTCGGGAGC TGTGACAGTA CGTCAGGAAA AAGTGCTTGT CTATCTTCCT TTTGAGGAGT TAGAGGATGT GGCGGCACTG CTGCTGCCAT TTGACGGAAA GGAGTTCTAT ATATACGGTA AGTCAAGTGA AGACCATGAT GACGGCCACC TTCATTACAG GGCGTATTCC CGTGAAGGTT TTCTTCAGGA CCTTCAGGAA TGCAGCGGCG TTGTCTGCAA TGCAGGATTT GAACTGCCCG GAGAGGCTCT TCACCTTGGA AAAAAGCTGT TACTCAGACC GCTTGACGGT CAGATTGAAC AGGAATCGAA CGCGATGGCC ATAGAGGAAC TCGGGTACGG TATGACGATG CACTCTCTCG ATGGAAATGT TTTGAGGGAC TGGCTGCATA AACCCGGCAG AGAGCCGCTC GTTTACAGCA GAACGGTTGA TTATATCGCT GAATGGATAA CGAAAGGAGG GTGGGACGAT CTTTCCGGGT ATGTTGAAGC GGCCTGGGCT GATTGTGGAA AATATTGA
|
Protein sequence | MKILFGVQGT GNGHISRSRE LVTRLRESGH EIDVIISGRK EEELKEIEVF SPYRVYKGLT LVTYRGRMNY IETMFRLDLG KLMTDVLSLD VNGVDLIITD FEPVTATAAR MKNIPSLGFG HQYAFPYDVP IARGNIFERY TLLNFAPAQY NAGLHWHHFD QPIFPPVIPR HLYESGAVTV RQEKVLVYLP FEELEDVAAL LLPFDGKEFY IYGKSSEDHD DGHLHYRAYS REGFLQDLQE CSGVVCNAGF ELPGEALHLG KKLLLRPLDG QIEQESNAMA IEELGYGMTM HSLDGNVLRD WLHKPGREPL VYSRTVDYIA EWITKGGWDD LSGYVEAAWA DCGKY
|
| |