Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1079 |
Symbol | |
ID | 6374753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1168069 |
End bp | 1168896 |
Gene Length | 828 bp |
Protein Length | 275 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642683580 |
Product | acid phosphatase (Class B) |
Protein accession | YP_001959498 |
Protein GI | 189500028 |
COG category | [R] General function prediction only |
COG ID | [COG2503] Predicted secreted acid phosphatase |
TIGRFAM ID | [TIGR01533] 5'-nucleotidase, lipoprotein e(P4) family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000160418 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.248613 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATC TCTACAGATC GATCTGCTAC GGTGTTCTCG TCCTGTTTGC GACAGGTTGT GCATCGACCG CGAACGACAA TTTCAACAGC CTTCTGTGGA TGCAGTCTTC GGCAGAATAC AAGGCCAACA CGACACAGGC GTACCAGGCG GCCATGAAGC ATATCGACGC AGCTATCTCC GACAGGTCAT GGGTCGCTGC TGAAGAACAG ACCGGAGACT GTTCGAAACT GCCTCCGGCC GTTGTCTTGG ACATCGACGA GACCGTTCTG GACAATTCGA AGTACATGGG AAAGGTGGTG CTCGAAAACG GCGAATGGAG CGCGGTGACC TGGGACGAGT GGGTCGCCCT GAAAGACGCG ACGGCTATTC CCGGAGCGGT GGGTTTCATC AACGCGATGA AGAAAAAAAA TGTCACAGTC ATCTTCATCT CGAACAGGGA GTGCGGCAAA CGTGACGGTT CGGAATCAGG ATGCATGCAG GAAACCGATA CAATAGAAAA CTTGGCGAAG GTCGGCGTGA CGGACGTTTT TCCTGAACAC GTTCTGTTGA AAGGGGAGAA GGAAGGCTGG ACGTCGGAGA AAAAGAGCCG GAGAGAATAC GTTGCAAAAA AATACAGGAT CGTCATGCTT TTCGGGGACG ACCTGGGAGA TTTCCTGCCC GACGTCAAGA AAAACATCAC CCCCGCCGAG CGTGATCGTT TGGTCGAGGA AAACCGGGCG AACTGGGGCA AAAAGTGGTT CATACTTCCG AACCCGACCT ACGGGTCGTG GCTAAACGTA CTCGGCGATC CGAAATCACA GTATATCAGG GTATACGACG GAAAATAA
|
Protein sequence | MKNLYRSICY GVLVLFATGC ASTANDNFNS LLWMQSSAEY KANTTQAYQA AMKHIDAAIS DRSWVAAEEQ TGDCSKLPPA VVLDIDETVL DNSKYMGKVV LENGEWSAVT WDEWVALKDA TAIPGAVGFI NAMKKKNVTV IFISNRECGK RDGSESGCMQ ETDTIENLAK VGVTDVFPEH VLLKGEKEGW TSEKKSRREY VAKKYRIVML FGDDLGDFLP DVKKNITPAE RDRLVEENRA NWGKKWFILP NPTYGSWLNV LGDPKSQYIR VYDGK
|
| |