Gene Information |
Locus tag | Cphamn1_2052 |
Symbol | |
ID | 6375745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 2214369 |
End bp | 2215580 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642684543 |
Product | hypothetical protein |
Protein accession | YP_001960443 |
Protein GI | 189500973 |
COG category | |
COG ID | |
TIGRFAM ID | |
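The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that cross-check, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2214369, 2215580)
print(length_bp)                  # 1212
print(protein_length(length_bp))  # 403
```

Both results agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.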
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000379131 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0792841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATAA CCGAAAACCC GAACTGGCGA ACCCTTCATA AAGACAGTAA CAACGGGTAT CAGAAAGTCG TTAAACGTCT CGGAAATGAC GTTATTCTCT ATAGCATAGA AACAGATCAT GACATTTCTC TTGACACCAT GAACATCGAC ATGCTTCAGA CTGTACTGCA GGATTCGAAG ATCGGGAACA AGCCTGTCTG CCTCTTATGG AACATGAAGC ACATTACCAA TATTTCGCTG ACGTATAAAA AACAGATTGC CAACCTTATC TATAACCGCA GAGTACACTT CGGCATTGTC GTCTTCTTTA ACGTGGAGCC CGTGTGCATG ACACTGGTCC AAACGTTTGC CGCAATGGTA CCTGAAGATA TGACGGTACT CATCAAGCAA AACTATACCG AAGCTGTAAA CACGACTTTG GCCTGGAAAG AAGGATTACC TGTTGACACA ATCTATGAAA GTGCCGAGGA AGAGAAATAC GAACTTCAGA AAAATGAGTT TCTTGCTGCA CTTGCCAGAA TTTCCTGGCT TGACATGATG GAACAGAGCA TTCCCATGCC GTCAAACGAC GACAAACTGC TCCCCTTTTT CCAGGCGATC AGTCATCTTC AGAGCGATCT TCTGGAAATA TCACGCAATA AGGAACAGGA ACTGAGACAG ATTGAACAGG ACGGCGAAAA AACTCTTACC GAAAAAAATA TTCTGCTCAA CGCACAGAAG GAACTGTATA AAAAGCTCAA AAATCAGCTG GAAAAGGAAA AATCGGCACT CACAGCAAGA ATCGCCACGC AGGAGATGGA GCTTACGAGG ATATCTACAG CGGTTGTTGA AAAAACATCG GCCCTTCGCC AGCTTCTCGA CCTGATCACC ACGCTGGATA TCGACCAGAG TCAAAAAAGA ACCATGATCG ATATATGCTC AAACATGATC GACACCGAGC TTATAGAAAA GAGACTCAAT ATTGAGCTTA CGACAACTGA TTCCGAGTTC CTGTCAAAAC TCCAGAAAAA ACACCCTAAC CTGAACCAGC GGGAACTACG AATCTGCCTG CTGATAAAGC TGAATTACAA CACAAGGGAT ATCGCGCGTT CAGTGGGTAT TTCTACCCGG GGAATGGAAA GCATTCGCTA CAGAATGCAC AAAAAAGTAG GGCTGTCAAA ACACCAGTCC CTTAAAAGCT ATCTCACTGA ACTGATCATG CAGAGAGACT GA
Protein sequence | MEITENPNWR TLHKDSNNGY QKVVKRLGND VILYSIETDH DISLDTMNID MLQTVLQDSK IGNKPVCLLW NMKHITNISL TYKKQIANLI YNRRVHFGIV VFFNVEPVCM TLVQTFAAMV PEDMTVLIKQ NYTEAVNTTL AWKEGLPVDT IYESAEEEKY ELQKNEFLAA LARISWLDMM EQSIPMPSND DKLLPFFQAI SHLQSDLLEI SRNKEQELRQ IEQDGEKTLT EKNILLNAQK ELYKKLKNQL EKEKSALTAR IATQEMELTR ISTAVVEKTS ALRQLLDLIT TLDIDQSQKR TMIDICSNMI DTELIEKRLN IELTTTDSEF LSKLQKKHPN LNQRELRICL LIKLNYNTRD IARSVGISTR GMESIRYRMH KKVGLSKHQS LKSYLTELIM QRD
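The sequences above are printed in space-separated blocks of 10 residues. The following is a minimal sketch of how such a block-formatted nucleotide string could be normalized and checked against the Gene Length and GC content fields. The helper names are hypothetical, and how the record rounds its GC percentage is not stated, so no expected value is asserted for the real sequence.

```python
def strip_blocks(block_formatted: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_formatted.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# Small illustrative input, not the gene sequence from the record.
example = strip_blocks("ATGC GGCC")
print(len(example))         # 8
print(gc_percent(example))  # 75.0
```

Applied to the Gene sequence field, `len(strip_blocks(...))` would be expected to equal the Gene Length value, and `gc_percent(...)` should land close to the reported GC content.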