Gene Information |
Locus tag | Cpha266_1472 |
Symbol | |
ID | 4570242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1670297 |
End bp | 1671316 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639766058 |
Product | hypothetical protein |
Protein accession | YP_911923 |
Protein GI | 119357279 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
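The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1670297, 1671316)
print(length, protein_length(length))  # 1020 339
```

Under these assumptions the result agrees with the listed gene length (1020 bp) and protein length (339 aa).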
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAACG GAAATACTTA TCACGCCCTT CCCTGCCCCT GCGGAAGCGG AAAACCGTTT GACGAGTGCT GCTTCAGACC GAACAACCAG AAAGCCGGTA ATCCAATGGA TGAGGTTATG CGGGCGATTC ATGAGGCCCA ACAAAGCCGG GAGTTCAGCT CCATCAAGGA TGCCGAACAG TTTATGCATG AATTCATGAT CCAGCAGAAC ACAGCGCCTC GCGATGAATT TGCAGGATTA TCGCCAACAG AAATGCGCAG TATCCTCTCG ACCCCTTTTG ATGCCGGTGA AGTTGCAACC TTTGTAGACG TTCTTCCGCA GGAGCCGGAC TGCCCCGCAG CAGCACTGTT CAAAGCGCTT GCTGACGCGA TCGGCGACAA GGGACTCAAG CCCACAGCAA CAGGCAATCT GCCAAGGAAT ACCGTTCGCG AAATTTCAAT AGCCACCAAC GGCACCGATA CCTTCGGGAC CGAACACTCA CGATACCAGC TACAGAAAGA GGCCGATTAT ATCGATCTGC ACATAACACG ACTTATAGCA GGTCTTGCAG GCCTCATCAG AAAATACAAG GGCCGGTTTA TCCTGACGAA AAAGTGCCGG GCTCTCCTTG ACAGGCATGG CATGGCCGGA ATCTGGCCGG AGCTTTTCAG GGCTTATGCA GAAAAGTATA ACTGGGCCTA TAGTGACGGA TACGGTGAGC TCTATTTCAT GCAGCGTTCG TTTCTCTATA CGCTCTACCT GCTGCACCGG TTCGGCTCGG AAGAACGTGA CGGACTGTTT TATGCAAATG CCTATTTCAA AGCGTTTCCG ACATTTTACG ACGAGATTCC TCTGCGGTAC GCCACGAGTT CGGAAGAGAT CGCAATATCC TCGTATCTGC ACCGGGTAAT TGACCGGTTC GCAGGATTCT TCGGACTTGC ACATGTCGAG AAAACCGATT GGAAGTTTGG CGTCGACAGA ATCTACAAGG TTCGGGCACT GCCGCTTCTG GAAGATGCGA TTCGATTTCA TGTGCCATAA
|
Protein sequence | MINGNTYHAL PCPCGSGKPF DECCFRPNNQ KAGNPMDEVM RAIHEAQQSR EFSSIKDAEQ FMHEFMIQQN TAPRDEFAGL SPTEMRSILS TPFDAGEVAT FVDVLPQEPD CPAAALFKAL ADAIGDKGLK PTATGNLPRN TVREISIATN GTDTFGTEHS RYQLQKEADY IDLHITRLIA GLAGLIRKYK GRFILTKKCR ALLDRHGMAG IWPELFRAYA EKYNWAYSDG YGELYFMQRS FLYTLYLLHR FGSEERDGLF YANAYFKAFP TFYDEIPLRY ATSSEEIAIS SYLHRVIDRF AGFFGLAHVE KTDWKFGVDR IYKVRALPLL EDAIRFHVP
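The protein sequence can be related to the gene sequence by translating it codon by codon. The sketch below is illustrative only and uses hypothetical helper names. It builds the codon table from the compact NCBI-style amino acid string; translation table 11 assigns the same residue to each codon as the standard code and differs only in which codons may initiate translation, which does not matter here because this gene starts with ATG. It also includes a simple GC fraction helper; how the record's GC content figure was rounded is not stated, so no expected value is claimed for the full sequence.

```python
from itertools import product

BASES = "TCAG"
# Residues in NCBI order (first base varies slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Strip the whitespace that groups the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGATCAACG GAAATACTTA TCACGCCCTT"))  # MINGNTYHAL
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the record is internally consistent; the first ten residues shown here match.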