Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2667 |
Symbol | |
ID | 4568771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 3060341 |
End bp | 3061615 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639767233 |
Product | hypothetical protein |
Protein accession | YP_913075 |
Protein GI | 119358431 |
COG category | [S] Function unknown |
COG ID | [COG4198] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0707942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGACA TTATGCCTTT CAGGGCACTG CACTACAAGC AGGAAACCAT GAACCACGCC GAAAAGGTTC TTTGTCCGCC CTACGACGTT ATCTCTCCAG CCCGCCAGCA GGAGCTCTAC GAACTCTCGC CCTGTAACGC CGTCAGGCTC GAACTCCCGC TTGAGTCCGA TCCCTACCAG GCAGCCATGG AACGACTGCT CGAATGGAGC CGCATCGGCG AACTGGTAAG GGACGCTGAA CCGGCGATCT ACCCGTACAT GCAGACCTTC GAAGACGCCG AAGGCGCCGT CTACAACCGA ACCGGCTTTT TTTGCGCCAT GCGCCTCCAT GATTTTGTCG AACGCAAGGT TCTGCCTCAC GAAAAAACCC TCTCGGGGCC AAAAGCAGAC CGTCTCAACC TCTTCAGAAA GACAAAAACC AATATCAGCC CCGTTTTCGG GATCTATGCC GATCCCGATA AAGCAGCCGA TCGTCAAATA GCCGCCTTTG CATCCAGCAA CCCCCCCCTG ATCGACGCCG TGTTTCAGGA TGTCAGAAAC CGGATGTGGA AGATAACCGA CAAAGAGATC GTCGAAAAGG TGCGGGCGGG GCTTCAGCAT CGCACCGTTT TCATCGCCGA CGGCCACCAC CGCTACGAAA CAGGGCTCAA CTACCGAAAC GAACGAGCGG CAATGAACCC CGCACACACC GGCAATGAGG CATACAACTT TATTCTGGCC TGCCTTGCCA ACATGCATGA CGAAGGGCTG ATTATTTTCC CGATCCATCG ACTCCTGCAC AGCCTTGAGC AGTTTGACGC CCTGGCGTTC CGCCGACAGC TTGAGCAATA CTTCGTCGTC ACGGAACTTC CCCACAGAGA GGCCCTCAAA CGCTACCTGG CCGACGAACC TTCGATCTAC GCATACGGCG TGGTAACGCG GGAATATATG CTCGGCATCG TCCTCAAAGG GAGCCCCGAG GAGCTCCTTG ACCACGCAAC TCCCGACTCC CTCCGGAAGC TTGGTCTGGT GGCCCTCCAC GAGATCGTGC TCGGCAGGCT GCTTGGCATA ACCCCCGAAG CCATGGCGAA ACAGAGCAAC ATCAAGTACA TCAAGGATGA AGCCGAACTG TATGCTGCCG TCGAAAACGG AGCCGCGCAG GCCGGGATCG TCGTCAAGCC AACAACGGTT CAACAGGTCG TAGCCGTGTC GGAATCAGGA GAGGTCATGC CTCAGAAATC AACGTTTTTT TATCCGAAAA TAATGACAGG ACTTGTCTTC AACCCGCTCG ACTGA
|
Protein sequence | MPDIMPFRAL HYKQETMNHA EKVLCPPYDV ISPARQQELY ELSPCNAVRL ELPLESDPYQ AAMERLLEWS RIGELVRDAE PAIYPYMQTF EDAEGAVYNR TGFFCAMRLH DFVERKVLPH EKTLSGPKAD RLNLFRKTKT NISPVFGIYA DPDKAADRQI AAFASSNPPL IDAVFQDVRN RMWKITDKEI VEKVRAGLQH RTVFIADGHH RYETGLNYRN ERAAMNPAHT GNEAYNFILA CLANMHDEGL IIFPIHRLLH SLEQFDALAF RRQLEQYFVV TELPHREALK RYLADEPSIY AYGVVTREYM LGIVLKGSPE ELLDHATPDS LRKLGLVALH EIVLGRLLGI TPEAMAKQSN IKYIKDEAEL YAAVENGAAQ AGIVVKPTTV QQVVAVSESG EVMPQKSTFF YPKIMTGLVF NPLD
|
| |