Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2226 |
Symbol | |
ID | 4569448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2550665 |
End bp | 2551948 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766794 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_912648 |
Protein GI | 119358004 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.820995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAACG AAACAGAAAG ACTCAACGCT TATCGGCAGA TGCGATCGAG TATCCGTAAT TCCGGGCAGT ATCTGATAGT GGGTATCGAT ATTGCCAAAG AGAAACATCA TGCATTTTTT GGTACTGCCG CCGGTAAAGT CTTGTGCAAG CAACTGATCT TTACCAATGA CAAAAGCGGT TTTGAACTGC TTATAGCACG AGCCGAGCAG CTCAGATTGC AGCATCAGCT AACGTATGTG GTGTTCGGCA TGGAACCTAC AGCCAATTAT CATAAGCCAT TAGCTGAATA CCTTATTCAA GAGGATGCGA TGGTGGTGCA GGTTTCCGGT ACGGCCGTTG TACGAAACCG GGAGTTGCTC GATAATCGCT GGGATAAGCA TGATCGCAAG GATGCCGCCA ATGTGGCCGA TCTGGTGGGG GCGGGAAAGT GCCAGTTTTA CGACAATCCG CCACAGGCGA TTCATGATCT TCGGGAGTTG CTTAGCCTGA GGCGTCGGTA CCGTAAACTG GAATCCGGAA TCAGAACCCG TATCAGGAAC AACCTGCTTG CGTTGTACTT CCCGGAGCTG GATTGCCGGT TCACTTCCTT CCAGCAGGAT TGCCTGACCA TTATCAAGAC CTGCCTCTCA CCGGCAGCGA TTGCGGCAAT GCCCTTCGAG GAGTTTAAGC GGCGGATCGT CATCCGGCAA AAAGGAAAAC GACAGGAATC GTTTCTGGAG GATATCTGGA ACTCCGCTCA CCACACCGTC GGCAGGCCGG TCGATGAGAC CGTACAATAC ATGGCTGCGC AGTCGGTAAG TCAGCTTGAA CACTTCAGAG CGGAAATCGA CAATCTTGAC CGGCAGATTT TCATGATCGC CTCTTCTTTG CCAGAGTACA AGTATCTGAT CAGTATTCCG GGATTCGGGC CGTTCATCAG CGCAAAACTT CTTGCCACTA TCAATGATCC GGATCGATTC TCCAATGAGG CACAGGTGAT TAAACTGGCC GGTTTTGATC TCTGTGCCTC ACGAAGTGGC AAGCCATCAG GTAAAGCGAT TCCTCAGATA TCCAAGAAAG GCAATGCTGA ACTGCGCTTT GCCTTGGTAC AGGCGGCTAT TGTTGCCACC ACAAGAAATA CCCTGTTCAT CAGGTACCTG AACCAGAAAC TGCAAGGACG AGAACAGGAA AAAGGAATTC TGAAGAAAAT GCGAACCAAG GTAGCCTCAA AACTCCTTGT CATCGCTTGG ACACTGATGA AACAACACGA GTATTTTAAT GGTGAACATT TGAGACTCAC ATAA
|
Protein sequence | MYNETERLNA YRQMRSSIRN SGQYLIVGID IAKEKHHAFF GTAAGKVLCK QLIFTNDKSG FELLIARAEQ LRLQHQLTYV VFGMEPTANY HKPLAEYLIQ EDAMVVQVSG TAVVRNRELL DNRWDKHDRK DAANVADLVG AGKCQFYDNP PQAIHDLREL LSLRRRYRKL ESGIRTRIRN NLLALYFPEL DCRFTSFQQD CLTIIKTCLS PAAIAAMPFE EFKRRIVIRQ KGKRQESFLE DIWNSAHHTV GRPVDETVQY MAAQSVSQLE HFRAEIDNLD RQIFMIASSL PEYKYLISIP GFGPFISAKL LATINDPDRF SNEAQVIKLA GFDLCASRSG KPSGKAIPQI SKKGNAELRF ALVQAAIVAT TRNTLFIRYL NQKLQGREQE KGILKKMRTK VASKLLVIAW TLMKQHEYFN GEHLRLT
|
| |