Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0298 |
Symbol | |
ID | 4570788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 327416 |
End bp | 328633 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639764897 |
Product | hypothetical protein |
Protein accession | YP_910784 |
Protein GI | 119356140 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00012102 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTG GTAATCCTGC AACCGGAGAT GACTTTTTTG GTCGTACTCA GGAACTGAGT GACCTTTGGC GTTATCTGGA ATCCGACCAC ATACGTTTTC CCGGTGTCCG CCGTCTGGGA AAAACCTCCA TTCTGCGACG ACTGGAAAGC GAAGCGGCGG ATCATGGCCT TCTTGCAAGA TGGCTGGATG TTTCGAATAT TGATTCTGCG CCGGGATTTA TTTCGCTTCT GGATCAGGCA TTTCCTGAAA AATCCATCAG GAGCTTTCTC TCCGACAGGG CGCAACAGGC AGGAAGCTGG TTCAATCGTA TCCGCAAAAT TGACGCCACC CTGCCCGATG CCGTCGGTGG CGGGGGATTC GGCATAGAAT TCGGCGGCGA AACAGTTCCG GAGTGGGAAA AGGATGCCGG CTCGCTGCAT TCGAGATTGT GCAATCAGCC CTTGCTGATA CTGCTCGACG AGTTCCCCTG GATGCTGGAA AAGCTCATTC AGCGCGATCG ACAGGAAGCA GAGCAACTCC TCTCATGGTT ACGTATCTGG CGCCAGTCGC AGGGAAGCTG CCGTTTCGTC TTTACCGGCT CAATCGGCCT GCAATCCCTG CTGGAACGCC ACGGCCTTGG TGAAACCATG AATGACTGCT ATCCATACCC ACTGGGGCCA TACAAGCTGT CGGAAGCCCG AGGTCTTTGG CAATATTTCG CTCCGATTGC AGACAAAACC CCCTGGGAAA TTGCGGATCC GGTGATCGAC TACGCCCTGG GCCGCGTCGG CTGGCTGTCG CCCTATTTTT TAAGCCTGTT GCTCGATGAA AGCATGCGGG CGGCAAGGGA GCGGCGACAG GAGTGTCCCG CCGATGCGAC AGGCGAAGCG CGCATTGAAA TCGAGGACGT CGATGACGCC TACGAGAACC TGCTTGCAGA ACGTTCGCGC TTCCATCACT GGGAAAAGCG CCTCAAAAGC GCTCTGGAAC CCGCCGACCT TGATCTCTGT CTCAGCCTGC TTTCTCATCT TTCCCGTCAT GCAGATGGGC TGACCCTGCC TCAGTTAAGC AGCCGGCTGG CACGGCGGGA ACCAGAGCCC GACCTTCGCA CCCGGCGGAT CCAGGATCTT CTGGTGCGTC TGACCGACGA AGGCTATACC AGCCCGCCAG ATAGCAATAA ACGTATCCAG TTTCTTTCAT TCCCTTTACG CGATTGGTGG AACCGCAACC ATGTCTGA
|
Protein sequence | MKLGNPATGD DFFGRTQELS DLWRYLESDH IRFPGVRRLG KTSILRRLES EAADHGLLAR WLDVSNIDSA PGFISLLDQA FPEKSIRSFL SDRAQQAGSW FNRIRKIDAT LPDAVGGGGF GIEFGGETVP EWEKDAGSLH SRLCNQPLLI LLDEFPWMLE KLIQRDRQEA EQLLSWLRIW RQSQGSCRFV FTGSIGLQSL LERHGLGETM NDCYPYPLGP YKLSEARGLW QYFAPIADKT PWEIADPVID YALGRVGWLS PYFLSLLLDE SMRAARERRQ ECPADATGEA RIEIEDVDDA YENLLAERSR FHHWEKRLKS ALEPADLDLC LSLLSHLSRH ADGLTLPQLS SRLARREPEP DLRTRRIQDL LVRLTDEGYT SPPDSNKRIQ FLSFPLRDWW NRNHV
|
| |