Gene Information |
Locus tag | Cpha266_2378 |
Symbol | |
ID | 4569288 |
Type | CDS |
Spliced gene | No |
Pseudogene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2764341 |
End bp | 2765369 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639766936 |
Product | hypothetical protein |
Protein accession | YP_912790 |
Protein GI | 119358146 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
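The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative.

```python
# Values copied from the Gene Information table.
START_BP = 2764341
END_BP = 2765369
REPORTED_GENE_LENGTH_BP = 1029
REPORTED_PROTEIN_LENGTH_AA = 342


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming whole codons and one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon encodes no residue


computed_gene_length = gene_length_bp(START_BP, END_BP)
computed_protein_length = protein_length_aa(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH_BP)        # True
print(computed_protein_length == REPORTED_PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 2765369 - 2764341 + 1 gives 1029 bp, which is 343 codons, or 342 residues plus a stop codon.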
Plasmid Coverage Information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGCC CGCCTTCTGG CGGACCTGTT GACGCCGCAC CGCTGCAGGT TATTTTTTCC GATCCCCTGA CAGGAGCGAC AAAAACAAGT CCGAAAATCA TACGACTCTC TTTCACTCAT GCCATTGCCC TGAAAGAGTT GCAGAAATCG ATTTTTTTCA CTCCCCGTAT CAGCAACTAC GATATTTTTG TTCACGGACG GGAAGCGGAG ATAAGAATTT ATGAACCGCT TCAGGATGAC CGCACCTACT CGGTCATGAT CCGACGACAG CTCAGCGACT ATCGCGGCGC GACACTTTCG ACACCCTGGA CGCTCTCGTT TGCCACCGGG CCGGTTATCG ACAACGGATC GCTCTCCGGC AGGGTGTTCA ACGCCGATCT ATCGCCTGCC GAAAATGCAC TGATCATGCT TTACAGAAAA GCGGCTTCAG GTACGGATCC TGATTACGTC GTACAGACAG ATCAGACCGG AGCATTCAGG CTTGACCATA TCCGTACAGG GCTCTATCGC CTCATTGCCG TGCATGACCG CAATCTCAAC CTGCACTTCG ACCCGGCGAC AGAAGAAACT GCCATTCCGG GCAGCGAGTT CGTTGCAGCG GGAACCTCAT CGCTCCTCAT GCGCTTTGCT CCGCAAAAAG ACGCGAAACC GGCACTCTTC GCTGATACCG AAAAAAACAG GACGGAAGAA ACCGGTACCA TCAAAGGAAG CTGCCTTGCC GATGCCAGAT GGCTCATCAT CGAGGCTAAA GCGCTCAACA GCTCAAAAGT ATGGAGAATA ACCGCTGCTC CGACTCAAAA AGGTCTGTTC CGTTTTACCT TTGCCAGTCT GCCTGAAGGC GACTATACCA TCAGCGCCTT TATCCCCTCG GGAAACAGAA AACCGGACGC TCATATAACA TGGAGTCCCG GCAAGCTTGA TCCGTTCACC CCCTCAGACC CTTTCGGTCT GTATCCAAAG GCTGTTCACG TCCGACCAGG ATGGGACGCC GAAACCATCG ATTTCACCAT CAGCGCTCCT GATCGCTGA
|
Protein sequence | MDRPPSGGPV DAAPLQVIFS DPLTGATKTS PKIIRLSFTH AIALKELQKS IFFTPRISNY DIFVHGREAE IRIYEPLQDD RTYSVMIRRQ LSDYRGATLS TPWTLSFATG PVIDNGSLSG RVFNADLSPA ENALIMLYRK AASGTDPDYV VQTDQTGAFR LDHIRTGLYR LIAVHDRNLN LHFDPATEET AIPGSEFVAA GTSSLLMRFA PQKDAKPALF ADTEKNRTEE TGTIKGSCLA DARWLIIEAK ALNSSKVWRI TAAPTQKGLF RFTFASLPEG DYTISAFIPS GNRKPDAHIT WSPGKLDPFT PSDPFGLYPK AVHVRPGWDA ETIDFTISAP DR
|
| |
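The gene and protein sequences above can be checked against each other, and against the reported GC content, with a short script. This is a minimal sketch and not the database's own method: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ only in which codons may act as start codons). Only the first 30 and last 9 bases of the gene sequence are used here to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not converted to M here;
    this gene starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_30_bases = "ATGGATCGCC CGCCTTCTGG CGGACCTGTT"
last_9_bases = "GATCGCTGA"

print(translate(first_30_bases))     # MDRPPSGGPV
print(translate(last_9_bases))       # DR
print(gc_content("ATGGATCGCC"))      # 0.6
```

The two translated fragments match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the reported 53%.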