Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1443 |
Symbol | |
ID | 4570177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1644568 |
End bp | 1645743 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639766029 |
Product | hypothetical protein |
Protein accession | YP_911895 |
Protein GI | 119357251 |
COG category | [S] Function unknown |
COG ID | [COG4924] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTGGA CAACCCCTGC CGAACTCAAG GCCCAGGTTC AGAAACTCTG GGATCGGGGC CTCATCCTCT CCGCTCTGAC CAACGGCGAA GAACTTTTCC CCCGTCGCCT GACGCTGAAA GGGCCCGACT CCAGAGAGTT AAGCAACTCT TTTGCCGAAG TACGCGACTG GATTATGCGA CTTTCCGGTG CTGCCAAACA GTACCGGATT GTATGGCGCA CGGTCAATCA CCGCATTCTG GGAGCCAATG AACTTCCGGC GGAAATCTGG ATCGATTCAC TCGACGACGC CCTCGGGCTT ATTGGCAAAC AGCGAGAAGC CCGGCAATTT GCCGCCATGG TCACGCTCAC CCGCGACTGG CAACCCGCAC TTCTACCCTG GCTCGCAAAA CGCCCCCTGC GAGCCCTCGA ACTGGCCGAA GAGTGGTCGC ATATTCTCGA AATTGTCGCC TGGCGTCTCA AACATCCCCA CCCGGATATC TACCTGCGCC AGATCGACCT GTCCGGCGTG CACAGCAAGT TCATCGAAGG GCACCGGGGC GTACTTGGGG AGCTCTTCGA CCTCCTCCTT CCACCGGAGG AGATTGACGC AACGGTTACA GGAGCCGGAG GGTTCTCTCT CCGTTACGGC TTTAGGGACA AACCCCTCCG GGTGCGATTC CGAATCCTCG ACCCGAAACT GGCGCTTCTC CCGACGAATA CCGATCAGGA TATCACCCTG ACGCAGGCAA CGTTTGCCCG ACTTGAAATC CCCGTTACAA AAATCTTCAT CACCGAAAAC GAAATCAATT TCCTGGCCTT CCCTGAGGTT CCCGAGGCAA TGGTGATTTT CGGAGCAGGG TATGGTTTTG AGAACATGGC TTCAGTCGAG TGGATGCGTG ACCGTGTTAT CCACTACTGG GGAGACATCG ATACCCACGG TATGGCAATC CTCAACCAGT TACGGAGATT CTTTCCGCAG GCCGCCTCTC TTCTGATGGA CCATGAGACG CTGATGGAGC ACCAACCGCT TTGGGGCGCT GAACCATCTC CCGAAACCGG TACGCTCACG CGCCTGACCG CTCAAGAGGG TGCGCTTTAT GATCAGTTAC GACGAAATGA ACTGGGCAGT CGAATTCGGC TGGAGCAGGA GAAGATCGGG TTTGAGTGGC TGGTTGAGGC GTTAAAAAAG CTCTAA
|
Protein sequence | MNWTTPAELK AQVQKLWDRG LILSALTNGE ELFPRRLTLK GPDSRELSNS FAEVRDWIMR LSGAAKQYRI VWRTVNHRIL GANELPAEIW IDSLDDALGL IGKQREARQF AAMVTLTRDW QPALLPWLAK RPLRALELAE EWSHILEIVA WRLKHPHPDI YLRQIDLSGV HSKFIEGHRG VLGELFDLLL PPEEIDATVT GAGGFSLRYG FRDKPLRVRF RILDPKLALL PTNTDQDITL TQATFARLEI PVTKIFITEN EINFLAFPEV PEAMVIFGAG YGFENMASVE WMRDRVIHYW GDIDTHGMAI LNQLRRFFPQ AASLLMDHET LMEHQPLWGA EPSPETGTLT RLTAQEGALY DQLRRNELGS RIRLEQEKIG FEWLVEALKK L
|
| |