Gene Information |
Locus tag | Acid345_4449 |
Symbol | |
ID | 4070932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5282045 |
End bp | 5283271 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637986488 |
Product | hypothetical protein |
Protein accession | YP_593523 |
Protein GI | 94971475 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
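The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes one stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 5282045, 5283271
length_bp = gene_length_bp(start_bp, end_bp)
protein_aa = expected_protein_length_aa(length_bp)
print(length_bp, protein_aa)  # 1227 408
```

Both results agree with the Gene Length (1227 bp) and Protein Length (408 aa) fields.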
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAGTG ATTTCCAAAA CCGTTTGCGC AAGAACCTGG AACATCCCGT TCCGGCAGAG CACCCTGCTC CCGATGTGCT GAATGCGTAC ATCGAGAAGG TGCTCACCGG GGCAGAGGAG CGCCAGGTAA CGGAGCATCT CGCGGCCTGC AGGGAGTGCC GCGAGGTGGT GTTTCTTGCG ACGGGCGCAG CTGAGGAGCC CGTGCAGCCG GTGGTCGCCG CTGTTCCAGT GAAGCGGGTG CGCTGGTGGG CGTGGGCGAT GCCGATTGTT GCGGTAGTCG TAATTGCCAT CTTCATTGGT CAACCGTCGC TGCTCAGAAG CAAGCACACT GTAGAAATGG CGCAGGCGCG GCACGATGAA CCGCAAGTTC CGGCTTCGAC GACAGTTGCG ACAAAGACAG AGGTTGCGCC AGCGAACAAA GAAGAAGATA AAGCCAAGTC TCTCGAGACC TATCAACCAC GAAAGCGGAT TGTGCCGGCA CAGCCGCTTG GTGGACTTGC GACATCCCCG GCCGCGCCAC CATCGCCAGC TCCGACTGAG CGGCGTGAGT TGGCAAAAGA GAAAGACGTA AACGGACCAG TGCAAAACGA AATGGCTCGT CGAGCGGGGG TGGGAGGAAG GATCGCAGAG GAAGCCAAGC CGGCGGTCGC GATGAGTGCA CCCGCGCCTG CTGCGGCAGA CAAAGTCCAG ACCTTAAAAC AATCGGAGGG AGCTGCGGCG GCAAATTTGC AGCAAGATTC GAAGCTGCGG GATGACCGAT ACGCGTACAG CACGGAGTCA ACCAACGGCG CATCCCTGTC GGCGAATGGC GCAAGCCGCA GCAAGGCCGC CAATCTCGAC ACCAAGGCCG GACAGGCGTT TGGGGGCTTC GCGAAATCCG CGGCGAAGAA AGTAGATGCA GCGACGCAAT GGCGCGTCAC CACCACGGGC GGGCTCGAGC ACGCCCTGCT CGGCGAATGG AAGCCCGCGC TTGGCGACTC CAGTTCGCAC TTTCTTGCTG TCACGACCTT CGGGGAAAAC GTGTGGGCCG GCGGGAAGAA CCTCGCCCTC TATCACTCGC CTGACAACGG AGTGACCTGG GAGCGGCAGA CGCTGCGAGT GCGGATCGCC GCGGACATAA CGCAGATCCA GTTCACCTCG GCAAACGACG GCGTGTTGAC CACGAACTTA GGGACGTCGT TCGTGACGCA TGATGGCGGG AAGAGTTGGG CACAGGAGAA GCCCTAG
|
Protein sequence | MSSDFQNRLR KNLEHPVPAE HPAPDVLNAY IEKVLTGAEE RQVTEHLAAC RECREVVFLA TGAAEEPVQP VVAAVPVKRV RWWAWAMPIV AVVVIAIFIG QPSLLRSKHT VEMAQARHDE PQVPASTTVA TKTEVAPANK EEDKAKSLET YQPRKRIVPA QPLGGLATSP AAPPSPAPTE RRELAKEKDV NGPVQNEMAR RAGVGGRIAE EAKPAVAMSA PAPAAADKVQ TLKQSEGAAA ANLQQDSKLR DDRYAYSTES TNGASLSANG ASRSKAANLD TKAGQAFGGF AKSAAKKVDA ATQWRVTTTG GLEHALLGEW KPALGDSSSH FLAVTTFGEN VWAGGKNLAL YHSPDNGVTW ERQTLRVRIA ADITQIQFTS ANDGVLTTNL GTSFVTHDGG KSWAQEKP
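The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it builds the standard codon table (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons), strips the 10-base grouping spaces, and translates complete codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base; stop codons appear as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 18 bases of the gene sequence in the record above.
print(translate("ATGAGTAGTG ATTTCCAAAA CCGTTTGCGC"))  # MSSDFQNRLR
print(translate("GCACAGGAGAAGCCCTAG"))                # AQEKP*
```

The translated prefix and suffix match the start (MSSDFQNRLR) and end (AQEKP) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.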
|
| |