Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1504 |
Symbol | |
ID | 6375182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1624442 |
End bp | 1626028 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683997 |
Product | Sel1 domain protein repeat-containing protein |
Protein accession | YP_001959911 |
Protein GI | 189500441 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA AAGCCATACT TTCTGCTCTC TTTCTAACCA TACTTACCCT GCATTCTTTT GCGGAATCCG CTCTTGAAAA AACAGCGGAA TTCAAAACCC TTCTTTCAGA TGCCTCACAG GGCAACGAAG AGCATCAATT AAAACTTGGA TTCATCTACG CCAATGGTGA CGGAGTCGAA CAGAATTATA CCAAAGCCGT CAAATGGTAC CGAGTGGCCG CCGATCAGGG CAACATGATC GCTCAGAATA ACCTGGGCCA GCTCTACGCG ACTGGAAAAG GAGTGACGCA AAATCATACA GAAGCCGCCA AATGGTTTCG CATGGCCGCC GAACAGGGCC ATGCTAAGGC ACAGAGCAAT CTCGGCCTGA TTTATTTTTC AAATCAGGGA GTACAACAGG ACTATGTTGA AGCCGCCAAA TGGTTCGGGA TGGCTGCTGA TCAGGGTCAT ACAAGAGCTC AATTTTTCCT CGGAAGAATG TACTATTCTG GTGAGGGTGT AACGAAAAAC CACAAAACCG CAGCCCGATT ATTTCAGCTT GCAGCGAAAA ATAATGACGC GAAAGCACAG CATAATCTCG GCGTGATGTA TGCAGAGGGT CAGGGAGTTG AGCAGAACTA TACAGAAGCA GCCAGGTGGT ATAGGAAAAG TGCGGAACAG GGCGATCCTG ATGCCGCCTT TCATCTGGGC ATGCTCTTCT CTGGAGGCAG AGGCGTCGCA CAAAACAATG CCGAGGCGTT CAAGTGGTTG CATATCGCAT CTGAAAAAGG CCATACTCAG GCACAGTTAC AACTTGCCGG CATGTATGAG ACCGGTACAG GAACCTCTCA GAACAGTGAA GAAGCGCTCA AATGGTATCG TAAAGCCGCC GAAAAAGGTA TCACTCAGGC TCAGAGTAAA CTCGATTCAC TGCTGAGCAA AAAACCGCTT GTAGAGAGTA GTCCCGCCGA GAGCCTTCCT GTGCCCCCTG TGCTTGTCCC GAAAGACAAC GAAATTTCCA CACCAGAGGT TGCAGAGACG GCGCCGGAAG ATCGTGGGGC CCCGGAAAAC AACACCTCGG ATCGGGCACA CTATCTCAGT GCGGCTCAAG AGGGTGACAG TGAGGCCGCA CTCAAACTTG CCGATATGCT CTCAGAAGGT CGCGGTGGCG AACAAAATGA TGCTGAAGCC CGCTCATGGT ATCAGAAGGC TGCTGAAATG GAAACTGGTG AAGCGGCTTT CAAACTTGCT GGCATGATTA TAGAAGGACG CGGTGGAAAA CAGAGCAATT CCGATGGCCG CTCCTGGTAC AAGAAAGCCG CGGCAATGGA ATACAGTGAA GCAGCTCTTC AATTAGGCTT CATGTACCAG GCCGGGAAAA ATGCTCCGAG AAACAACTGG CTCGCGCGTC AATGGTTTCT CGTCGCAGCT GAAAAAGGAT TGCCCCGGGC ACAGTATCAG CTCGGGAACA TATTCGCAGA GGGGCGTGGC GTAGACAAGA ATGTTGAAAA AGCGGCTGAA TGGTACCGAA AAGCCGCCGA ACAGGGTCTG GAAGAAGCAC GCGACCGGCT CAGCAAAATG TCGGGAGACG AACAAACGGC ACGCTGA
|
Protein sequence | MLKKAILSAL FLTILTLHSF AESALEKTAE FKTLLSDASQ GNEEHQLKLG FIYANGDGVE QNYTKAVKWY RVAADQGNMI AQNNLGQLYA TGKGVTQNHT EAAKWFRMAA EQGHAKAQSN LGLIYFSNQG VQQDYVEAAK WFGMAADQGH TRAQFFLGRM YYSGEGVTKN HKTAARLFQL AAKNNDAKAQ HNLGVMYAEG QGVEQNYTEA ARWYRKSAEQ GDPDAAFHLG MLFSGGRGVA QNNAEAFKWL HIASEKGHTQ AQLQLAGMYE TGTGTSQNSE EALKWYRKAA EKGITQAQSK LDSLLSKKPL VESSPAESLP VPPVLVPKDN EISTPEVAET APEDRGAPEN NTSDRAHYLS AAQEGDSEAA LKLADMLSEG RGGEQNDAEA RSWYQKAAEM ETGEAAFKLA GMIIEGRGGK QSNSDGRSWY KKAAAMEYSE AALQLGFMYQ AGKNAPRNNW LARQWFLVAA EKGLPRAQYQ LGNIFAEGRG VDKNVEKAAE WYRKAAEQGL EEARDRLSKM SGDEQTAR
|
| |