Gene Information |
Locus tag | Cphamn1_1625 |
Symbol | |
ID | 6375305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1752411 |
End bp | 1754057 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642684114 |
Product | protein of unknown function DUF814 |
Protein accession | YP_001960026 |
Protein GI | 189500556 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
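The coordinate and length fields above are mutually consistent, which a little arithmetic confirms. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1752411
end_bp = 1754057

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1647 548
```

Both values match the "Gene Length" and "Protein Length" fields of the record.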
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCGCA ACTATTTTAC GCTCTACCAC CTCGCCCGCG AACTTCATGA ACTGGTCGCC GGGGGTTATA TTTTCGAGAT CTATTCGCAA CAGAAAAGCG AAATCACCAT AAGCCTGATA ACCAACGAGG GAAACCATCT GCAACTCATC GTAGTCACCG GTCATCCCAG GCTCTGCATC TATACCAGAG AAGGGCTGCA AAGAAAGCAG CGAAATACTG CCGGTCTGAT GCCGGAGCTC AATGAAAAAA AAATTTCGTG CATCACCATA GATCCGTGTG ACAGGATTAT CAGAATTGAA ACTGAAGACA ACTACGCCAT TGTGCTTCAA CTGTTCAGCG CAAAAACAAA TATCTTTCTG GAGCATAACG GCAATATTGC AGGCAGTTTC AAAAAAGGGA TAGCTCGTTC AGGTTCCGGC AGTCAGGAAG CCATCCTTCG ACCGGATATT CTGCGTACTC TCGAAAGGAT GGTTCAAAAC CGCCGTTACT TCATTGAGTC CTTTTCCGGA ACAGATCGAA AGGACACAGA AATACCCGCG CAACTACTCC CCGGTTTTGA TCGTGGGCTC ATAAAAGAAC TGCTCGGGCG ATGCGGAAAA AATCGTTCTC CTGAAACTAT ACACGAACAA CTCTCCACGC TCTTTTACGA ACTCATTGAT CCCTGCCCGT CAGTACTTTT CACAAATGAA AACGGCCCGC TCTTCTCGAT ACTGCAGCAA AAGAAAAAAG AGTGCGTAGA ATTTGACAGC GTTATTGAAG GATTGAACTT CTATAGCTCG AAAACAAGAC AACACCGTAA AACCGTTGAG CTTGTTCATC AGATTGAAGG GAAACTGCTT CAAAAAAGAA AAAAAATCGA CAGTGAACTA CAGCACTTTC AACCCGAATT GCTTCGGCGA CAGTTTGATG CATATCAACG ATACGGGCAC CTTCTCATGG CGAATCTCTC CCTGGCTGAC TGCAGGAAAG AGAGTATAAC AGTTCCCGAT ATTTTTGATC CTTCCGCCCT CCCCGTAACC ATAGCCCTCA AGCCGGAACT CAACCTGCAG GAAAACGCGG CTCTCTGGTT CCGGAAAGCA TCCAGAACAC GAGAAAAACT TCAAGGCGGC AGTCGAAGAA TAGCCGCTGT TGCAGAAGAG AAGCAGGCAC TTGAAAAGCT CATTACAGAA CTCGGCAAAC TGGCAAAACC CTCGGAAGTT ACACGCTTTG AAAAAAACAA CAGCGCTCTC CTGAAAAAAC TCGGATGTGA AAGCAATTCA GGAAAAACCG GAAAGAGACT ACCGTTCCGC AGTTTTGAGC TTTCTGAAAA AGCGGCTCTG TACGTCGGCA AAAACGCTGA AAACAACGAA AAGCTTACCT TTACCTTTGC CAGACCTCAT GACATCTGGC TGCATGTACG AGGAGCAGCC GGTTCGCACT GTATTCTTCG CGGAACGACG ATACAAAACA TCTCGGCAAT ACGAACTGCG GCCGAAATTG CCGCGTTTTA TTCTTCCTCC CGTCATGCAG AACTAGTCCC TGTCGTCTAC ACCGAAAAAA AATATGTTCG ACGCGCAAAA AATATGCCTC CGGGAAAGGT CGTCGTAGAA AAAGAGCAGG TGATACTGGT ACATCCTTCA CGTTTTTTCG ACGCTGCAGA AAAGTAA
Protein sequence | MLRNYFTLYH LARELHELVA GGYIFEIYSQ QKSEITISLI TNEGNHLQLI VVTGHPRLCI YTREGLQRKQ RNTAGLMPEL NEKKISCITI DPCDRIIRIE TEDNYAIVLQ LFSAKTNIFL EHNGNIAGSF KKGIARSGSG SQEAILRPDI LRTLERMVQN RRYFIESFSG TDRKDTEIPA QLLPGFDRGL IKELLGRCGK NRSPETIHEQ LSTLFYELID PCPSVLFTNE NGPLFSILQQ KKKECVEFDS VIEGLNFYSS KTRQHRKTVE LVHQIEGKLL QKRKKIDSEL QHFQPELLRR QFDAYQRYGH LLMANLSLAD CRKESITVPD IFDPSALPVT IALKPELNLQ ENAALWFRKA SRTREKLQGG SRRIAAVAEE KQALEKLITE LGKLAKPSEV TRFEKNNSAL LKKLGCESNS GKTGKRLPFR SFELSEKAAL YVGKNAENNE KLTFTFARPH DIWLHVRGAA GSHCILRGTT IQNISAIRTA AEIAAFYSSS RHAELVPVVY TEKKYVRRAK NMPPGKVVVE KEQVILVHPS RFFDAAEK
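The sequence fields can be cross-checked against the "GC content" and "Protein sequence" values with a short script. This is a minimal sketch, not the database's own method. It assumes the space-separated 10-character blocks are only a display convention, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGCTTCGCA ACTATTTTAC GCTCTACCAC"
print(translate(first_blocks))  # prints: MLRNYFTLYH
```

Passing the full "Gene sequence" value to `translate` and `gc_percent` should allow a comparison with the "Protein sequence" and "GC content" fields. The output for the first three blocks matches the first block of the protein sequence.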