Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1144 |
Symbol | |
ID | 6374819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 1227068 |
End bp | 1228084 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642683646 |
Product | ApbE family lipoprotein |
Protein accession | YP_001959563 |
Protein GI | 189500093 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.301499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.219283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCAC GCAATATCTC ATTCCCGGCA CTCCTCATTC TCTTCCTCAC TCTCCTCGCC TGCTCCGTTG AGAACAACGA TTTGCGGATA TACGAGCAGG AGAAGGTGAT GATGGGGACG ATCATGAAGA TCAAGGTTGT GACGAATGGT GAGGAGGAGA AAACAAAGGA GGCGTTCGAT GCTGCGTTTC GGGAGATTGC GGGGCTGGAG TCGGAGTTGA GTGAGTGGCA GCCTGCGAGT CCGGTTTCGG CGGTGAACCG CGAGGCGGGC GTGAAGAGCG TGGAGGTTCC GAATGCGGTC GTGACCGTGA CGGAAAAGGC GCTGGACATT GCTGATATAA CGGATGGTGC GTTCGATATC ACGTTCAAGC CGATCGGCAG GCTTTGGAAT GTGAAAGAGC GGACTTCACC GCCTCCGGCT GACAGCATCA GGATAGCGCT GAAACTGGTA GACTACAAAC AGATCGAGCT CGATACGCTG CAGAACATTC TTTTTCTGAA AAAGAAAGGG ATGGAGATCG GGTTTGGCGG GATCGCGAAA GGGTATGCCG CCTACCGGGC GGGCGAGGTG CTGGAAAAGC ATGGTATATA TAATTATATC ATCAATGCCG GTGGAGATCT CTACGTCAAG GGAAACAAGG GTGAACAGCG CTGGACATCA GGAATACAGA ACCCTGACAA GGAACAGACA AAACCGGTAC TTGCTTTCAG CGTCATCAAA CCGTGCGGCA TCGCGACGAG CGGCGATTAT GAGAGTTATT TCATCTATGA AGGAACCCGC TATCACCACA TCATCGACCT CAGCACAGGC TATCCGGCAA GAGGAGTGAA AAGCGTGACC GTCTTTTCCG AAGATCCGGC AAAAGCCGAC GCCTACGCCA CGGCGTTCTT CATTACAGGC TATGAAAAAG CCCTGCGCAT TGTCGAACAG GATCCCTCGC TCGCCTTTAT TATTATAGAC GATAAGGACA AGCTGTTCAA AAGCCCGAAT ATCGGTGCGT TTATCGAGGA GCTGTAA
|
Protein sequence | MQPRNISFPA LLILFLTLLA CSVENNDLRI YEQEKVMMGT IMKIKVVTNG EEEKTKEAFD AAFREIAGLE SELSEWQPAS PVSAVNREAG VKSVEVPNAV VTVTEKALDI ADITDGAFDI TFKPIGRLWN VKERTSPPPA DSIRIALKLV DYKQIELDTL QNILFLKKKG MEIGFGGIAK GYAAYRAGEV LEKHGIYNYI INAGGDLYVK GNKGEQRWTS GIQNPDKEQT KPVLAFSVIK PCGIATSGDY ESYFIYEGTR YHHIIDLSTG YPARGVKSVT VFSEDPAKAD AYATAFFITG YEKALRIVEQ DPSLAFIIID DKDKLFKSPN IGAFIEEL
|
| |