Gene Information |
Locus tag | Cpha266_0144 |
Symbol | |
ID | 4568785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 164039 |
End bp | 165280 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639764746 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_910638 |
Protein GI | 119355994 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
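
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that convention) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 164039
end_bp = 165280
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive,
# so the span covered by the gene is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span == gene_length_bp
      and expected_protein_length == protein_length_aa)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.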
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATG AGCAAAAACC AACTTCCCGC AGAGAATTTC TAAGGAGTTC GGCTTTCGGA CTAAGCGCAG TTCTGGGCGG ATTGCCACTC ATACAGGGGT GCTCCAAAGA GTCGGGGCAG GAGAAAGGCG CTCCGAATAT CGGCACATCG GGAAAAAAGC TCAAAAGCAG TTATTTGCCG ATAACCGATG CAACGCCGAT CATTCTCGCC CATGAACTGG GTTTTTACAA AGAGCTTGGC ATCGATTCCG AAAAACCTGT ACTGATAAGG GGTTGGGCGC CAATGGCCGA AGCCTTTATG GCAGGACGTT TCAATCTTAC CCATCTTCTT GCGCCGATCC CTATTTACAT GCGCTACAGC AAAGGCTTTC CCGTGAAAGT GGTTGCCTGG GACCACATCA ACGGTTCAGC GCTGACGGTC GGGAAGGAGA GCGGGATCAA ATCTTGCGCC GATCTGGGCG GCAAGCAGAT TGCCATTCCC TACTGGTATT CCATGCATAA TATCATTCTT CAGATGATTG CCCGAGAGCA CGGTATCGAG CCGGTTATTC AGAGCAAGAC GGCGCCTCTC ACCAGCAAAC AGATGAATCT GTTTGTCATG GCTCCCCCCG ACATGCCTAC AGCCATAGCC TCAAAGGCCA TTGACGGCTA CATTGTTGCC GAACCGTTCA ACGCTGCCGG TGAAGTTCTC GCGGGCGGAA GGATTGTCCG TTTTACCGGC GACGTCTGGA AAAACCATCC ATGCTGTGTT GCCGTCATGA ACGAAAAGGA GCTTGAAGAC AAAGAGTGGT CACACAAGGT GATTCAGGCT CTGGTCAAGG CGGAACTCTG GGCGCTCAAC AATGTGGAAG AGGCCGCGCA TATTCTCTCG AAGGATGGCG CTCAATACCT TCCGCTTCCG GAAAAAATTG TCAAGCGGGC AATGATGAAA TACGATCTGG AGACATACGG AGCAGGAAGC GGAACCGGTG CCATTCAGAA CCCCGACTGG CATGCAAGCA GGATTTCCTA CGAACCTTAT CAATTTGAAT CGGCAACCCG TCATATCGTG GAGATGCTGA AGCAGACCAG GGTGGACGGG GATGCCGCTT TTCTTAAGGC GCTTGATCCC GGTAAAGTAC ATAAGGAATT GATGTACACG GCAGGAGTCG AGGCGGCAGC CGCAGCGCTT GGCGGACTCG GTCAGTTTGC CGGAGTCAAT TCGGCCTCGC CGCTGGTCAG GGAAGAGGTT ATCAAGGTGT AA
Protein sequence | MSNEQKPTSR REFLRSSAFG LSAVLGGLPL IQGCSKESGQ EKGAPNIGTS GKKLKSSYLP ITDATPIILA HELGFYKELG IDSEKPVLIR GWAPMAEAFM AGRFNLTHLL APIPIYMRYS KGFPVKVVAW DHINGSALTV GKESGIKSCA DLGGKQIAIP YWYSMHNIIL QMIAREHGIE PVIQSKTAPL TSKQMNLFVM APPDMPTAIA SKAIDGYIVA EPFNAAGEVL AGGRIVRFTG DVWKNHPCCV AVMNEKELED KEWSHKVIQA LVKAELWALN NVEEAAHILS KDGAQYLPLP EKIVKRAMMK YDLETYGAGS GTGAIQNPDW HASRISYEPY QFESATRHIV EMLKQTRVDG DAAFLKALDP GKVHKELMYT AGVEAAAAAL GGLGQFAGVN SASPLVREEV IKV
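
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the space-separated 10-character blocks are display formatting only, and it uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `translate` and `gc_percent` are hypothetical. As a simplification, the sketch does not apply the table 11 rule that alternative start codons are read as methionine; this gene starts with ATG, so the rule is not needed here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last blocks of the gene sequence listed above.
print(translate("ATGAGCAATG AGCAAAAACC AACTTCCCGC"))  # MSNEQKPTSR
print(translate("ATCAAGGTGT AA"))                      # IKV
```

Passing the full gene sequence to `translate` should give a protein that can be compared with the listed protein sequence after its whitespace is removed with `clean`.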