Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1746 |
Symbol | |
ID | 4571108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 1972472 |
End bp | 1973812 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766329 |
Product | TolC family type I secretion outer membrane protein |
Protein accession | YP_912187 |
Protein GI | 119357543 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01844] type I secretion outer membrane protein, TolC family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGTT TTTGGTATGT AATTCTTATA GCAGGCAGTT TTGTTTCGCC GCCTCTTTAT GCCGCGCCAG TCACTATATC GGAAGCTTAT CTGAAAGCCC GAGATCATGA TGCCCTGCTG GGTGCGGCAA AGGCTGATAA TCTGGTGTAT AAAGAGGAGG TCGGCAAGGC GAGGGCCGGT TTACGCCCCA GTGTACGTCT GAACGCCTCA CGCGGCCGCA ACGGCACACA GCATGGTTAC ATGGGACTAT ATGAGGCGCC TGATTTTTAT AATACGGTCG TCTATGGTGT GACGCTTCGT CAACCATTGA TTAATCTGTC GAATATAGCT GAATACCAGC AGGCGAAAGC TGTGGCGGCA AAAAGTGATG CTGAATTGCG GAAAGAAGAG GCTGATCTGA TTGTCAGACT TGCCGAGACC TATTGTAACG CATTGTATGC CGAGGATAAC CTTACCTTCA GTCAGGCTCA TATCAAGGCT TCGGAAGAAC AGCTTCAGCA GGCGAAACGA CGGTTTGAAA AAGGGTTCGG AACCATTACC GAAATAAATG AAGCGCAGGC TGACCTCGAT ATGGCGCTTG CTGAGGGACT TGAAATCGTC AACAGCGTCG AGTTCAGCCG CCGTGAGCTG GAGCATCTTA TCGGAACGTA TCCTGATGAA CTCTGCAAGT TGGCGCCTGA AAAACTTGTG TTGGCCCGAC CGACGCCAGG TCGTGTGGAA TCTTGGATTG ATCGAGCTCA TGGAGAGAGT CCTCGAAGCT CGGCAGCGCA CCTTGAGATG CAGATCGCAA AAAAAGAGAT CGAAAAACAG AAGGCAAGCC GTTATCCGAC GATTGATCTT GTTGCCGGGA GAAGCTATTC CGAGAGTGAA AATAATTATT CAATCGGCTC AACCTATGAA ACCTATTCGA TCAGTCTGCA AATGAGCATG CCGATTTATA CCGGTGGTTA TGCCAGTGCA TCGATCCGGC AGGCAAAAGC AAAGTGGCTT AAAGCGGGTG AGCAGTTTTT CTGGCAGGAA CGGAGCATCG AATCTGAAGT ACGCAAGTAC TATAATACGG TTATCAGTAC GATCGCACAG ATCCAGGCCT ATGAACAGGC GGTCAAATCC CGGGAAATTG CTCTCGACGG CACAAAAAAA GGGTTTGGTG CTGGTCTGCG CAGTAATGTG GATGTTCTTG ACGCTCTTCA GAATCTTCTT GCCGCCAGAC GTAATCTTGC GAAGTCGCGA TACCAGTATA TTCTTGCCCG TCTTTCGCTC AAGCAGACCG CAGGAGCATT GTCGCCCGCT GATATTGAGG AGATCAATGG CTGGTTTGCA ACGGCAAAAA CAGCGAAATA G
|
Protein sequence | MKSFWYVILI AGSFVSPPLY AAPVTISEAY LKARDHDALL GAAKADNLVY KEEVGKARAG LRPSVRLNAS RGRNGTQHGY MGLYEAPDFY NTVVYGVTLR QPLINLSNIA EYQQAKAVAA KSDAELRKEE ADLIVRLAET YCNALYAEDN LTFSQAHIKA SEEQLQQAKR RFEKGFGTIT EINEAQADLD MALAEGLEIV NSVEFSRREL EHLIGTYPDE LCKLAPEKLV LARPTPGRVE SWIDRAHGES PRSSAAHLEM QIAKKEIEKQ KASRYPTIDL VAGRSYSESE NNYSIGSTYE TYSISLQMSM PIYTGGYASA SIRQAKAKWL KAGEQFFWQE RSIESEVRKY YNTVISTIAQ IQAYEQAVKS REIALDGTKK GFGAGLRSNV DVLDALQNLL AARRNLAKSR YQYILARLSL KQTAGALSPA DIEEINGWFA TAKTAK
|
| |