Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0557 |
Symbol | |
ID | 4897880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 583964 |
End bp | 585349 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640111142 |
Product | type II and III secretion system protein |
Protein accession | YP_001042445 |
Protein GI | 126461331 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4964] Flp pilus assembly protein, secretin CpaC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGA AGGACATATT CCGGGCCGGG CTGGCGGTCC TCGCGCTCGC GGCGGGCATG GCGCTCACGC CCGCGGCGGC GCAGACCCTG CGCGTCATGG AGGGCTCGGC CTCGGGGGCG CTCAATGTGC CGATGAACCG CGCGGTCGTG GTGGAAAGTG ACCGGCCCTT CGCCGAGCTC TCCATCGCCA ATCCCGGCAT CGCCGACATC TCCACCCTGT CCGAGACCTC GATCTATGTC CTCGGCAAGG CGCCCGGCCG CACCACGCTC ACGCTTCTCG GGGCCGACGG GCGGCTGATC TCGAACGTGG ACGTGCATGT CACGCCCGAC GTGGCCGAGT TCAAGGAGCG GCTGCGCCAG ATCCTGCCCG GCGAGAATAT CGAAGTCCGC ACCGCGAACG ACGGGATCGT GCTGTCCGGC ACCGTCACCA GCACGGCCAA GCTCGACCGC GCGCTCGATC TCGCCAGCCG GTACGCGCCC GACCGCGTGT CGAACCTGAT GAGCGTCGGC GGCACGCAGC AGGTCATGCT GAAGGTCCGC TTCGCCGAGA TGCAGCGCTC GGTCGCCAAA AGCCTCGGCT CGTCGATCTA CGCCCGCAAT GGATCGGGCT CGGTGATCGG CGGCACGGGG ACGCTGCTGC AGAACGGCAT TCCGCCGTCG GGAGCTCCGA TCAGCACCTC AGGGAATGGC GCCCTGAGCC TCGGCTTCTC GGTCGGCGCG CTCGAGTTTC AGGTGCTGCT CGAGGCTCTC GAATCGAAGG GGGTGGTGCG GACGCTGGCC GAGCCGAACC TCACCGCCCT CTCGGGGCAG GAGGCAAAGT TTCTCGCCGG CGGCGAATAT CCGATCCCGG TGGCCAGCGG CGACGAGAAG ATCTCGATCG AATACAAGCC CTTCGGCGTC GAGCTGAACT TCACCCCGGT GGTGGTGGAC GGCAACCAGA TCAACCTGAT GATCAATGCC GCCGTCTCCT CGATCGACAA TACCGTGACG CTGGAAAGCT CGGGCTTCAC CATCAACGCC TTCAAGCGGC GCGAGACCTC GACGACCGTC GAGATGCGCG ACGGCGAGAG CTTCGCCATC GCCGGGCTCC TGCAGGACGA TTTCCGCAAC CTCAACGGCC AGGTGCCGTG GCTGGGCGAC ATCCCGATCC TCGGCGCCCT CTTCCGCAGC GCCGAATACC AGCGCTCGCA GAGCGAGCTC GTCATCATCG TCACGCCGCA TCTCGTGACC CCCACCCGGG GCGAGGCGCT GGCGCTGCCC ACCGACCGGG TGAAACTGCC CTCTGAAAAG GATCTCTTCC TGTTCGGCCG CGTCACCGGC CGGGGGGCGG CGGCCGAGGT CGCCCGGCAG GATTTCAGCG GCTCCTACGG CTATGTGATG GAGTGA
|
Protein sequence | MSTKDIFRAG LAVLALAAGM ALTPAAAQTL RVMEGSASGA LNVPMNRAVV VESDRPFAEL SIANPGIADI STLSETSIYV LGKAPGRTTL TLLGADGRLI SNVDVHVTPD VAEFKERLRQ ILPGENIEVR TANDGIVLSG TVTSTAKLDR ALDLASRYAP DRVSNLMSVG GTQQVMLKVR FAEMQRSVAK SLGSSIYARN GSGSVIGGTG TLLQNGIPPS GAPISTSGNG ALSLGFSVGA LEFQVLLEAL ESKGVVRTLA EPNLTALSGQ EAKFLAGGEY PIPVASGDEK ISIEYKPFGV ELNFTPVVVD GNQINLMINA AVSSIDNTVT LESSGFTINA FKRRETSTTV EMRDGESFAI AGLLQDDFRN LNGQVPWLGD IPILGALFRS AEYQRSQSEL VIIVTPHLVT PTRGEALALP TDRVKLPSEK DLFLFGRVTG RGAAAEVARQ DFSGSYGYVM E
|
| |