Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4202 |
Symbol | |
ID | 6411886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4504687 |
End bp | 4505958 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714084 |
Product | putative pilus assembly protein CpaE |
Protein accession | YP_001993173 |
Protein GI | 192292568 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4963] Flp pilus assembly protein, ATPase CpaE |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.195195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAGTT ACTCGCGCCA AAATCAGGAC GAACCGGCGA CGGCGGCACC GGATGTAGGG CCGGAGGAGC ACATCGCGCC AGCGCCGCGC GTCTCGGTTC AAGCGTTCTG CGAGACCGTC GAGACCGCCG CCGCGGTGCA GGCCGCGGGC GAAGACCGTC GTCTCGCAAA AGCACACCTG AAGATCCAAA TGGGCGGCGT GGTTGCGGCA ATCGAAGCCT ACCGATCCGC CCCGACGCCG AACGTCATCG TGCTCGAGAC CGACGCGCGC AGCGACGTGC TGGCAGGTCT CGACCAGCTC GCGACGGTGT GCGATCCGGG GACGCGGGTG ATCGTGATCG GCAAGGTCAA CGACGTCACG TTGTACCGCG AGTTGGTCCG TCGCGGCGTC AGTGATTACG CGATCGCGCC GGTGCAGCCG ATCGACGTGG TCCGCTCGAT CTGCAATCTG TTCTCGGCAC CCGAAGCCAA AGCCGTCGGC CGCATCATCG CTGTGGTCGG TGCGAAGGGA GGCGTTGGCG CATCCACGGT CGCCCACAAC GTGGCCTGGG CCATCGCTCG TGACCTGGCG CTGGACTCGG TGGTGGCCGA TCTCGATCTC GCGTTCGGCA CCGCCGGGCT CGACTACAAT CAGGACCCGC CGCAGGGGAT CGCCGAGGCG GTGTTTGCCC CCGACCGCGT CGATACCGCC TTCGTCGATC GGCTGCTGTC GAAATGTACC GACCACCTCA GCCTGCTGGC TGCGCCGGCG ACACTGGACC GCGTCTACGA CTTTGGCGCC GATGCATTCG ACTCGGTCTT CGACACGCTC CGCACCACGA TGCCGTGCAT TGTTCTCGAC GTGCCGCATC AATGGACCGG GTGGGCGAAA CGGTCCCTCA TCACCGCCGA TGATATCCTG ATCGTCGCGA CGCCCGACCT CGCCAACCTG CGCAACACCA AAAACCTGAT CGACCTCTTG AAGGGGGCCC GGCCGAACGA TCGTCCACCG CTGTACTGTC TCAATCAGGT CGGGGTGCCG AAGCGGCCCG AAATCTCCAC GAACGAGTTC GCCAAGGCGA TCGAGAGCCA GCCGATCGTC TCGATTCCGT TCGAGCCTCA GATCTTCGGT GCGGCCGCCA ACAACGGCCA GATGATCGCG GAAATCGCGG CGAACCATAA GACGACGGAG ATGTTCCTGC AGATCGCCCA GCGGCTGACG GGACGCGGAG AAGCCAAGAA GCCGAAAGGC TCGTTCCTGG GACCGATTCT GGAGAAGCTG CGGGCCAAGT AG
|
Protein sequence | MISYSRQNQD EPATAAPDVG PEEHIAPAPR VSVQAFCETV ETAAAVQAAG EDRRLAKAHL KIQMGGVVAA IEAYRSAPTP NVIVLETDAR SDVLAGLDQL ATVCDPGTRV IVIGKVNDVT LYRELVRRGV SDYAIAPVQP IDVVRSICNL FSAPEAKAVG RIIAVVGAKG GVGASTVAHN VAWAIARDLA LDSVVADLDL AFGTAGLDYN QDPPQGIAEA VFAPDRVDTA FVDRLLSKCT DHLSLLAAPA TLDRVYDFGA DAFDSVFDTL RTTMPCIVLD VPHQWTGWAK RSLITADDIL IVATPDLANL RNTKNLIDLL KGARPNDRPP LYCLNQVGVP KRPEISTNEF AKAIESQPIV SIPFEPQIFG AAANNGQMIA EIAANHKTTE MFLQIAQRLT GRGEAKKPKG SFLGPILEKL RAK
|
| |