Gene Rpal_4202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4202 
Symbol 
ID6411886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4504687 
End bp4505958 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content65% 
IMG OID642714084 
Productputative pilus assembly protein CpaE 
Protein accessionYP_001993173 
Protein GI192292568 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4963] Flp pilus assembly protein, ATPase CpaE 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.195195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAGTT ACTCGCGCCA AAATCAGGAC GAACCGGCGA CGGCGGCACC GGATGTAGGG 
CCGGAGGAGC ACATCGCGCC AGCGCCGCGC GTCTCGGTTC AAGCGTTCTG CGAGACCGTC
GAGACCGCCG CCGCGGTGCA GGCCGCGGGC GAAGACCGTC GTCTCGCAAA AGCACACCTG
AAGATCCAAA TGGGCGGCGT GGTTGCGGCA ATCGAAGCCT ACCGATCCGC CCCGACGCCG
AACGTCATCG TGCTCGAGAC CGACGCGCGC AGCGACGTGC TGGCAGGTCT CGACCAGCTC
GCGACGGTGT GCGATCCGGG GACGCGGGTG ATCGTGATCG GCAAGGTCAA CGACGTCACG
TTGTACCGCG AGTTGGTCCG TCGCGGCGTC AGTGATTACG CGATCGCGCC GGTGCAGCCG
ATCGACGTGG TCCGCTCGAT CTGCAATCTG TTCTCGGCAC CCGAAGCCAA AGCCGTCGGC
CGCATCATCG CTGTGGTCGG TGCGAAGGGA GGCGTTGGCG CATCCACGGT CGCCCACAAC
GTGGCCTGGG CCATCGCTCG TGACCTGGCG CTGGACTCGG TGGTGGCCGA TCTCGATCTC
GCGTTCGGCA CCGCCGGGCT CGACTACAAT CAGGACCCGC CGCAGGGGAT CGCCGAGGCG
GTGTTTGCCC CCGACCGCGT CGATACCGCC TTCGTCGATC GGCTGCTGTC GAAATGTACC
GACCACCTCA GCCTGCTGGC TGCGCCGGCG ACACTGGACC GCGTCTACGA CTTTGGCGCC
GATGCATTCG ACTCGGTCTT CGACACGCTC CGCACCACGA TGCCGTGCAT TGTTCTCGAC
GTGCCGCATC AATGGACCGG GTGGGCGAAA CGGTCCCTCA TCACCGCCGA TGATATCCTG
ATCGTCGCGA CGCCCGACCT CGCCAACCTG CGCAACACCA AAAACCTGAT CGACCTCTTG
AAGGGGGCCC GGCCGAACGA TCGTCCACCG CTGTACTGTC TCAATCAGGT CGGGGTGCCG
AAGCGGCCCG AAATCTCCAC GAACGAGTTC GCCAAGGCGA TCGAGAGCCA GCCGATCGTC
TCGATTCCGT TCGAGCCTCA GATCTTCGGT GCGGCCGCCA ACAACGGCCA GATGATCGCG
GAAATCGCGG CGAACCATAA GACGACGGAG ATGTTCCTGC AGATCGCCCA GCGGCTGACG
GGACGCGGAG AAGCCAAGAA GCCGAAAGGC TCGTTCCTGG GACCGATTCT GGAGAAGCTG
CGGGCCAAGT AG
 
Protein sequence
MISYSRQNQD EPATAAPDVG PEEHIAPAPR VSVQAFCETV ETAAAVQAAG EDRRLAKAHL 
KIQMGGVVAA IEAYRSAPTP NVIVLETDAR SDVLAGLDQL ATVCDPGTRV IVIGKVNDVT
LYRELVRRGV SDYAIAPVQP IDVVRSICNL FSAPEAKAVG RIIAVVGAKG GVGASTVAHN
VAWAIARDLA LDSVVADLDL AFGTAGLDYN QDPPQGIAEA VFAPDRVDTA FVDRLLSKCT
DHLSLLAAPA TLDRVYDFGA DAFDSVFDTL RTTMPCIVLD VPHQWTGWAK RSLITADDIL
IVATPDLANL RNTKNLIDLL KGARPNDRPP LYCLNQVGVP KRPEISTNEF AKAIESQPIV
SIPFEPQIFG AAANNGQMIA EIAANHKTTE MFLQIAQRLT GRGEAKKPKG SFLGPILEKL
RAK