Gene Rpal_4388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4388 
Symbol 
ID6412072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4716682 
End bp4717842 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content71% 
IMG OID642714270 
Producthypothetical protein 
Protein accessionYP_001993359 
Protein GI192292754 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.315553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCCGAGA CGTTCCCCAC CACGCCGTTC GTGCATCGCC GCGGCGGTGC CCTCGCCACG 
ACGACCGCCG ACCCCGGCGT ACTGCCCGGG GTCGCTGCGA TCGACAAGGC AGCTTGGTGC
GATCTGGCGA CCCGCGTGAT CGAGCCGAAC GGCTACTACC TGCCGGAATG GGTGATGGCG
GCGAACGGCG ACGACGCGTC GCCGCGCGCG CTGACCGCGC ACGATTCCGC CAGCCGTCTG
ATCGGACTCC TGCCGGTGAT CTCGTGCTGG CGCGCGTTCC GCCTGCCGCT GCCGGCGCTG
GTATCGGCCG ATCCGTTCCG CTCGCTGGAT ACGCCGCTGC TTGACCGCGA TGCAGCCAAT
GACGCCGCCG CCAAGATCAT CGCGCAGGCA CGCGCCGCAG GCGCCCGCGC CTTGGTACTG
CGCGACGTCG CCCGCGAGGG CGAAGCCGTG GCTGCGTTCA CACGCGTGCT CGACGCCGAA
GGCCTCAACC CGCGCCTGAT CAACGGCTGG ACCCGCGCCG GCCTCGACGC CACCCGCGAC
GGCGAAACCC TGCTGCGCCA GGATCTCGAT ACGAAGAAGC TGAAGAACCT GCGCCGCCTC
GAGCGTCGTC TCGGCGAGCA CGGCGAGGTG CGCTTCACCG TCGCTGATAC CGCGGATGAG
GCAGCGCGCG CGTTCGACGT GTTTCTGGCG CTGGAAGACA GCGGCTGGAA GGGCCGCCGC
GGCAGCTCAC TGAAGCGGCA GCCGGAGCTT GCCGCGCGGC TGCGCAGCGC CGCGGTCGCG
CTCGCCTCGC GCGGCCAATG CGAGGTGATC ACCCTGTCTG CCGGTGTGAC GCCGGTCGCA
GCCGGAATCG TGCTGCGCCA CGCCGACCGC GCTTACTTCT TCAAGCTCGG CATCGACGAG
AGCTTTGCGC GCTGCTCGCC CGGCGTGTTG CTGACAATGG CGCTGACCCG GCATCTGTGC
GCCGACCCGG AGATCCGCTT CGCCGACTCC ACCGCCAGCG CCCAGCATCC AATGATCGAC
CCGCTGTGGC GCGGCCGGTT CGCGGTCGGC GACCTGGTGC TGCCGCTGCG CAAGCGCGAT
CCGCTGTTCG CACCGATCGT CGCGGCTCTG TCGGCGCGCG ACCGGCTGCG GCACCTCGCC
AAGCGGCTGT TGAAGCGCTG A
 
Protein sequence
MAETFPTTPF VHRRGGALAT TTADPGVLPG VAAIDKAAWC DLATRVIEPN GYYLPEWVMA 
ANGDDASPRA LTAHDSASRL IGLLPVISCW RAFRLPLPAL VSADPFRSLD TPLLDRDAAN
DAAAKIIAQA RAAGARALVL RDVAREGEAV AAFTRVLDAE GLNPRLINGW TRAGLDATRD
GETLLRQDLD TKKLKNLRRL ERRLGEHGEV RFTVADTADE AARAFDVFLA LEDSGWKGRR
GSSLKRQPEL AARLRSAAVA LASRGQCEVI TLSAGVTPVA AGIVLRHADR AYFFKLGIDE
SFARCSPGVL LTMALTRHLC ADPEIRFADS TASAQHPMID PLWRGRFAVG DLVLPLRKRD
PLFAPIVAAL SARDRLRHLA KRLLKR