Gene Rpal_3768 details


Gene Information

Locus tag: Rpal_3768
Symbol:
ID: 6411446
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4047213
End bp: 4048313
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 62%
IMG OID: 642713649
Product: glycosyl transferase group 1
Protein accession: YP_001992742
Protein GI: 192292137
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
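
The coordinate and length fields in this record agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions rather than something the record states, and the function names below are hypothetical. A minimal sketch of that arithmetic:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon
    that is not part of the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(4047213, 4048313)
    print(length, protein_length_aa(length))  # prints "1101 366"
```

Under these assumptions the computed values match the Gene Length and Protein Length fields of the record.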


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACCCAT TGCGCTTTCG TCGACTGACG ATCAACGGGA AGTTCCTGAC AGCGCGACCT 
ACCGGCGTGC ACAGGGTCGC CGATCAACTT ATTCGCCAGA TCGTTTTGAA CCAGGGCCTC
CTGGACGGCG TATTTGCGAC TTCGCCTGCG ATCGTTGCCC CAAGGTCCGC ACCGGACGGG
ACGCAAGGCG TTCACGTCGA GCGCTATGGC CGCCTGCGCG GGCAGTTGTG GGAGCAGATG
GACCTGCCCC GCGCCGCGCG CTCCGATCTG CTGCTCAACC TGTGCAATCT TGGACCTGTG
GCACTCGGCA GCGCGATCAC GATGATCCAT GATGCTCAAG TCTTCATCAC GCCGCAGTCC
TATTCGTTCG CTTTTCGCAC GTGGTATAAG ACCATTCTGC CGCTGATCGG GCAGAGACAT
CGCCGTATTC TGACGGTGTC CCATTTCTCC GCCGAGCAAC TGACGCGGGC CGGCGTCGCC
GATGCCGAGC GCATCTCAGT GATTCATAAC GGCGTCGATC ACGTCCTTGC GTATCCACGA
GCGCCCGAGA TCATCGAGCG TCTTTCGCTT GCGCGGCGGC GCTTTGTTGT CGCGCTTTCT
TCCACTCAGG CGCACAAAAA TATCAAGGTC CTGCTGGATG CCTTCTCCAG TCCGGAGCTT
GGCGACACCA AACTCGTCCT GTTTGGCGGA CATGATCGCG GCGACTTTGA ACGCCTGTCC
TCCAACGTGC CGGCCAATGT CGTGTTTGCA GGGCCGGTGA CCGACGGGGA GTTGCGGTCC
CTGTTTGAAG CGGCGCTGTG CGTGGCATTT CCATCCACCA CGGAGGGGTT CGGCCTTCCC
CCGTTGGAAG GAATGGCTTT GGGGTGTCCG GCCATCGTCG CGCCATGCGG TGCACTTCCC
GAGGTCGCCG GGCAAGGTGC GCTCTACGCG GCGGCAGACA ACCCCAGGGA GTGGATCGAA
GCGATCAGGT CCCTCGCGGC CTCGCCGCCA TTCTGGCTGG AGCGCTCCGC CGTGGGGGTA
GCGCAGGCAG CCAATTTCAC TTGGCGGAAA GCCGGTACGG ATCTCTGCAA TGTCATTCGA
CTCGTCGCCG AAGACCGATA G
 
Protein sequence
MNPLRFRRLT INGKFLTARP TGVHRVADQL IRQIVLNQGL LDGVFATSPA IVAPRSAPDG 
TQGVHVERYG RLRGQLWEQM DLPRAARSDL LLNLCNLGPV ALGSAITMIH DAQVFITPQS
YSFAFRTWYK TILPLIGQRH RRILTVSHFS AEQLTRAGVA DAERISVIHN GVDHVLAYPR
APEIIERLSL ARRRFVVALS STQAHKNIKV LLDAFSSPEL GDTKLVLFGG HDRGDFERLS
SNVPANVVFA GPVTDGELRS LFEAALCVAF PSTTEGFGLP PLEGMALGCP AIVAPCGALP
EVAGQGALYA AADNPREWIE AIRSLAASPP FWLERSAVGV AQAANFTWRK AGTDLCNVIR
LVAEDR
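
The sequences in this record are displayed in space-separated blocks of ten characters, so the spacing has to be removed before lengths or base composition can be checked against the Gene Length and GC content fields. The sketch below shows one way to do that. The helper names are hypothetical, and it assumes GC content means the fraction of G and C bases over the whole gene sequence.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C, ignoring display whitespace."""
    seq = normalize(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, copied from the record.
    blocks = "GTGAACCCAT TGCGCTTTCG"
    print(len(normalize(blocks)))  # prints 20
    print(gc_fraction(blocks))
```

Passing the full gene sequence text to `normalize` and `gc_fraction` gives values that can be compared with the 1101 bp and 62% figures listed in the record.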