Gene RPC_1772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_1772 
Symbol 
ID3972211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp1925968 
End bp1927617 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content65% 
IMG OID637924885 
Producthypothetical protein 
Protein accessionYP_531650 
Protein GI90423280 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.707958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0570735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGCCA TAACGACTTC TTCCTTCGAT ACGCCCGTGC GGCGCTCGGC GGAACGAACT 
TGCGACGACC TTTCGATCGC GGCGTTGGCC GTGGTCAGCC TGATCGCAGG GCTGACATTT
CGCGACTACG GGCTGGGCTG GGACGATTAC ACCCACGCCG AATACGCCGA TCTACTGTTG
CGAATGTACG GCTCCGGCTT CCGCGATACC GCCGCGCTAT CGTTTGCCAA TTTATACATG
TATGGCGGCG GCTTTGATAT CGCAGCGGCG CTGCTGCACA AGATCATTCC GCTGGAATTG
TTCGAAACCA GGCGGCTGCT CGGCGCGGTG GTCGGCGTCA TCGGCTTGGC GGTGACCTGG
CGGCTGGCGC GCCGGGTCGG CGGTCCGTTC GCCGGGCTCG CGGCGCTGTT GCTGCTGGCG
CTGTGCCCGA CCTTCTACGG CCACATGTTC ATGAATCCGA AGGATGCGCC GTTCGCCGTG
GCGATGGTGA TCCTGATGCT CGGCCTGGTT CGCCTCGCCG AAGAATATCC CCGGCCCTCG
CCGCACACCG TGCTGATCGT CGGGCTCGGC GCCGGGCTGT CGATCGGCTC GCGGATCCTC
GGCGGTCTGG CGCTGGTCTA TGCGCTGATC GGCTTCGTGC CGCTGCTGAT TGAGGAAGTC
CGCAGCAACG GCCTGCGCGA AGCCGCGCGC CGGTTTGCCC AGATGAGCTA CGCGCTGATC
CCCGGCCTGA TCTTCGGCTA CTTGGTGATG GGGCTGATCT GGCCGTGGTC GATCATGGAG
CCGGCCAATC CGCTGCGCGC CCTGACCTAT TTCTCGCATT TCTTCGAAAA GCCCTGGAAA
GAGATGTTCG ACGGCGCGCT GGTGTCGGTG CCGGACATGC CGTGGTCCTA CCTGCCGACG
CTGTTCGCGC TGCAACTGCC GGAAGTGCTG CTTGGGCTGG CCGCCGGCGG CGTGGTGATG
ACCATCGTGG CGCTGTCGCG CGACACCGTG ACGCCGCGGC GCAAGACCAT CCTGCTGATG
CTGACGGTGG CGGCGACGCT GCCGCTCGTC ATCGCCATGG TGAAACGCCC GGCGCTGTAC
AACGGCATCC GGCATTTCAT CTTCGTGATC CCGCCGATGA CGGTGCTGGC CGGCGTCGCT
TTAGCGCGGT TGATGGATTG GATCGGCGCC CGCCACCTCG GCTCGCAGGC CGCGGCGGTG
GCGATCTTCG CGTTCGGGCT GATGCTGCCG TTGGCCGAAA TGATCCGGCT GCATCCCTAT
CAATACACCC ACTTCAACCA CGTCGCCGGC ACCGTGCGCA GCGCCGAATC GCTGTTCATG
CTGGACTATT GGGGTCTAGC GCTAAAGCAG GCCTCGGACG CGCTGCGCGA CGAAATCGCC
GAGCGCGAGG AAGTGCCGCC GCGCGGGCGC AAGTGGAAGG TCGCGGTGTG CGGCCCGCAG
CGCCCGGCGC AGGTTGCGCT GGGGCCGGAC TTCACCATCG GCTGGGACAG CCACGCCGCC
GACTTCGCCA TGACGCTCGG CGAATTCTAC TGCAAAGGCC TGACCGCGCC GATCATCGCC
GAGATCAAGC GCGACGACGT GGTGTTCGCG CGGGTCTACG ACATCCGCAG ACAGAGCATC
TCCAGCCTGT TGTCGATCCC GGCGCCATAG
 
Protein sequence
MSAITTSSFD TPVRRSAERT CDDLSIAALA VVSLIAGLTF RDYGLGWDDY THAEYADLLL 
RMYGSGFRDT AALSFANLYM YGGGFDIAAA LLHKIIPLEL FETRRLLGAV VGVIGLAVTW
RLARRVGGPF AGLAALLLLA LCPTFYGHMF MNPKDAPFAV AMVILMLGLV RLAEEYPRPS
PHTVLIVGLG AGLSIGSRIL GGLALVYALI GFVPLLIEEV RSNGLREAAR RFAQMSYALI
PGLIFGYLVM GLIWPWSIME PANPLRALTY FSHFFEKPWK EMFDGALVSV PDMPWSYLPT
LFALQLPEVL LGLAAGGVVM TIVALSRDTV TPRRKTILLM LTVAATLPLV IAMVKRPALY
NGIRHFIFVI PPMTVLAGVA LARLMDWIGA RHLGSQAAAV AIFAFGLMLP LAEMIRLHPY
QYTHFNHVAG TVRSAESLFM LDYWGLALKQ ASDALRDEIA EREEVPPRGR KWKVAVCGPQ
RPAQVALGPD FTIGWDSHAA DFAMTLGEFY CKGLTAPIIA EIKRDDVVFA RVYDIRRQSI
SSLLSIPAP