Gene RPC_3742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3742 
Symbol 
ID3970337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4166500 
End bp4168041 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content67% 
IMG OID637926852 
Productglycosyl transferase family protein 
Protein accessionYP_533596 
Protein GI90425226 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00121597 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.955907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGCGGC TGACGCAGCC GCAGAGCGAG GATCGGGGCG CCGTGCGGCG CGCGGTTTTG 
ATCGTCCTGG TCCTGGTCGG GCTGCGTCTG GTCGTGGCCG CGATCACGCC GTTGACCTTC
GACGAAGCCT ATTACTGGAC CTGGTCGAAG AACCTGGCCG GCGGCTATTA CGATCATCCG
CCGATGGTCG CGCTGATGAT CCGGCTCAGC ACGCTGATCG CCGGCGACAG TGAATTCGGC
GTCCGCTGGC TCTCGGTGCT GCTGGCATTG CCGATGAGCT GGGCGGTGTA TCGCAGCGGC
GCCATCCTGT TCGGCTCGGC GCGCGTCGGC GCCACCGCGG CGATCCTGTT CAACACCACG
ATGATGGCCT GGCTCGGCAC CATCATGGCG ACGCCGGACG TGCCCTTGAT GCTGGCGTCG
AGCCTGCTGC TGTGGTCGCT CGCCAAACTG CTGCAGAGCG GGCGCGGCGT ATGGTGGCTC
GCGGCAGGTG CCGCGGTTGG CGCGGCGCTG CTGTCGAAAT ACAACGCGTT GTTCTTCGGT
CCCACGCTGC TGATCTGGCT GATCGTGGTT GCGGATCTGC GGCGCTGGTT GCGCTCGCCG
TGGCCTTATC TGGGCGGGCT GGTGGCGCTG GCACTGTTCT CTCCGACGCT ATTGTGGAAC
GCACAGCACG AATGGGCGTC GTTCCTCAAG CAGTTCGGCC GCGTCGGCGC CGCTGATTTT
CGCCCCGGCT TCCTGCTCGG CATGCTGGGC GGCCAGTTCC TGGTGATGAC GCCGGCGGTG
GCGATCCTTG GCTGCAGCGG GCTGGTTGCG ATGGCGCGCG GCGCCACCGG ACTACGCGGC
GCCGCCGCGC TGCTGCACAT CACGATCTGG GTGGTGGTGG CCTATTTCCT GGTGCACGCG
TTGCACGAGG AGGTGCATCC CGACTGGCTG TGTCAGATCT ATCCGGCGAT GGCGATTGCC
GGGGCGGTCG CGCTGGAGCG GATGACGTGG CGGTCGCGCT GGCAACGCGT TGTGAATTTC
CTCGGCCGCT GGGCGGTGCC GGGCAGCGCG GCGATGGTGG CGCTGATCGT GCTGCAGCTG
CACACCGGCG TGCTCAGCGG CTATCGCAAC GAAGAGGGCG TGCGGCTGGT CGGCGTCGGT
TTTCGCGTTG CGGCGCACCA GATCGAGGCG ATCCGCGTTC GGCTCGGCGC CAGTTGCATC
CTGGCGGCAG ACTACGGCAC TACGAGTTGG CTGATGTTCT ATCTGCCGCC CGGCAGCTGC
GTGGCGCAGC ATTTCGAGCG GATCCGCTGG GCCAATGCCA AGGAGCCCGA TGCCGCGCTG
CTGAACGGCA AGTTGCTGTT CGTCGGGCGC TCTTCCTATC AGCATTGGCT TCACCCATGG
CTGCAGGAAG CATTCGCGAG CGTTGACAGC GTGGCGGAGG TCTCGCGCAT GCGCGGCGCG
ACGGTGATTG AAACCTACCG CATCGACCTG CTGGAAGGCG CCAAAGGCGA TATACTCCTC
CGGTGGCCGC CGCCGGAGTT GATCCGACGG CGCGGTCTCT GA
 
Protein sequence
MQRLTQPQSE DRGAVRRAVL IVLVLVGLRL VVAAITPLTF DEAYYWTWSK NLAGGYYDHP 
PMVALMIRLS TLIAGDSEFG VRWLSVLLAL PMSWAVYRSG AILFGSARVG ATAAILFNTT
MMAWLGTIMA TPDVPLMLAS SLLLWSLAKL LQSGRGVWWL AAGAAVGAAL LSKYNALFFG
PTLLIWLIVV ADLRRWLRSP WPYLGGLVAL ALFSPTLLWN AQHEWASFLK QFGRVGAADF
RPGFLLGMLG GQFLVMTPAV AILGCSGLVA MARGATGLRG AAALLHITIW VVVAYFLVHA
LHEEVHPDWL CQIYPAMAIA GAVALERMTW RSRWQRVVNF LGRWAVPGSA AMVALIVLQL
HTGVLSGYRN EEGVRLVGVG FRVAAHQIEA IRVRLGASCI LAADYGTTSW LMFYLPPGSC
VAQHFERIRW ANAKEPDAAL LNGKLLFVGR SSYQHWLHPW LQEAFASVDS VAEVSRMRGA
TVIETYRIDL LEGAKGDILL RWPPPELIRR RGL