Gene RPB_3657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3657 
Symbol 
ID3911459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4197974 
End bp4199464 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content68% 
IMG OID637885559 
Productglycosyl transferase family protein 
Protein accessionYP_487263 
Protein GI86750767 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.658798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG CCAAGCGCTA CGTCGGTGGT ACGGCTCTGG CGATCGCCGC GATGGTGGCG 
CTGCGGCTGG TCGCTGCGGC GGTCACGCCG CTGACCTTCG ACGAAGCCTA TTACTGGACC
TGGTCGAAGC ATCTCGCCGC CTCCTATTTC GATCACCCGC CGATGGTGGC GTGGCTGATC
AGGCTCGGTA CCCTGATTGC CGGCGACACC GAATTCGGCG TGCGGCTGAT CTCGGTGCTG
CTGGCGCTGC CGATGAGCTG GGCGACCTGG CGTTCCGCGG AATTGCTGTT CGGCGGCCAG
CGTCTGGCCG CGCATGCGAC ACTGCTGCTC AACGCGACGA TGATGGTCTC GGTCGGCACC
GTGATCGTGA CGCCGGATGC GCCGCTGCTC GTGGCGTCGA GCTTCGCGCT CTACGCGCTC
GCGCAGGTGC TGTCGTCGGG CAAAGGCGTG TGGTGGCTCG CGGTCGGCGT CGCGGTCGGC
GCGGGGCTGC TGTCGAAATA CACCGCGCTG TTCTTCGGCC CGGCGATCCT GATCTGGCTG
CTGTGGGTGC CCAAACAGCG TCGCTGGCTG CTGACGCCAT GGCCCTATCT GGGCGGGCTG
ATTGCGTTCG CGATGTTCAC GCCTGTGGTG CTGTGGAACG CCGAGCATCA GTGGATCTCG
TTTGCCAAGC AGCTCGGCCG CGCCAGGGTC GACGGTTTTC ATCCCGGCTA TCTGCTCGAA
CTTGTCCCGA CCCAGTTCGT GCTCGCGACC CCGCTGGTCT ACATCCTCGG GTTGATGGGT
TTGTACGCGC TGGCGCGTGG CGCCGGCGCG TCGGGCGCGC GCGTGCTGAT CAATGCGATC
GTCTGGACCA TCGCGCTGTA TTTCGCCTGG CAGGCGACCC ATGACCGCGT CGAGGGCAAT
TGGCTCGGCG CGCTGTATCC CGCCTTTGCG GTCGCCGCCG CGGTCGCCGC CGCTTTCGTG
CCATGGGGAC CGAGGGCGCA ACGTGTGGTC GATGTCTGCC GGCGTTGGGC CGCGCCGGTC
GGCGTGGTGA TGTTCGTGCT GGTGGTGGTC CAGGCCAACA CCGGGGTGCT GACCGGCTAT
CGACGCGACG CCAGCGTGCG TGCGGTCGGC GTCGGCTATC CCGAGATCGC CGCCGAGATC
GCGGCGGTGC GCGAGGCGAC GGGGGCGACC TGCGTGCTCG CCGACGATTA CGGCAACACG
GGGTGGTTGG CGTTCTATCT GCCGAAGGGC ACCTGCGTGG CGCAGCGCAA CGAGCGCTAT
CGCTGGCTTG CGGCGCCGCC GCCGAGCCCG GAGCAGCTCG CCGGCAAGCT GCTGCTGGTC
GGTGAGACCA ATGCCGCTGC GCACCCGGCG CTGCGGGCGA CGTTCAGCCG GATCGAGAAG
GTCGGCGCGG TCGAGCGCAA GCGCGGACCG CTGTTGATCG ACACCCTCGA ACTCGACATC
CTCGACGGTG CCAAGGGTCC GGTGCTGGAC AATTCGCCGC CCGTCTATTG A
 
Protein sequence
MTAAKRYVGG TALAIAAMVA LRLVAAAVTP LTFDEAYYWT WSKHLAASYF DHPPMVAWLI 
RLGTLIAGDT EFGVRLISVL LALPMSWATW RSAELLFGGQ RLAAHATLLL NATMMVSVGT
VIVTPDAPLL VASSFALYAL AQVLSSGKGV WWLAVGVAVG AGLLSKYTAL FFGPAILIWL
LWVPKQRRWL LTPWPYLGGL IAFAMFTPVV LWNAEHQWIS FAKQLGRARV DGFHPGYLLE
LVPTQFVLAT PLVYILGLMG LYALARGAGA SGARVLINAI VWTIALYFAW QATHDRVEGN
WLGALYPAFA VAAAVAAAFV PWGPRAQRVV DVCRRWAAPV GVVMFVLVVV QANTGVLTGY
RRDASVRAVG VGYPEIAAEI AAVREATGAT CVLADDYGNT GWLAFYLPKG TCVAQRNERY
RWLAAPPPSP EQLAGKLLLV GETNAAAHPA LRATFSRIEK VGAVERKRGP LLIDTLELDI
LDGAKGPVLD NSPPVY