Gene RPB_2397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2397 
Symbol 
ID3909531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2751683 
End bp2753608 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content71% 
IMG OID637884296 
Productglycosyltransferase 
Protein accessionYP_486013 
Protein GI86749517 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.628145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGT CGCCGATCTG CAGAGGACGA CGCACCATCA AGCGCAAGCA GCCCAGGAAC 
ACGTCGAAAT ACCGTCGGCC GAACAAGAAG CGCCAGGGCG GCAAGAGCGG CGCGCGTCCG
GCTGCGGCCG CGAATTCGGC CGGCTCCGAT CTGGCTGCCG AGACCGCCAC GACCCCGGTC
ACTGCGCCCG CTCCGGTCGC TCCCCCGACG CCCGCCGTCG AGACGAAGGG CGCGCCGCCC
CGCCCGGCTC CCCACGTTTC AAAGCGGCCC ACAGCGCCGC CGAAGCCATC CAGTTCGAAG
ACATCGCCCG GTCAGCCGAC TGCACCACCG TCACCTGACG CCGTTGCGGT CACGATGCCC
GGGGGTGCGG CCCCCGGTCT GCTGCTGCGC GCCTGCGACG TGGTGCTGCA GATCCTGCGC
GGCGAGCCGC GGCGGCTGTT GCTGTGGATT CTCGGCATCT ATGGCGGGCT GTGGTTCGTC
ACAGCCTTCA GCTTCCCGAG CCTGCCGGCG ATCAGCTACG AGATGGCGCT GTTCGGTAAG
GAACTGCAGG CGGGCACTTG GAAGTATCCG CCGCTGGCGC CCTGGCTGAC CGAGATCGCC
TCGCTGCTGA CCGGCGGGTG GAGCGGATCG CAGCTCCTGC TCTCGATCGG CTCAGCGCTG
GCGACGCTGG TGCTGCTGTG GCGGCTCGGC GCTGGCATCG TCGGCGCAGC CGGTGCGACG
CTGGCGGTCG CGCTGACGAT CCTGATCGGC TGCTTCGGCC CGCAGGTCAC CGGCTATGAT
CCCGCCATCG CCGGCCTGCC GCTGACGGTC GCGGCGGTGC TGCTGTATCG GCAGGCGGTG
CTCGGGCAAG CGCGGTCGAG CTGGATCGGC CTCGGCGTCG TCTGCGCGCT GCTGGGCAAT
GCCAATCACG CCGGCTTCGC CCTGATCCTG GTGCTGCTCG GCCATCTGCT GCTGACGCGC
GAGGGTCGCC GGCAGTTGGG GACGCTGGGC CCGCCGATCG CCGCGGTGGT GTGCTTTGTC
GTGTTGCTGC CGCATCTGAT CTGGCTCGGT CAGGCGAATG CATCGGCGTC CGCCGCAGCG
GGCGCGTCGG CCGATTTGCT GCCACGGATC GGCGCGGCCT TCGCCTTCGT GTTCGGCCAG
GTCGGGCTGC ACGCTGGGCT GATCCTGATC GCGGTGCTGG CGGTGCTGCC GCGACTGCCG
CTGCAGGGCG CCCCCGCAAC GATCGAGCTC GACACGCCGA GCAGCTTCGA TCGCTCGCTG
ATCCTCGCCG CCGCCTTCGT GCCGTCGATG CTGGTTGCGG TCGGCAGCGT GCCGGACTGG
TTCACGATCG GCGCCTACAC CGGCAGTGCG CTGGTGCCGC TATCGGGGCT GGCGCTGCTC
CTGCTGCTGC CGCGGCGGCT GGTGCTACGC GCCCCGCGCC TCGCGGTGGT GGCGTGGCTG
CTGGTGCTGG TGGGCGTTCC GATCGCCACC ACGGCATCGA TCTACGCCAG AGCCTATGGT
GACGGCCCGC CGCCGACCGA GCTGTATCCG GCTCGCGCGC TGTCGCAGGC GATGCAGGCG
GCGTGGAGAA GCCGGACCAC CCGGCCGCTC GATAGCGTCA CCGGGAGTGC CCGCCAAGCC
GGCTTCGTCG CATTCGACGC CTCGCCGCGA CCATCGGTGT TCATCGATGC CGACTTCGCC
AAGAGCCCGT GGATCACGCC GCAACGGCTG AAGCAATCCG GCACGCTGGT GGTGTGGTCG
ACCGACGAAT TCGCCCGCAC CGACGAAATC CCGGCGCCCT ATCGCGGCAC GCTCGGCAGC
AGCACGCCGG TGTTCGGCAC CATGGTGCTG CCGCTCGGCC GCGGCAAACT GAAAGCCTAT
GGCTGGGCGA TGATCGCGCC GGAAGGCGAT CCACCGCAGG CGCCGGCACC GGCGCCGGCG
AAGTAA
 
Protein sequence
MAASPICRGR RTIKRKQPRN TSKYRRPNKK RQGGKSGARP AAAANSAGSD LAAETATTPV 
TAPAPVAPPT PAVETKGAPP RPAPHVSKRP TAPPKPSSSK TSPGQPTAPP SPDAVAVTMP
GGAAPGLLLR ACDVVLQILR GEPRRLLLWI LGIYGGLWFV TAFSFPSLPA ISYEMALFGK
ELQAGTWKYP PLAPWLTEIA SLLTGGWSGS QLLLSIGSAL ATLVLLWRLG AGIVGAAGAT
LAVALTILIG CFGPQVTGYD PAIAGLPLTV AAVLLYRQAV LGQARSSWIG LGVVCALLGN
ANHAGFALIL VLLGHLLLTR EGRRQLGTLG PPIAAVVCFV VLLPHLIWLG QANASASAAA
GASADLLPRI GAAFAFVFGQ VGLHAGLILI AVLAVLPRLP LQGAPATIEL DTPSSFDRSL
ILAAAFVPSM LVAVGSVPDW FTIGAYTGSA LVPLSGLALL LLLPRRLVLR APRLAVVAWL
LVLVGVPIAT TASIYARAYG DGPPPTELYP ARALSQAMQA AWRSRTTRPL DSVTGSARQA
GFVAFDASPR PSVFIDADFA KSPWITPQRL KQSGTLVVWS TDEFARTDEI PAPYRGTLGS
STPVFGTMVL PLGRGKLKAY GWAMIAPEGD PPQAPAPAPA K