Gene RPB_1603 details


Gene Information

Locus tag: RPB_1603
Symbol:
ID: 3910074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1806268
End bp: 1807740
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 69%
IMG OID: 637883499
Product: bifunctional D-beta-D-heptose 7-phosphate kinase/D-beta-D-heptose 1-phosphate adenosyltransferase
Protein accession: YP_485224
Protein GI: 86748728
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2870] ADP-heptose synthase, bifunctional sugar kinase/adenylyltransferase
TIGRFAM ID: [TIGR00125] cytidyltransferase-related domain
            [TIGR02198] rfaE bifunctional protein, domain I
            [TIGR02199] rfaE bifunctional protein, domain II
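
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1806268
END_BP = 1807740
GENE_LENGTH_BP = 1473
PROTEIN_LENGTH_AA = 490


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1473
print(protein_length(GENE_LENGTH_BP))  # 490
```

Both results agree with the Gene Length and Protein Length fields.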


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATT TCGACACCCT GCTGCAATCG ATCGCCCGGA CGACCGTGGT GTGCGTCGGT 
GACCTGATGC TCGACGAGTT CGTCTATGGC GAGGTGTCAC GGATCTCGCC CGAGGCGCCG
GCGCCGGTGA TCGCGGTTCA GCGCAGCGAG ATCAATGTCG GCGGCGCCGG CAACGTCGCG
CGCAACATCG CTGCGCTCGG TGCGCGCTGC ATCTTCGTCG GCCTGATCGG CGACGACGAG
GCGGGCCGAA CGCTGAATGC GGAGCTCGCG AGCGAAGCCC GGATCGAGCC GCTGCTGGTG
TGCGACCCGG CGCGGCCGAC CACGCGCAAG GTGCGCTTCG TCTCCGAGCA TTTCTCCACT
CACATGCTGC GTGCCGATTG GGAGACTGCG GCGGCCGCGT CGGCCGAGAT CGAACAGCGC
CTGCTCGACG CTATCCTGTC GCAACTCGAG CGCGCCGACA TCGTGTTGCT GTCCGACTAC
GCCAAGGGCG TGCTGACGGT CCGCGTGATC GCCACCGTGA TCGAGGCCGC GCGAAAACTC
GGGAAGCGGG TGATCGTCGA CCCCAAGAGC GCCAATTTCG CGATCTATCG CGGCGCGACG
CTGCTGACGC CCAACCGCAA GGAATTCGTC ACCGCGACGC GCTGCGCGGC GGACTCCATG
GACGAGATCG CCACGGCGGC GCAGGAAGCG ATCGCATTCG CCGATTGCGA GGCGATGCTG
GTGACGCAGA GCGAGCACGG TATGACGCTG GTGCCGCGCG CCGGCGAGCC GATCCACGTG
CCGGCGATGC CGGCGAAAGT GCGCGACGTC TCCGGCGCCG GCGACACCGT CGCCGCCGTG
CTGGCGGTGG CGCTGGCGGC TGGGGCCGAT TGGGGCACCG CGATGCGGGC GGCGAGCGCT
GCGGCTGCCG TTGCTGTCAG CAAGAACGGC ACCGCGGTGG TGACGCCGGC GGAGCTGCGA
CGCAAGATCC TGCCGCACGC CTCGCTCGCG GCCGAAGACA AGATCATCGG CAGCGACGCC
GAACTCGATC TTCGCCTCGC CGAGTGGCGC CGCGACGGGC TGCGGGTCGG CTTCACCAAC
GGCTGCTTCG ACATTCTGCA TCCGGGCCAC GTCAAGGTGC TGACGGCGGC GCGCGGCGCC
TGCGACCGGC TGATCGTCGG CCTCAACAGC GACGCCTCGG TGCGCCGGCT CAAGGGCGAG
AGCCGTCCGG TGCAGAACGA GCGCGCCCGT GCCGAAGTGC TGGCGGCGCT CGAGGCGGTC
GATCTGGTGG CGATCTTCGA GGAGGACACG CCGCTGAAGC TGATCACCCG AATCGAACCG
AGCGTGCTGG TCAAGGGCGG CGACTACACC CGTGAGCAGG TGGTCGGCCA CGAGATCGTC
GCGGCCAGGG GCGGCGAGGT GCTGCTGATC GACGTGCTGC CGGGTTTCAG CACGACCTCG
CTGGTCGAGA AGGCACGCGA GGGGACGTCG TGA
 
Protein sequence
MNNFDTLLQS IARTTVVCVG DLMLDEFVYG EVSRISPEAP APVIAVQRSE INVGGAGNVA 
RNIAALGARC IFVGLIGDDE AGRTLNAELA SEARIEPLLV CDPARPTTRK VRFVSEHFST
HMLRADWETA AAASAEIEQR LLDAILSQLE RADIVLLSDY AKGVLTVRVI ATVIEAARKL
GKRVIVDPKS ANFAIYRGAT LLTPNRKEFV TATRCAADSM DEIATAAQEA IAFADCEAML
VTQSEHGMTL VPRAGEPIHV PAMPAKVRDV SGAGDTVAAV LAVALAAGAD WGTAMRAASA
AAAVAVSKNG TAVVTPAELR RKILPHASLA AEDKIIGSDA ELDLRLAEWR RDGLRVGFTN
GCFDILHPGH VKVLTAARGA CDRLIVGLNS DASVRRLKGE SRPVQNERAR AEVLAALEAV
DLVAIFEEDT PLKLITRIEP SVLVKGGDYT REQVVGHEIV AARGGEVLLI DVLPGFSTTS
LVEKAREGTS
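
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not the database's own tooling, of how the listing could be cleaned, its GC fraction computed, and the coding sequence translated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch ignores). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as printed in this record.
FIRST_ROW = "ATGAACAATT TCGACACCCT GCTGCAATCG ATCGCCCGGA CGACCGTGGT GTGCGTCGGT"
print(translate(FIRST_ROW))  # MNNFDTLLQSIARTTVVCVG
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence fields.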