Gene Rpal_4442 details

Gene Information

Locus tag: Rpal_4442
Symbol:
ID: 6412126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4770875
End bp: 4771795
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 65%
IMG OID: 642714324
Product: Nucleotidyl transferase
Protein accession: YP_001993413
Protein GI: 192292808
COG category: [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID:
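
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
# Values copied from the Gene Information fields above.
start_bp = 4770875
end_bp = 4771795


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "921 306"
```

Under these assumptions the result agrees with the listed gene length (921 bp) and protein length (306 aa).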


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.876023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGAG GCGCCAAGGC GCTGCTGGTC GCGGCCGGCC TCGGCACGCG GCTCGCCCCG 
CTCACAGATG TGCTGCCGAA ATGCCTGATG CCGATCGCCG GGCGCCCGCT GCTTGGATTG
TGGCTGCAGA TGCTGAGCGA GGCAGGGTTC TCGGAGATCG TCGTCAATCT GCATCACCAT
GCGGATCTCG TGAGCGAGTA CGTCCGCCGC AGCCCGTGGG CAGAACGGGT GATCCTTGCA
CCCGAAACGA CGCTGCTCGG CACCGCCGGC ACGCTGCTGC GGCATCGCGC GCACTTCGCG
GATGGGCCGA CGCTGTTCGC CCATGCCGAC AATCTCAGCC TGTTCGATCC GCGCGCCTTC
CTCGCGGCCC ATGCGGGGCG GCCGCCCGAT ACGGCGATGA CGATGATGAG TTTCGTCACC
GATCATCCCC AGAGCTGCGG CATCCTCACC CTCGATCCCG CCGGCCGCGT CCTGGAGATG
GACGAGAAGC CGCAGCATCC CAAGGGCAAT CTTGCCAACG CGGCGGTGTA TATCGTCGAG
CCCGAGGTGA TCGACTTCAT CGCCTCGCTC GGCAAACCGG TGGTCGATTT CTCGACCGAA
GTGCTGCCGG TGTTCATGGG GCGGATCTTC TCGTTCCACA ACGGCAGCTA TCACCGCGAC
ATCGGCAATC CGTCGAGCCT GGCGCTGGCG CAGCTCGACT ATCCGCTGGC CGTGCTCGCC
TCTCCGCGTC CTTACGAGGA GGTGCAGCCT GCGAAGCAAT CCAGCCTGGT GCACGGGGCC
CCTGAATTGC TTCGCCTTAG GCTCGCAATG ACGGAGACCA ATAACGACGA TCCCTGGTAT
GGCCTGATGA CCGACAATAA CGGCGCTCTA GCGCGAGCCT TCGCCCAGGC TGCGGCAAAG
ACCTATGGGG CCCAGCGATG A
 
Protein sequence
MSGGAKALLV AAGLGTRLAP LTDVLPKCLM PIAGRPLLGL WLQMLSEAGF SEIVVNLHHH 
ADLVSEYVRR SPWAERVILA PETTLLGTAG TLLRHRAHFA DGPTLFAHAD NLSLFDPRAF
LAAHAGRPPD TAMTMMSFVT DHPQSCGILT LDPAGRVLEM DEKPQHPKGN LANAAVYIVE
PEVIDFIASL GKPVVDFSTE VLPVFMGRIF SFHNGSYHRD IGNPSSLALA QLDYPLAVLA
SPRPYEEVQP AKQSSLVHGA PELLRLRLAM TETNNDDPWY GLMTDNNGAL ARAFAQAAAK
TYGAQR
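
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, along with a GC fraction calculation and a stop codon check. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the stop codons used (TAA, TAG, TGA) are those of translation table 11 as listed for this gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Last line of the gene sequence as displayed above.
tail = clean_sequence("ACCTATGGGG CCCAGCGATG A")
print(len(tail), ends_with_stop(tail))  # prints "21 True"
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` would give a value comparable to the listed GC content.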