Gene Rpal_4440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4440 
Symbol 
ID6412124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4768246 
End bp4770048 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content66% 
IMG OID642714322 
Productthiamine pyrophosphate protein TPP binding domain protein 
Protein accessionYP_001993411 
Protein GI192292806 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.406956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGCTCT CCGACTACGT CATCGATTTT CTGGCGCGGC GCGGCGTCAC CCATGTGTTC 
GGCATTTCCG GCGGCGCCGC GGTGCACATG TTCGACTCCG CCCAGCGTCA CCCTGACGTC
ACGCCGATCT TTCCCCAGCA CGAGCAGGCT GCCGCGATCG CTGCCGACGG CTACGCGCGC
GCCACCGGCA GGCTCGGCGT CGCCATCACC ACCTCGGGTC CCGGCGCCAC CAATCTGCTG
ACCGGGGTGT GCTGCGCCTA CTACGACTCC GTGCCGACGC TGATGATCAC CGGGCAGGTC
GCGACCCATC GGCTGAAGGG CAAGAACCAG ATCCGTCAGC TCGGGTTTCA GGAGACCGAC
GTCACCTCGA TCTTCGCCAC GGTGACTAAA TACGCGGTAC AGATCTCCGA CCCCACCACG
ATCCGTTATC ATCTGGAGAA GGCGTACCAT CTCGCGTTCG AGGGGCGGCC CGGTTCGGTG
CTGATCGATC TGCCGGACGA TCTGCAACGC GCCGAGATCG ATCCCGACAT GTTGCCGGGC
TTCACGCCGG AGCCAGAGGC CGCATCGAAC GATCTCGACG CCGAGATCGC GGCGCTGCTG
CCGCTGATTG CGCGGGCCGA GCGGCCGGTG CTGGTGCTCG GCGGCGGGCT GTCGACGCCG
CGGATCGGGT CGATGCTGGA TCACCTCGTC GACCGCCTCG GCATGCCGGT GCTGACCACC
TGGGCGGCGA CCGATCTGAT CGCAGCCGAC CATCCGCTGC GGGTCGGGCC GTTCGGCGTC
TATGGGCCGC GGCTCGGTAA CTTCACGGTG CAGAACGCCG ACCTGATCCT GTGCCTCGGC
AGCCGGCTGT CGCAGAACGT CACCGGCGGC ATCCTGCCGT CGTTCGCACG CGAGGCGACG
ATCGTGATGG TCGACGCCAG CCGCGGCGAG ATGGACAAGT TCGACGACCG CGGCATCCGC
ATCGCGACGC GGATTGCAGC GCGGCTCGAC GCCTTCGTGC CGAAGCTGCT CGCCGCGATC
GAGGCGGCGC CGCCGCGCGA GGCTTGGCTG AACACCATCG GGCATTGGCG CAGCGCGTTG
CCGGATGATC GGCCTGGCCC TGCGCCCGAC AATGCCGGCT TCGTCGACGC CTACGACTTC
ATCGACAAGT TGAGCGACGC TGCGCCCGCC GACGAACTGC TCTATGTCGA CACCGGGGGC
AACCTGACCT GGACCTGCAA CGGCTTCCGC ATCAAACGCG GGCAACGGTT GATCTCGGAC
TGGAACAACA CCGCGATGGG CTACGCGCTG GCGGCGGCGA TCGGCGCTGC GGTGCAGGCG
AGGGGCGGTG TCACCTGCAT CGTCGGCGAC GGCGGTTTGA TGCTGTCGCT GGGTGAGTTG
GCGCTGCTCA AGCGGCACGA ACTGCCGATG CGGCTGATGC TGTTCAATAA TCACGGCCAT
GGCATCCAGA AGCAGACGCT GGAGACCTGG CTCGATGGCC ACTATGTCGG CGTCGATGCG
CCGAGCGGCC TGTCGTTCGT CGACTTCCGC AAAGTTGCCG AAGCGATGGA TCTGCCGGTG
GTCACGATCA GCCGCAGTGC CGACATTGCT AGCCAGCTCC GCGACGTTTA TGCGCGCAGA
GGCCCGGTGT TCTGCAACGT CGAAATCAAC CCGGCGCAGA AATTGTACCC GGTGCTGAAG
TTCGGCGCGC CGCTGGAGAG TCAGCTGCCG TCGATCGACG ATGAGCTGAT CAAGCGCGAA
ATGTTGATCG CGCGGTTTGT CCCCGGCTCC GCGCCAAAGC ACAGCGGCGG CGCGGGCGTA
TAG
 
Protein sequence
MKLSDYVIDF LARRGVTHVF GISGGAAVHM FDSAQRHPDV TPIFPQHEQA AAIAADGYAR 
ATGRLGVAIT TSGPGATNLL TGVCCAYYDS VPTLMITGQV ATHRLKGKNQ IRQLGFQETD
VTSIFATVTK YAVQISDPTT IRYHLEKAYH LAFEGRPGSV LIDLPDDLQR AEIDPDMLPG
FTPEPEAASN DLDAEIAALL PLIARAERPV LVLGGGLSTP RIGSMLDHLV DRLGMPVLTT
WAATDLIAAD HPLRVGPFGV YGPRLGNFTV QNADLILCLG SRLSQNVTGG ILPSFAREAT
IVMVDASRGE MDKFDDRGIR IATRIAARLD AFVPKLLAAI EAAPPREAWL NTIGHWRSAL
PDDRPGPAPD NAGFVDAYDF IDKLSDAAPA DELLYVDTGG NLTWTCNGFR IKRGQRLISD
WNNTAMGYAL AAAIGAAVQA RGGVTCIVGD GGLMLSLGEL ALLKRHELPM RLMLFNNHGH
GIQKQTLETW LDGHYVGVDA PSGLSFVDFR KVAEAMDLPV VTISRSADIA SQLRDVYARR
GPVFCNVEIN PAQKLYPVLK FGAPLESQLP SIDDELIKRE MLIARFVPGS APKHSGGAGV