Gene RPB_2640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2640 
Symbol 
ID3910432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3019636 
End bp3020607 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content70% 
IMG OID637884539 
Productthiamine-monophosphate kinase 
Protein accessionYP_486253 
Protein GI86749757 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.23047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.216691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTCCG GTGAAGACGA TCTGATTGCC CGCTACTTCA AGCCGCTTGC GACCGATCCG 
GGCGCGCTGG GGCTGGTCGA CGACGCGGCG GTGCTGGCGG CGTCGCGCGA CGATCTGGTG
CTGACCACCG ACGCCATCGT CGAGGGCGTG CATTACCTGC CCGGCGATCC GCCCCGGGCC
ATCGCCCGCA AGGCGCTGCG GGTGAACCTG TCCGACCTCG CCGCCAAGGG GGCGACGCCC
GCGGGGTTCC TGCTGACGCT GGCGCTGCGC AGCGCCGACG AGCGCTTCCT GGCGCCGTTC
GCACAGGCGC TCGGCGAGGA TGCCGCCCTT TTCGATTGTC CGCTGCTCGG CGGCGATACG
GTGTCCACCC CGGGGCCGAT GATGATTTCG ATCACCGCCA TCGGCCGGGT GCCGCCGGGT
CGCATGGTGC GGCGCAACAC GCTTTGTGCC GGGGACCGGA TCCTCGTCAC CGGCACGATC
GGCGACTCGG CACTCGGCCT CGACCTGCTG CAGGGCGCGA ATGCCGACAT CTCCGACGAG
CATCGCGCCT TTCTGATCGA CCGCTATCGC GTGCCGCAGC CGCGTTTGGC CTTGGCGCAA
GCCATACGTG ACCATGCCGG CGCGGCGATG GACGTGTCCG ACGGGCTGGC AGGCGATCTC
GCCAAGATGT GCGCCGCCTC CGGCGTCACC GCGATCCTCG ACGCTGCGGC CGTCCCGCTC
TCCGCTGCGG CGCAGGCGAT GATCTCGGGC GAGCCGGCGA AGCTGGCCCG CGTGCTCGGG
GGCGGTGACG ACTATGAGCT GCTTTGCGGC GTTGCAGCAC AGCAGCTCGA CCCGTTTCTT
GCTGCAGCGC AGCGAATAGG GGTTTCGGTC AGCGTCATCG GCTCCGCCGA AGCCGGAACC
GGAGCGCCGC GATGGCGCGA CGCCGAGCAT CGTGACATCG CGCTGTCAGG GCTGTCATAC
AGTCATTTCT AG
 
Protein sequence
MPSGEDDLIA RYFKPLATDP GALGLVDDAA VLAASRDDLV LTTDAIVEGV HYLPGDPPRA 
IARKALRVNL SDLAAKGATP AGFLLTLALR SADERFLAPF AQALGEDAAL FDCPLLGGDT
VSTPGPMMIS ITAIGRVPPG RMVRRNTLCA GDRILVTGTI GDSALGLDLL QGANADISDE
HRAFLIDRYR VPQPRLALAQ AIRDHAGAAM DVSDGLAGDL AKMCAASGVT AILDAAAVPL
SAAAQAMISG EPAKLARVLG GGDDYELLCG VAAQQLDPFL AAAQRIGVSV SVIGSAEAGT
GAPRWRDAEH RDIALSGLSY SHF