Gene RSP_3641 details

Gene Information

Locus tag: RSP_3641
Symbol:
ID: 3722130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007494
Strand:
Start bp: 748893
End bp: 750221
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 52%
IMG OID: 640073317
Product: pfkB family carbohydrate kinase
Protein accession: YP_355154
Protein GI: 77465651
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG3613] Nucleoside 2-deoxyribosyltransferase
TIGRFAM ID:
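
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. Both assumptions are consistent with the listed values but are not stated by the record itself.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one stop codon; assumes an unspliced CDS."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(748893, 750221)
print(length)                              # 1329
print(expected_protein_length_aa(length))  # 442
```

Under these assumptions the computed values agree with the Gene Length (1329 bp) and Protein Length (442 aa) fields.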


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGACA AGCCTGTCCT ACTGCTGGGC GAAATCTGCG TGGATTTCAC GCTTTCTACG 
CCAATGTCAT CCGCCAAGAT GCGCCTTGGC GGCGTAGTTC ACGCAGCAAG GGGTCTGTGG
GCATTAAATG TACCGTACTC GGTAGCCGCT ATTTGCCCAG ATTATCTGGT CTCAGAGGCG
GAGGCGTATC TTTACGCTCA TGGATGTACT AACTTCATTC AGGTAGGCAC TGTGCTTGGC
GCCCCTAACG TCTTCCTAAT TGGAGACGCG AGAGAAGTGG GGCATCAGGG ATACGAGAAC
ATCCTGCGCG AAGCAAAGAA GGTGCTGACG TTCGATCTAG ACGAGAAACT AAGGGGCTTC
AAAAACATAG TAGTTTATCC GGGCACTTTC GATTTCGAGG GTATTGTCAA AAACCTTGAT
AGCGACGCGA AAATCACTGT TGACATTGCA TACGACGTGA ATGATGCAGA TGAACTTTCC
AGAGTTTACG GCGCATCTCA TGCGCTAGCC ATATCAACAT CATCTGAGTT ATTTTCCTCT
CTTGCCCTTG AGAATTTGGG GCCACTTTTG AAGGCCTGCA AGACTCGAAG CGCAAAGCAT
CTTCTGCTAA AGGAGAATAG AGGGGGGAGC AGACTATTCG ACTTAGAATC CGGAGAAAGC
TGGGAAATCC CAGCCACACT GGCTAGAACA GTAAACTCTG TGGGGGTGGG CGATGTTTAT
ACCGCTGTAT TTGCGGCATT AATAGACCAT GGCGCACTTA CGGCCGCCCT TCGCGGGATG
CAAGCTGCTA CACGCTACGC GCAGACGACA TTTCCCGACG ACTTCAAGAG AGACGTTCAA
CGTGACTTCA AGCTAACACC TGATGAAGTC CTGTCGCTGG GCGGTGTACT GCTGCCATGG
CACGACCGAC CTCACTTCGA AATTTATCTG GCCGCACCTG ACTTCTCGTA TATGGAGAAA
GCTGAAGTTG ACGCTGCAGT GGAGGCGCTC CTCTATCATA ATTTTACGGT CAGACGCCCC
GTACAAGACA ATGGAGAAGC ATGCCCTGGA ACCCCGGCGG AAGATCTGCG GTCCTACTAC
GATAGAGATG TCGAACTGTT GAGGCGCTGC AGTTTAGTAT TTGCTGTCCC GCTGAACCGC
GATCCAGGAA CGCTTGTTGA AATTGGCCTT GCCATTGCGG CTGGCACGCC GGTCGTTACC
TACGATCCGC GGCAAGAGAA CAATAATACG ATGGTGATCT GCGGCAGCGA CGCCTACTCA
GATGACTTGG ACCAGTGCTT GAACTCGACG TTCGAACTTC TCGCCAAACT GCGGGAGCAA
GCGAAATGA
 
Protein sequence
MLDKPVLLLG EICVDFTLST PMSSAKMRLG GVVHAARGLW ALNVPYSVAA ICPDYLVSEA 
EAYLYAHGCT NFIQVGTVLG APNVFLIGDA REVGHQGYEN ILREAKKVLT FDLDEKLRGF
KNIVVYPGTF DFEGIVKNLD SDAKITVDIA YDVNDADELS RVYGASHALA ISTSSELFSS
LALENLGPLL KACKTRSAKH LLLKENRGGS RLFDLESGES WEIPATLART VNSVGVGDVY
TAVFAALIDH GALTAALRGM QAATRYAQTT FPDDFKRDVQ RDFKLTPDEV LSLGGVLLPW
HDRPHFEIYL AAPDFSYMEK AEVDAAVEAL LYHNFTVRRP VQDNGEACPG TPAEDLRSYY
DRDVELLRRC SLVFAVPLNR DPGTLVEIGL AIAAGTPVVT YDPRQENNNT MVICGSDAYS
DDLDQCLNST FELLAKLREQ AK
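
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool, and the example uses only the first line of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGCTTGACA AGCCTGTCCT ACTGCTGGGC "
    "GAAATCTGCG TGGATTTCAC GCTTTCTACG"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence block would let a reader compare the result with the Gene Length and GC content fields in the record.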