Gene Rsph17025_3177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3177 
Symbol 
ID5085662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp38613 
End bp39794 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID640484749 
Producthypothetical protein 
Protein accessionYP_001169366 
Protein GI146279208 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID[TIGR02993] ectoine utilization protein EutD 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.266675 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAG TCGAGCTGCG GTTCAGCCGC GACGAGTTCG CCCAGCGTCT GGAAAAGACG 
CGACGGGCGA TGGAGGCGAA GGGGGTGGAC CTTCTGATCG TCACCGACCC CAGCAACATG
AACTGGCTGA CCGGCTATGA CGGCTGGTCC TTCTATGTCC ACCAGTGCGT GATCGTGCCG
CCCGACGGCG AGCCGATCTG GTATGGCCGC GGGCAGGACG CGAACGGGGC GAAGCTGACC
GCCTGCCTCG CGCACGGGAA CATCATCGGC TATCCCGACC ACTATGTGCA ATCGACCGAG
CGGCACCCGA TGGACTACCT GTCGGCGCTG ATGACCGACC GCGGCTGGGG CAGCCTGCGG
ATCGGCGTCG AGATGGACAA CTACTATTTC TCGGCCGCCG CCTTCGCCAG CCTCACGCGC
CACCTGCCGA ACGCCCGCTT CATCGACTGC ACCGCGCTGG TGAACTGGCA GCGTGCGGTG
AAGTCGCCGC AGGAGATCGC CTACATGCGC CGCGCCGCCC GCATCGTCGA GGCGATGCAT
GCGCGCATCC TCGACAAGGT CGCGGTGGGG ATGCGCAAGT GCGACCTCGT GGCCGAGATT
TACGACGCGG GCATCCGCGG CGCCGACGGC TTTGGCGGCG ACTATCCCGC GATCGTGCCG
CTTCTGCCCT CGGGGCGCGA GGCCAGCGCG CCGCACCTGA CCTGGGACGA CCGGCCGATG
AAGGCGGGCG AGGGCACCTT CTTCGAGATC GCCGGCTGTT ATCACCGCTA TCATGTGCCG
CTGTCGCGGA CCGTCTTCCT CGGCCAGCCC ACGCAAGCGT TCCTGGATGC CGAGAAGGCG
ACGCTGGAAG GGATGGAGGC GGGCCTTGCC GCCGCGCGTC CGGGCGCCAC CTGCGAGGAT
ATCGCCCGCG GCTTCTTCGA CGTGCTGGCG AAATACGGCA TCCTCAAGGA CAATCGCACC
GGCTATCCGA TCGGCGTGAG CTATCCGCCC GACTGGGGCG AGCGCACCAT GAGCCTGCGC
CCCGGAGACC GGACCGAGCT TCGCCCCGGC ATGACCTTCC ATTTCATGAC CGGCCTCTGG
CTCGAGGACA TGGGCCTCGA GATCACCGAG TCGATCCTGA TCACCGAGAC GGGGGTGGAG
TGCCTTGCCA ATGTCCCGCG CCAGCTGTTC GTGAAGGACT GA
 
Protein sequence
MTEVELRFSR DEFAQRLEKT RRAMEAKGVD LLIVTDPSNM NWLTGYDGWS FYVHQCVIVP 
PDGEPIWYGR GQDANGAKLT ACLAHGNIIG YPDHYVQSTE RHPMDYLSAL MTDRGWGSLR
IGVEMDNYYF SAAAFASLTR HLPNARFIDC TALVNWQRAV KSPQEIAYMR RAARIVEAMH
ARILDKVAVG MRKCDLVAEI YDAGIRGADG FGGDYPAIVP LLPSGREASA PHLTWDDRPM
KAGEGTFFEI AGCYHRYHVP LSRTVFLGQP TQAFLDAEKA TLEGMEAGLA AARPGATCED
IARGFFDVLA KYGILKDNRT GYPIGVSYPP DWGERTMSLR PGDRTELRPG MTFHFMTGLW
LEDMGLEITE SILITETGVE CLANVPRQLF VKD