Gene Rsph17025_4070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4070 
Symbol 
ID5086243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp119937 
End bp121010 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content73% 
IMG OID640485633 
Producthypothetical protein 
Protein accessionYP_001170227 
Protein GI146280070 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase 
TIGRFAM ID[TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.141328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0763536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATGGA GACAGGCCCT GCGTTGCATC GTCCGCACCC CCGCCCTCCT GCTGATCGCG 
GGACTGCCGC CGGGCCTTGC GGGGGCGCTT CCCCCGGCGC AGGCCGCCCC CTGCGCCGAG
GCGCCCCGGG CGGCCGAGGC GCACCGCGAC CGGCTGTCGG CCGCGCTGCG CGACGGGTTC
GGCGTCCAGT ACTGGGGGGC CGCCTATGAC GCCGAGGGGC TGTCGGCCGC GCCCCACGGG
CTGCTGATCG TCGAGGCGAC CCGCGTGGGC GCCGACCGCA GCGCCGACGG ACGCGAGCAG
CTGTTCACCC CGGCCGAGAT CGCCCGGATC AGCCACGAGG GCCGCCGCCC GGTGATCGCC
TATCTGAACC TGGCCGAGAT CGAGAGCTAC CGCCATTACC ACGCCCGCAC CCCGCGCGAG
GAGCAGCGCT GGCAGGGCCC GACCAGCGCC TCGGGCGAGC GGCTGGCGGC CTACTGGCGA
CCCGAATGGC ATGAGGTGCT GCGCGAGCGG GTGGACGAGC TGATGCGGCT GGGCTTCGAC
GGACTCTTTC TCGACGATGT GCTGCATTAC TACACCCATG CCGCGGGCGA GACCCAGCCG
ACACCGGGCT ACGACGCGAG CGACGCGCCC GGGGATGCGC CGGCCCATGC GCGGGCGATG
ATGGCGCTGG TGGTCGATCT GGCCGAGCAT GCGCGGCGCC AGCGCTGCGA CGCGATCGTG
GTGGTGAACA ACGGCGCCTT CATCGGGCGC GACGCCGGCC CCGATCCGGC CACGGCGGAA
TCCCCCGGCC CCTTCGCGCG CTACCGCAGC GCGATCAGCG CGATCCTGGC CGAAAGCGTG
TTCGACACCA ACAACCGCCA GCCCACGATC GACGCCCTGC GCGAGGATTT CCTCGACCGG
GGCGTCCAGG TCATGTCGAT CGACTTCAAG ACCCATTTCG TCGGGCCCGG CGGCGAGAGC
TACCGCGAGC TGGTGCGGCG GCGCGCCGCG AAGGCGGGCT TTGCGGCCTA TGTCGCCGAT
GACGAGGCTT TCAACCGCCT CTACGAGCCG ATCAGGGCCC CGGCAATCCG CTGA
 
Protein sequence
MRWRQALRCI VRTPALLLIA GLPPGLAGAL PPAQAAPCAE APRAAEAHRD RLSAALRDGF 
GVQYWGAAYD AEGLSAAPHG LLIVEATRVG ADRSADGREQ LFTPAEIARI SHEGRRPVIA
YLNLAEIESY RHYHARTPRE EQRWQGPTSA SGERLAAYWR PEWHEVLRER VDELMRLGFD
GLFLDDVLHY YTHAAGETQP TPGYDASDAP GDAPAHARAM MALVVDLAEH ARRQRCDAIV
VVNNGAFIGR DAGPDPATAE SPGPFARYRS AISAILAESV FDTNNRQPTI DALREDFLDR
GVQVMSIDFK THFVGPGGES YRELVRRRAA KAGFAAYVAD DEAFNRLYEP IRAPAIR