Gene Information |
Locus tag | Rsph17025_4070 |
Symbol | |
ID | 5086243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 119937 |
End bp | 121010 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640485633 |
Product | hypothetical protein |
Protein accession | YP_001170227 |
Protein GI | 146280070 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase |
TIGRFAM ID | [TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.141328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0763536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATGGA GACAGGCCCT GCGTTGCATC GTCCGCACCC CCGCCCTCCT GCTGATCGCG GGACTGCCGC CGGGCCTTGC GGGGGCGCTT CCCCCGGCGC AGGCCGCCCC CTGCGCCGAG GCGCCCCGGG CGGCCGAGGC GCACCGCGAC CGGCTGTCGG CCGCGCTGCG CGACGGGTTC GGCGTCCAGT ACTGGGGGGC CGCCTATGAC GCCGAGGGGC TGTCGGCCGC GCCCCACGGG CTGCTGATCG TCGAGGCGAC CCGCGTGGGC GCCGACCGCA GCGCCGACGG ACGCGAGCAG CTGTTCACCC CGGCCGAGAT CGCCCGGATC AGCCACGAGG GCCGCCGCCC GGTGATCGCC TATCTGAACC TGGCCGAGAT CGAGAGCTAC CGCCATTACC ACGCCCGCAC CCCGCGCGAG GAGCAGCGCT GGCAGGGCCC GACCAGCGCC TCGGGCGAGC GGCTGGCGGC CTACTGGCGA CCCGAATGGC ATGAGGTGCT GCGCGAGCGG GTGGACGAGC TGATGCGGCT GGGCTTCGAC GGACTCTTTC TCGACGATGT GCTGCATTAC TACACCCATG CCGCGGGCGA GACCCAGCCG ACACCGGGCT ACGACGCGAG CGACGCGCCC GGGGATGCGC CGGCCCATGC GCGGGCGATG ATGGCGCTGG TGGTCGATCT GGCCGAGCAT GCGCGGCGCC AGCGCTGCGA CGCGATCGTG GTGGTGAACA ACGGCGCCTT CATCGGGCGC GACGCCGGCC CCGATCCGGC CACGGCGGAA TCCCCCGGCC CCTTCGCGCG CTACCGCAGC GCGATCAGCG CGATCCTGGC CGAAAGCGTG TTCGACACCA ACAACCGCCA GCCCACGATC GACGCCCTGC GCGAGGATTT CCTCGACCGG GGCGTCCAGG TCATGTCGAT CGACTTCAAG ACCCATTTCG TCGGGCCCGG CGGCGAGAGC TACCGCGAGC TGGTGCGGCG GCGCGCCGCG AAGGCGGGCT TTGCGGCCTA TGTCGCCGAT GACGAGGCTT TCAACCGCCT CTACGAGCCG ATCAGGGCCC CGGCAATCCG CTGA
|
Protein sequence | MRWRQALRCI VRTPALLLIA GLPPGLAGAL PPAQAAPCAE APRAAEAHRD RLSAALRDGF GVQYWGAAYD AEGLSAAPHG LLIVEATRVG ADRSADGREQ LFTPAEIARI SHEGRRPVIA YLNLAEIESY RHYHARTPRE EQRWQGPTSA SGERLAAYWR PEWHEVLRER VDELMRLGFD GLFLDDVLHY YTHAAGETQP TPGYDASDAP GDAPAHARAM MALVVDLAEH ARRQRCDAIV VVNNGAFIGR DAGPDPATAE SPGPFARYRS AISAILAESV FDTNNRQPTI DALREDFLDR GVQVMSIDFK THFVGPGGES YRELVRRRAA KAGFAAYVAD DEAFNRLYEP IRAPAIR
|
| |
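The derived fields in this record can be cross-checked against each other. The following is a minimal sketch, not the database's own method: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive (which matches the listed start, end, and gene length), and the gene length is assumed to include the terminal stop codon (the listed gene sequence ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces used to group the sequence are ignored."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Values taken from the record above.
length = gene_length(119937, 121010)
print(length)                  # 1074
print(protein_length(length))  # 357

# GC content of the first 10-base block of the gene sequence only.
print(gc_percent("ATGCGATGGA"))  # 50.0
```

Passing the full gene sequence string to `gc_percent` (with or without the spaces between 10-base blocks) gives a value that can be compared with the GC content field.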