Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_3883 |
Symbol | |
ID | 5210865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4862728 |
End bp | 4863756 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597478 |
Product | membrane dipeptidase |
Protein accession | YP_001278186 |
Protein GI | 148657981 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCTTC ACATTCCGAT CTTCGACGGT CATAACGATA CCCTTCTGCG TCTCTTCGCA TCGAAACACA ATGATTCGTT CTTTGAGTCG TCCCAGGGGC ACATCGACCT GGCGCGCGCC CGCGCTGGCG GTTTTGCTGG CGGCTTCTTC GCGGTCTTTG TTCCTCCTGC GCCAGGTGAA CAGCCAGCGA ACGACGACGA CCTTCCTGAG CGATTACCGT TCGCCTATGC GCTCGAAACA GCGCTGGCAA TGACCGCCCT GCTCTTCCGG ATCGAAGCGC AATCCGCTGG TCAGGTGCGG GTCGCACGCA CGGTTGATGA CATCGAGCAC GGCATCCGAA CCAATACGCT GAGCGCGATC CTGCATTTCG AGGGCGCCGA CGCGATCGAT CCTGAGTTTC ACACACTCGA AGTGCTCTAC CGGGCTGGTC TCCGCTCCCT GGGGATCGTC TGGAGTCGCC CGAACGCATT CGGATGGGGC GTACCGTTTC GTTTCCCGCA CGATCCCGAT ATTGGTCCTG GTCTGACCGA AGCCGGACAC GAACTGGTGC GAATATGCAA CCGCCTCGGC ATCATGATCG ATCTGTCGCA TCTGAACGAA GCCGGCTTCT GGGATGTGGC GCGCCTGAGC AGCGCACCGC TGGTCGCAAC CCACTCGAAC GCATATGCCC TCTGTCCCTC GCCGCGCAAC CTGACCGACC GCCAGCTCGA CGCGATCCGC GAGTCGGACG GGATGGTCGG CGTCAATTTC CACGTCGGCT TTCTTCGTCG CGATGGCAGG CGCGATGCTG CGACGCCGCT GGATGCTGTA GCAGAGCACG TCATCTACCT GGTCGAACGG TTGGGAATTG ATCGGGTCGG TTTCGGCTCG GATTTCGACG GCGCGCTGAT GCCGCACGAG TTGGGAGACG TCGCCGGACT GCCACGCCTG CTGGAGACAT TGCGCCGTCA CGGGTTCGAT GAAGCATCGC TGCGCAAACT GGCGCACGAA AACTGGGTGC GTGTTTTGAA AAAAACATGG CGCAGGTGA
|
Protein sequence | MNLHIPIFDG HNDTLLRLFA SKHNDSFFES SQGHIDLARA RAGGFAGGFF AVFVPPAPGE QPANDDDLPE RLPFAYALET ALAMTALLFR IEAQSAGQVR VARTVDDIEH GIRTNTLSAI LHFEGADAID PEFHTLEVLY RAGLRSLGIV WSRPNAFGWG VPFRFPHDPD IGPGLTEAGH ELVRICNRLG IMIDLSHLNE AGFWDVARLS SAPLVATHSN AYALCPSPRN LTDRQLDAIR ESDGMVGVNF HVGFLRRDGR RDAATPLDAV AEHVIYLVER LGIDRVGFGS DFDGALMPHE LGDVAGLPRL LETLRRHGFD EASLRKLAHE NWVRVLKKTW RR
|
| |