Gene RoseRS_3883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3883 
Symbol 
ID5210865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4862728 
End bp4863756 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content61% 
IMG OID640597478 
Productmembrane dipeptidase 
Protein accessionYP_001278186 
Protein GI148657981 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCTTC ACATTCCGAT CTTCGACGGT CATAACGATA CCCTTCTGCG TCTCTTCGCA 
TCGAAACACA ATGATTCGTT CTTTGAGTCG TCCCAGGGGC ACATCGACCT GGCGCGCGCC
CGCGCTGGCG GTTTTGCTGG CGGCTTCTTC GCGGTCTTTG TTCCTCCTGC GCCAGGTGAA
CAGCCAGCGA ACGACGACGA CCTTCCTGAG CGATTACCGT TCGCCTATGC GCTCGAAACA
GCGCTGGCAA TGACCGCCCT GCTCTTCCGG ATCGAAGCGC AATCCGCTGG TCAGGTGCGG
GTCGCACGCA CGGTTGATGA CATCGAGCAC GGCATCCGAA CCAATACGCT GAGCGCGATC
CTGCATTTCG AGGGCGCCGA CGCGATCGAT CCTGAGTTTC ACACACTCGA AGTGCTCTAC
CGGGCTGGTC TCCGCTCCCT GGGGATCGTC TGGAGTCGCC CGAACGCATT CGGATGGGGC
GTACCGTTTC GTTTCCCGCA CGATCCCGAT ATTGGTCCTG GTCTGACCGA AGCCGGACAC
GAACTGGTGC GAATATGCAA CCGCCTCGGC ATCATGATCG ATCTGTCGCA TCTGAACGAA
GCCGGCTTCT GGGATGTGGC GCGCCTGAGC AGCGCACCGC TGGTCGCAAC CCACTCGAAC
GCATATGCCC TCTGTCCCTC GCCGCGCAAC CTGACCGACC GCCAGCTCGA CGCGATCCGC
GAGTCGGACG GGATGGTCGG CGTCAATTTC CACGTCGGCT TTCTTCGTCG CGATGGCAGG
CGCGATGCTG CGACGCCGCT GGATGCTGTA GCAGAGCACG TCATCTACCT GGTCGAACGG
TTGGGAATTG ATCGGGTCGG TTTCGGCTCG GATTTCGACG GCGCGCTGAT GCCGCACGAG
TTGGGAGACG TCGCCGGACT GCCACGCCTG CTGGAGACAT TGCGCCGTCA CGGGTTCGAT
GAAGCATCGC TGCGCAAACT GGCGCACGAA AACTGGGTGC GTGTTTTGAA AAAAACATGG
CGCAGGTGA
 
Protein sequence
MNLHIPIFDG HNDTLLRLFA SKHNDSFFES SQGHIDLARA RAGGFAGGFF AVFVPPAPGE 
QPANDDDLPE RLPFAYALET ALAMTALLFR IEAQSAGQVR VARTVDDIEH GIRTNTLSAI
LHFEGADAID PEFHTLEVLY RAGLRSLGIV WSRPNAFGWG VPFRFPHDPD IGPGLTEAGH
ELVRICNRLG IMIDLSHLNE AGFWDVARLS SAPLVATHSN AYALCPSPRN LTDRQLDAIR
ESDGMVGVNF HVGFLRRDGR RDAATPLDAV AEHVIYLVER LGIDRVGFGS DFDGALMPHE
LGDVAGLPRL LETLRRHGFD EASLRKLAHE NWVRVLKKTW RR