Gene Rmet_5038 details


Gene Information

Locus tag: Rmet_5038
Symbol:
ID: 4041900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 1724189
End bp: 1725235
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 67%
IMG OID: 637980459
Product: extra-cytoplasmic solute receptor
Protein accession: YP_587169
Protein GI: 94313960
COG category: [S] Function unknown
COG ID: [COG3181] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
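
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1724189, 1725235)
print(length)                     # 1047
print(protein_length_aa(length))  # 348
```

Both values agree with the Gene Length and Protein Length fields of this record.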


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.610861
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTTCC CTTGTTTCTC CTCGCGCCGG CAAACCACGC CGGCCGCTTC TGGCGGGATG 
CCGCGCGCGC AACGCGGCAG GCTTGCCGTC ATCGCCGCGC TGTCCGCGCT GTGTCTGCCC
GGCGGCATGT TGCCGACTTC CGCCTGGGCG GATAGTGGAA CGCCATTGAA GCTCGTCGTC
ACCTTCCCGC CCGGGGGCAG CACGGATATC GCCGCGCGCA TCGTGCAGCC CAAGTTGGCG
GAGGTGTTAG GGCGTCCCGT GGTGGTGGAG AACCGTCCCG GCGCCGCGAG TCAGGTCGCG
ACGCAGTACG TGGCGCGCTC CGCGCCGGAC GGCAACACCT TGCTGATCAG CTTCGATACC
CATGCGATCA ATCCCATCGC GAAATCCCGG CTGCCGTATG ACACCTTCAA GGATTTCTCC
GGTGTCACGC TGGCGCTGCG TTTCCCACTG GTGATCGGTG CCCATCCGTC GGTGCCCGGC
AAGGACCTGC GCGGATTCCT GGATGCCGCG CGGCGCGCGC CCAATCAGTA CAGCTACGCG
TCCACCGGTC TTGGTTCGAT GAACCATCTC GTCGCGGAGG ACTTGAAGCG TCAGGCCGGG
GTGGAACTGC TGCACGTGCC TTACGCCGGC GGGGGGCCGG CTGTGCAGGC CGTGCTGGGG
AACGTGTCGA GCCTGACGCT GCTGAGCTAC GCCGCGCTCA AGGGCCAGAT CGCTGCGGGG
CGCATCAAGC CCCTCGCTGT GACCGGCGCC AACCGACTGC CTGATCTGCC CGATGTGCCG
ACGGTGGCGG AGTCCGGATT CCCGGGCTTC GAGGCGTACT CGTGGATTGG CGTGTTCGCG
CCGTCCGGCA CGCCACCGGC CGTGGCGCGC AAGCTGACCA GCGACTTCCA GGCCGCCCTT
AATGATCCGG AGACCCACCG CAAGCTGACG CAGGCAGGGT TCGAGGTGAT GGCCACCGAT
GGTCCGGCGC TCGATCGTTA CGCTCGCGAG CAGTATGAAC GCTGGAAAGC CTTCGTCGTG
AAGACCGGGC TGAAGCTGGA GGAGTAG
 
Protein sequence
MLFPCFSSRR QTTPAASGGM PRAQRGRLAV IAALSALCLP GGMLPTSAWA DSGTPLKLVV 
TFPPGGSTDI AARIVQPKLA EVLGRPVVVE NRPGAASQVA TQYVARSAPD GNTLLISFDT
HAINPIAKSR LPYDTFKDFS GVTLALRFPL VIGAHPSVPG KDLRGFLDAA RRAPNQYSYA
STGLGSMNHL VAEDLKRQAG VELLHVPYAG GGPAVQAVLG NVSSLTLLSY AALKGQIAAG
RIKPLAVTGA NRLPDLPDVP TVAESGFPGF EAYSWIGVFA PSGTPPAVAR KLTSDFQAAL
NDPETHRKLT QAGFEVMATD GPALDRYARE QYERWKAFVV KTGLKLEE
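
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched as follows. This is an illustrative example, not the database's own pipeline: the helper names are hypothetical, the codon table is built from the standard amino-acid assignments (which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not model), and whitespace in the 10-base blocks is simply discarded.

```python
from itertools import product

# Standard codon order (T, C, A, G at each position) and the matching
# amino-acid string; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCTGTTCC CTTGTTTCTC CTCGCGCCGG"))  # MLFPCFSSRR
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and the GC content field listed in this record.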