Gene Rmet_4552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRmet_4552 
Symbol 
ID4041411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCupriavidus metallidurans CH34 
KingdomBacteria 
Replicon accessionNC_007974 
Strand
Start bp1163034 
End bp1165592 
Gene Length2559 bp 
Protein Length852 aa 
Translation table11 
GC content54% 
IMG OID637979974 
Productputative curculin-like lectin 
Protein accessionYP_586686 
Protein GI94313477 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.602066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0474093 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC ACCCCAGACG GGTATTCCAG CCAATCTTGC TAATCTGTGT CGCCGCAGTT 
CTGTCGGCCT GTGGCGGCGA CGACGGAGGC GATGGATCTG CTACGGCAAG CGGAAGCCCG
AGCAACACCG CACGATCGGT TCAATACTAT TCAGGCCAAA AGGCCATTGA CCATCTGGGC
AATTCCCTAA GCGAGTTTGC CGTGCTGCAC GGTATGACGG CAGAGGAACT GAGACAAAAA
TTATTGACCG ACCCCACGCT TTTCGTGACG AGTCTCGCAA AGCTTGTTTA CCGTGACAAC
GCCCGGCAAG CCCAAGTCGT TGCGCCAAAT ACTCTATCGC GATTACAGGT CCTCACCGAG
AGTAGCTCAG CCGACCCGGG TAATGTTTTT GCTTTGCATA GCCGAAGTGG TTCCACCAAG
ACTATTTACC TCAATTTCAA AGGCGGCACT CTTCGCTATC CAACAGCCGT TCCGCCAATC
GTTGAAACTT ATCCCGCATT TGACCTGGAT GGCGACCCTT CCACGTTCAA CACGGAAGAA
AGGCTCGTGA TTCAAGAAGT TTTTCGACGC GTCGCCGAAG ACTTTGCCAT CTTCGACGTA
GATGTCACCA CAGAAGAGCC GGCTCGCGAC AAACTTGTGA GGGCGGATGC CCAAGATGGA
ATTTACGGGC AAGAAATATA CATCACCCGG GACATATATA GTGTCGACTC AGGGGGTGAA
GCACCCATAG GCGTGTTCAA TTCGATAGAC ACACCCGGAC AGCCAGACCA AAGCGACTAC
AACAAGGCGG CGAAAGTTTT CTATGACAAG CTCGCTGTTG CCGCATCTGA TCGCGCGGCG
GTAATAGCCG ACGCCATTTC CCATGAAACT GGACACACAC TCGGACTATC CCACATGGGG
ACCGCTACCG CGACCTACTA CCCTGGCGAT AGCTTCTCCA GTTGGCTGCC ATTAATGTCT
CAGCATCCTA ACGCCAACTC TATCCGAAAG CTCACGACAT GGTCGCACGG GGACTACAAA
AACGCCAACA ACCATGAAGA CGAGCTAGCG ATTATTAACG CTTATGGTTT GAAATTAATA
TCGGACGATT TTGGAGATAC AATCGGAGCC GCCTTTCCGC TTGGAATCGA TGGCACGAAT
AGCGATGGCC GTCCGTCATC ACGGGTCTCC GGCATAATCG GCTCGTCAAC AGATAGCGAC
ATGTTTCGCA TTACGGTGCC CGAAGGCCCG CTTACTATTC AGGCGGACCC TGCCGCCGTG
GGCCCGAATC TGGACATCAA ACTGAGCCTT CTCGATAGTG CCGGCAATTC AATTCAGATT
GTCAAAGACA ACCCAACCAC TGCCGACTCA CTCAGCGCAG GTATCGCATT ACCCCAAATT
GCAGCTGGGA CGTACTACGT AAAAGTGGAA GGAACTGGTC GCGGGTCACC TCTGACCCCC
GACCCGAAGA TCGGTTTGCC ATGGGGTTAC ACAAAGTATG GCAGCTTGGG ATCCTATACT
TTGGCCGTTA TCTATGCTCC GGTATTGGCG GCCGACCAGA TATCCACGTT GACCAAGGAA
AGGACTCAGG CAATGGGATT CAATATCTCG CTGCTGTCTC CTGGGGCATG GAACGTCTTC
GTCTCGGCCG GCACAAGCCT CGTGAAGAGT TATGGCGGCG CTTCGGGAGC AGCTCTCACT
GACGCCGCTA TATTGCCATC TGGAGCGAAG CTAAACGCCG GAGAATCGCT GCGCGCGGGC
GGCATGACGC TCACCATGCA AGCGGACGGG AACCTTGTGA TCTACGACGC GGCGCACAAA
GTCCTCTTTG CCTCAACTAC CAATAGCGTG GACAACCAAG GCTCCGCTCT TAACATGCAG
GCAGACGGAA ACCTTGTAAT CCGCAACCCG GCCGGCAAGT CAGTCTGGGC CTCAGGTACC
GATGACTTTG GCGGGGAGTA CATGGTGTTC GGCAGCGATG GCACTTTGAG TTTGCGCACG
GGAAGCCGAA TAAACTGGCT GCTGCCGTCG CAGTTGGCCG TATTGACGTC GGCACAAGTA
TCAGTGCTGA GTGCAGCGCA GATTCAGGGG TTGGACACTG CTGTCAATAA TTTGTCGTCA
GACGCCCTGA ATTTCATTTC ATCGCTTGGA TCGACCGTTG TTAAACGATG GGGTGGCGTA
ACCGGCGTCG CTCTGACAGA GGTGAAAAAG ATTGCCGCCG GAGGAAAGCT AAATTCTGGT
GTGACCCTGA CGGCCGGAAG CGTGAGCCTC GTTATGCAAG GGGACGGCAA CCTCGTGATC
TACGATACGG CACACAAGGC CGTCTGGGCT TCGAACACGA ATAGCGTGGA CAATCAAGGC
GCACATCTCA ATATGCAGGC AACGGACGGG AACCTTGTTA TGTACACACC AGCCGGCAAA
GCAATTTGGG CCTCTAATAC CAATGACAAC AGAGGAGAGT TTATGGTGTT CGGAAGCGAT
GCATGCATGA GTCTGCAAAC TGGGAGTCGC GTGAGTTGGT TCACCGCGGC ACAAGTGCAA
AGCCTGACAC AAGCCGGCAA GATGCCGGTA GCCCACTAA
 
Protein sequence
MKKHPRRVFQ PILLICVAAV LSACGGDDGG DGSATASGSP SNTARSVQYY SGQKAIDHLG 
NSLSEFAVLH GMTAEELRQK LLTDPTLFVT SLAKLVYRDN ARQAQVVAPN TLSRLQVLTE
SSSADPGNVF ALHSRSGSTK TIYLNFKGGT LRYPTAVPPI VETYPAFDLD GDPSTFNTEE
RLVIQEVFRR VAEDFAIFDV DVTTEEPARD KLVRADAQDG IYGQEIYITR DIYSVDSGGE
APIGVFNSID TPGQPDQSDY NKAAKVFYDK LAVAASDRAA VIADAISHET GHTLGLSHMG
TATATYYPGD SFSSWLPLMS QHPNANSIRK LTTWSHGDYK NANNHEDELA IINAYGLKLI
SDDFGDTIGA AFPLGIDGTN SDGRPSSRVS GIIGSSTDSD MFRITVPEGP LTIQADPAAV
GPNLDIKLSL LDSAGNSIQI VKDNPTTADS LSAGIALPQI AAGTYYVKVE GTGRGSPLTP
DPKIGLPWGY TKYGSLGSYT LAVIYAPVLA ADQISTLTKE RTQAMGFNIS LLSPGAWNVF
VSAGTSLVKS YGGASGAALT DAAILPSGAK LNAGESLRAG GMTLTMQADG NLVIYDAAHK
VLFASTTNSV DNQGSALNMQ ADGNLVIRNP AGKSVWASGT DDFGGEYMVF GSDGTLSLRT
GSRINWLLPS QLAVLTSAQV SVLSAAQIQG LDTAVNNLSS DALNFISSLG STVVKRWGGV
TGVALTEVKK IAAGGKLNSG VTLTAGSVSL VMQGDGNLVI YDTAHKAVWA SNTNSVDNQG
AHLNMQATDG NLVMYTPAGK AIWASNTNDN RGEFMVFGSD ACMSLQTGSR VSWFTAAQVQ
SLTQAGKMPV AH