Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_4958 |
Symbol | |
ID | 4041820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | - |
Start bp | 1624878 |
End bp | 1625969 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637980379 |
Product | fimbrial protein |
Protein accession | YP_587089 |
Protein GI | 94313880 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00524555 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.337439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACTG AAATTGTAAA GAGCGTCTTC AGAAATCTGC CGATTGGCAG GAGCAGTTCC CTTCGGGCGC CAACGTATCG GCACAACAGA ACCGGCACCA TTTACGGCCT TTTTGTAAGG ATACTTGCCG TTCTGGCGTT CGCAATGGGG ATGCAATCCA CGGCACACGC CTACGCCTGT ACGAACACAA GCGGCTTACC GAATGCGATC AGCTATCCGG GCACTGTCGC CGTACCGTCC AGCTTGCTGC CGGGAGATAC CATTCCAGGT ACGGTACGCG CATTTTCAAT GTCGGGAACC TGTACTATTG GCTGGGGTGG GCCCCTGACG ATCCAGGTAG GCAGTTCCAT CGTTGCGTGC ACCCGAGATG GTGGCTCGAC GGAAGTCATG CCAGGTGTTT ATACGACTGG CATTAGTGGC GTCGGGATGC GGCTGCGCAA CAGCTCTGGG ACCCCCATCG TCAATGGCTC CGGACAGGCT TGCTTCTCGT CCATTGCGCA GATTGGAGCG GGAGGCAGCT ACAACTTCTC CGGCACGTTG GAGTTAGTCA GGATCGCGGG CCCGATTGTC GACGGATCGG TCATGAACAA CGCGGCCTGG GTGTTTGGTG TCTATAACAC CAACGGTTTG CTCAACATCG ACAGCGTCAC TGCCAGCCAG ATTTATCCGG CTGGTGCCAT TACGCTGAAA AGCATCGCCT GTACACCCAC TGCTCCGGCG GTGGTGCGGC TCCCGACGAT CAATCAGGGG GCACTCTCTG CCGGAACCGC GGGGGCCACG GCATTCGCCA TCGGGCTCAG ATGCGACGCT AGCGCCCGGG TCGGTATTTC GCTCGATGCC GCTGCCGGAT TGAGTGTGAT TGATGCGAAC AACGGCATAT TGAGTGTGCA GACCGGCGGT GCAGGGGGCG TGGGCGTACA GATCGTCGAC CAATACCAGG CGCCTGTACG CTTGCAGTCG CGCGTCGATA TGGGCACGAT CAGCGCCAAT GTTCAGAACA GCTTTCCCTT TATGGCGCGC TACATTCGGG TCGGGACCGT AGCGGCAGGT GCCGTGACTT CTGCAATGAC CTTCACGTTT GACTACCAGT AG
|
Protein sequence | MSTEIVKSVF RNLPIGRSSS LRAPTYRHNR TGTIYGLFVR ILAVLAFAMG MQSTAHAYAC TNTSGLPNAI SYPGTVAVPS SLLPGDTIPG TVRAFSMSGT CTIGWGGPLT IQVGSSIVAC TRDGGSTEVM PGVYTTGISG VGMRLRNSSG TPIVNGSGQA CFSSIAQIGA GGSYNFSGTL ELVRIAGPIV DGSVMNNAAW VFGVYNTNGL LNIDSVTASQ IYPAGAITLK SIACTPTAPA VVRLPTINQG ALSAGTAGAT AFAIGLRCDA SARVGISLDA AAGLSVIDAN NGILSVQTGG AGGVGVQIVD QYQAPVRLQS RVDMGTISAN VQNSFPFMAR YIRVGTVAAG AVTSAMTFTF DYQ
|
| |