Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3635 |
Symbol | |
ID | 4075063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 691965 |
End bp | 693041 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005155 |
Product | extracellular solute-binding protein |
Protein accession | YP_611864 |
Protein GI | 99078606 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.248377 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGATA AGCTTGCGCA ATTCAGCCGT ATCGCGGTGG TCACAACCGC GATGGCAACA CTACCGATGA TGGCAGGGGC GGAAACGCTG CGCCTTCTGA CTTGGGGCGG CTACGCGCCC GAAGACGTCA TTGCGAAATT CGAAGAAGAA ACCGGCCACA CAGTCGAAGT GACCACCTCG AACAACGAAG AGATGATCGC AAAGTTGCGC GCCACCAATG GCGGCGGTTT TGATCTGGCC CAGCCGAGCC AGGACCGCAT CACCAGCGCG CAGGAAGAGT TCGGCATCTA CAAGCCGATC GATATGTCCC GCATCAATGC GGATCTGTTC ATTCCGTCGA TGCTGAGCGC GACCGCTGCA AACACGACCT TTGAGGGTGA AGTCTACGGC GTACCGCATG TCTGGGGCAC CAGCGGTCTT GTGGTGAATA CCGAGATGGC AGGCAATGTG CAGGACTACA GTGATCTTTG CGACGACTCG GTTGCAGGCA AGGTTTCTTA TCGTTTGAAG CGCCCGACTC TGATTGGTTT CGCCTATTCC ATGGGTCTGG ACCCGTTTGC GGCCTATGGC GATAGCGCTG CTTATCAGGG GATCCTCGAT CAGGTCGAAG CGAAACTCAC CGAGTGTAAA GCCAACGTCA AAACCTATTG GGATGGTGGC GACGAGATCA AAAACCTGCT GCGCTCCGGC GAAGTTGTGG CGTCCATGGC CTGGGATACC GGTGGCTGGC AGCTCAACGC TGACAACCCC GATATCACCT TTGTTGCACC AAAGTCCGGT GCGCTGGGTT GGATCGACAC CTTTGTTCTG CCTGCCCGTG GCCGTGCAGA TGATGCGGCC TATGACTGGA TCAACTTTGT GATGCGCCCG GAAATCGCGG CGATGATCAC CAACACCGCC GGGAACTTCA CTGCAGCGGT TGATGGTGAT GCAGCTGTCG ATGCGGACCT CAAAGCGCGC TACCAGAGCA GCTTTGACCA GCAGGCGATC GACAACATCA AGTGGTATCC CCCGGTGCCC GCAGGTCTCG AAGCGATGGA AGGGGCAAGC CTCGACCGGA TCAACGCGGC CAACTAA
|
Protein sequence | MTDKLAQFSR IAVVTTAMAT LPMMAGAETL RLLTWGGYAP EDVIAKFEEE TGHTVEVTTS NNEEMIAKLR ATNGGGFDLA QPSQDRITSA QEEFGIYKPI DMSRINADLF IPSMLSATAA NTTFEGEVYG VPHVWGTSGL VVNTEMAGNV QDYSDLCDDS VAGKVSYRLK RPTLIGFAYS MGLDPFAAYG DSAAYQGILD QVEAKLTECK ANVKTYWDGG DEIKNLLRSG EVVASMAWDT GGWQLNADNP DITFVAPKSG ALGWIDTFVL PARGRADDAA YDWINFVMRP EIAAMITNTA GNFTAAVDGD AAVDADLKAR YQSSFDQQAI DNIKWYPPVP AGLEAMEGAS LDRINAAN
|
| |