Gene Information |
Locus tag | TM1040_2552 |
Symbol | |
ID | 4076683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2694826 |
End bp | 2696676 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638007876 |
Product | extracellular solute-binding protein |
Protein accession | YP_614546 |
Protein GI | 99082392 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
| |
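The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2694826
end_bp = 2696676

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = codon_count - 1

print(gene_length, codon_count, remainder, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1851 bp) and Protein Length (616 aa).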
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.437338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0565568 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCG CGGCCCTGAT GACCTCGGCG CTTGCGGTAA AGAGCACAGA GAGCGAGGCG ATCACCGTCA GCCATGGCTA TTCGTTTTAC GGCGATCTGG ATTATCCGGC TGATTTTGAA CACTTCGACT ATGTAAACCC GGACGCGCCG AAGGGAGGGG AGCTTGCGAT TCCCTTCATC GGGACTCTCG ACAGCATGAA CCCCTATTCT GGCAAGGGTC GGGCGCATGC GTTTTCTGTC TACCCTTACG AGAGCCTCCT CTCGGATGCG GCACCTGCGG ATAAATATGG TCAGTCCTAC TGCCTGCTCT GCGAAACCGT GGAATATCCC GAAGACAAGA GCTGGGTGAT CTTTCGCATG CGTCCCGAGG CGCGGTTCTC CGACGGCACA CCGGTCACCG CACATGACAT CGCCTTCAGC CACAACCTGC TGCTGGATCA AGGTTTGAAG TCCTACGCAG ACTTCATTCG CCGGCTGATC CCAAAAGTTG AGGTGATCGA CGATCACACC ATCAAGTTCT ATTTTGCTGA TGGCGTCTCA CGCCGGAGCC TGATCGAGCA GGTGGGCGGC GTACCGGCCT GGTCGAAGAA GTGGTTCGAT GAAACGGGAC AGAAGCTCAA CGAGAACTGG ATCGAGGGGC CTCCGGGCTC TGGTCCCTAT GTGATCGAGG AAATCGACCT GTCGCGTCGT ATCGTGCTGA AACGCAACCC TGATTATTGG GGCAAGGATC TGCCGGTCAA CCAGGGGCGG CACAACTTTG ATTCGCTGCG GGTGGAAATC TTTGCCGACG ACACGGCAGC CTTTGAGGCC TTCAAGGCGG GCGAGTATAC CTTCCGCGCC GAAGGCGACA GCAAGAAATG GGCCAGCGGC TATGACTTTC CGAAGGTCGA TAGCGGCGCG GTGAAACTCG AAGAGCTGCC AAATGGTGCA CCGCCTGCGT CATCGGGCAT CGTGTTCAAT CTCGCCTCTC CCGAATTGCA AGATCGGCGT GTGCGCGAAG CGCTGGCTCT GGCGTTCAAC TTCGAGTGGT CCAACGAGAG CCTTCTCTAT GGTCTCTTTG ATCGCCGTGC GTCGTTCACC CAGAACTCCC CCCTGATGGC CAAAGGTGTG CCCGAGGGCG CCGAGCGCGC GTTTCTGCAG AGCCTCGGCG ACCTTGTTCC GGACGAGATG CTGACCGAGG AGGTCTATAT CCCGCCGACC TCCGATCCTT CGCGACTGTT TGACCGCCGC AACGCACGCA AGGCAGCGGC CTTGCTGGAT GCGGCCGGGT ATACGGTGGG CGATGGCGGC ATGCGCATGT CTCCGGATGG CTCTGCGTTT GAGCTGGATT TCCTGTTTTC CTCCTCATCC TCGCCCACCA CGCGCGGGGT GATGGAGAAC TTCGTCGACA ACCTCGAAAA CCTTGGCGTT CAGGTCAATT TCGAGGTGGT GGATACGGCG CAATACACCA GCCGGGAACG GGACCGCGAT TATGATCTGG TGGTCGACAG CTACACCACG TTCCTTGGCA CCGGTACCGG TCTGGAGCAG CGGTTTGGGT CCGAGGCCGC AGCCTTCTCG CTGTTCAACC CCGCCGGTTT GGCCTCCCCG CTGGTGGATG AGATCATCAC GAGATCGCTG CACGCTGAGA CCCGCGAAGA AGAAACTACC GTCATGACGG CGCTGGATCG CGCTTTGCGG CATGAGTTCA TCATGATCCC GCTGTGGTAT AACCCGAACC ATTGGGCCGC CTATTACGAT CAATATGAGC ACCCGGCGGA AATCCCGCCC TATAGCCTCG GGTATCTGGA CTTCTGGTGG 
TATAACGAGG ACAAGGCCAA AGCGCTGCGC GATGCTGGCG CGCTGAGGTA A
|
Protein sequence | MSTAALMTSA LAVKSTESEA ITVSHGYSFY GDLDYPADFE HFDYVNPDAP KGGELAIPFI GTLDSMNPYS GKGRAHAFSV YPYESLLSDA APADKYGQSY CLLCETVEYP EDKSWVIFRM RPEARFSDGT PVTAHDIAFS HNLLLDQGLK SYADFIRRLI PKVEVIDDHT IKFYFADGVS RRSLIEQVGG VPAWSKKWFD ETGQKLNENW IEGPPGSGPY VIEEIDLSRR IVLKRNPDYW GKDLPVNQGR HNFDSLRVEI FADDTAAFEA FKAGEYTFRA EGDSKKWASG YDFPKVDSGA VKLEELPNGA PPASSGIVFN LASPELQDRR VREALALAFN FEWSNESLLY GLFDRRASFT QNSPLMAKGV PEGAERAFLQ SLGDLVPDEM LTEEVYIPPT SDPSRLFDRR NARKAAALLD AAGYTVGDGG MRMSPDGSAF ELDFLFSSSS SPTTRGVMEN FVDNLENLGV QVNFEVVDTA QYTSRERDRD YDLVVDSYTT FLGTGTGLEQ RFGSEAAAFS LFNPAGLASP LVDEIITRSL HAETREEETT VMTALDRALR HEFIMIPLWY NPNHWAAYYD QYEHPAEIPP YSLGYLDFWW YNEDKAKALR DAGALR
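The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is an illustration only: it builds the amino acid assignments of NCBI translation table 11 (which share the standard code's codon-to-amino-acid mapping) and translates a nucleotide string until the first stop codon. The `translate` helper is hypothetical and not part of the record. Although the gene is annotated on the minus strand, the sequence as listed starts with ATG and ends with TAA, so it appears to be given in coding orientation and is translated as is.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(nucleotides.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 21 bases of the gene sequence, copied from the record.
first_bases = "ATGTCCACCG CGGCCCTGAT GACCTCGGCG"
last_bases = "GATGCTGGCG CGCTGAGGTA A"

print(translate(first_bases))  # MSTAALMTSA
print(translate(last_bases))   # DAGALR
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. Passing the full gene sequence to the same function should yield the complete protein.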