Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0431 |
Symbol | |
ID | 4076191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 442988 |
End bp | 444295 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638005726 |
Product | extracellular solute-binding protein |
Protein accession | YP_612426 |
Protein GI | 99080272 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACCTGA GAAACGCACT TTGTGCGGCC TCTGCACTTG CGGTTATGGC AACTGGCGCC GTCCAGGCCG AGACCACATT GACCATCGCG ACCGTGAACA ACGGCGACAT GATCCGGATG CAGGGCCTCA CTGACGACTT CACCGCCAAA CATCCCGACA TCCAGCTGGA GTGGGTCACG CTCGAAGAGA ACGTGCTGCG TCAGCGCGTG ACACAAGACA TCGCCACCAA CGGCGGCCAG TTCGATGTGA TGACCATCGG CATGTACGAA ACCCCGATCT GGGCCGCTCA GAATTGGCTT GTGCCGCTGA CCGACATGGG GGCTGATTAC GACGCGGACG ACATCCTGCC CGCGATGCGC GCTGGCCTTT CACACAATGG CACTCTCTAT GCCGCGCCGT TTTACGGTGA AAGCTCCATG GTCATGTATC GCACCGACCT GATGGAGGCC GCGGGCCTGA CAATGCCCGA AGCCCCCACA TGGGAGTTCA TCAAGGAAGC CGCCGCCGCG ATGACAGACA AGGACGCCGA AATCTACGGC GCATGCCTGC GCGGCAAGGC AGGATGGGGC GAGAACATGG CCTTCATCAC CACCGTGGCG AACAGCTTTG GTGCGCGCTG GTTTGACGAA GACTGGACGC CGCAGCTCGA CAGCCCCGAG TGGAAAGAGG CGGTCACTTT CTACAACGAT CTGCTGCAAA GCTACGGACC TCCGGGTGCC TCTACAAATG GGTTCAACGA GAACCTCGCA CTGTTCCAGC AAGGCAAGTG TGGCATGTGG ATCGACGCGA CTGTGGCGGC CTCCTTCGTG ACCAACCCCG ACGATTCCAC CGTCGCTGAC AAGGTTGGAT TTGCTCTGGC ACCCGACACC GGCAAGGGCA AACGGGCCAA CTGGCTCTGG GCCTGGGCGC TGGCCGTGCC TGCGGGTTCT GACGCCAAGG ATGAGGCCAA GGCATTCATC GAATGGGCCA CCTCCAAGGA GTATCTTGCG CTTGTGGCGG AAAACGAAGG TTGGGCCAAT GTACCGCCTG GCAGCCGCAC GTCGCTTTAT GAGAACCCTG AATACGCCAA GGTTCCCTTT GCGCAGATGA CGCTCGACTC GATCAATGCG GCGGACCCCA ACAGCCCCAC CGTGGATCCC GTGCCCTATG TGGGCATCCA GTATGTCGCG ATCCCCGAAT GGGCCGGCAT CGGCACCAGC GCAGGCCAGG AATTCTCGGC CATGGTCGCA GGTCAGCAAA CCCCGGACGA AGCGCTTGCA AAAGCACAGG CTCTGGTCGC TGACGAAATG GAAGCCGCAG GCTACTAA
|
Protein sequence | MYLRNALCAA SALAVMATGA VQAETTLTIA TVNNGDMIRM QGLTDDFTAK HPDIQLEWVT LEENVLRQRV TQDIATNGGQ FDVMTIGMYE TPIWAAQNWL VPLTDMGADY DADDILPAMR AGLSHNGTLY AAPFYGESSM VMYRTDLMEA AGLTMPEAPT WEFIKEAAAA MTDKDAEIYG ACLRGKAGWG ENMAFITTVA NSFGARWFDE DWTPQLDSPE WKEAVTFYND LLQSYGPPGA STNGFNENLA LFQQGKCGMW IDATVAASFV TNPDDSTVAD KVGFALAPDT GKGKRANWLW AWALAVPAGS DAKDEAKAFI EWATSKEYLA LVAENEGWAN VPPGSRTSLY ENPEYAKVPF AQMTLDSINA ADPNSPTVDP VPYVGIQYVA IPEWAGIGTS AGQEFSAMVA GQQTPDEALA KAQALVADEM EAAGY
|
| |