Gene Information |
Locus tag | TM1040_3387 |
Symbol | |
ID | 4075287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 404374 |
End bp | 407385 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004896 |
Product | hypothetical protein |
Protein accession | YP_611621 |
Protein GI | 99078363 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases; [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | |
|
|
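The coordinate and length fields above can be cross-checked against each other. The sketch below is only a sanity check, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 404374
end_bp = 407385
gene_length_bp = 3012
protein_length_aa = 1003


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of codons minus one terminal stop codon (assumes an unspliced CDS)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for this record, which agrees with "Is gene spliced | No" and the TAA stop codon at the end of the gene sequence.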
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.276595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.86306 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCATT GGGCATCCGC TATGGAGCAG CACTGGGGCA TTCAAGCGGA GCTGAGGCAG CTTGATGGCG AATATGACCT CAACTTCCTC GCCGAAACCC CGGAGGGCAC GGGATATGTT GTCAAAGCCA TGCGCCCTGA TTGTGAAGAC TGGCTCGTGG ATATGCAAAT CCGCACCCTG GATCATATCT CAGCAGCAGA TGCCGATCTT CCCTGCCCCC GTGTCATCCC TGCGCGCTCT GGCCAGAAAA TGCTGCGCCT CGAAGATGCG CATGGCAACG CGCGTCTGGT TTGGGTGATC GAGCGCCTTG CTGGCAAATG TTATGCAGAG GCCGCCCCCA AAACCGACGC CTTGATTGCG CAGGTTGGGC AGATTTTGGC TCGCACAACG GTTTCGCTCC AAGACTTTGA CCATCCTCAT CTTGCGCGGG ATTTCAAATG GGATCTGATG CAGGCCGATT GGGTGAATGG CGCGCTCGAT TGCCTTGAAG ATGGCGCGCG CAGAGCTTTG ATCGCGGACA TTGGTGATCA GTTTACGGCG CTCAAGCCAG CACTTGCAAA GCTGCCGCAG CAGGCGATCC ACAATGATGC CAACGACTAC AATATCCTCG TGAACGGCGG CGCGGGGACC GGTATTGAGC CCAAGATCTC GGGACTCATT GATTTTGGCG ACATGTGTCG CGCGCCGCGC ATCTGTGATC TTGCGATTGC AGCGGCGTAT GTGGTGCTCG ACCACCCCAA GCCCGAAGCG GCGCTGACAG CCCTGGTGTC GGGCTATCAC GCCGAAAACC CGCTGAGCGC GCCTGAGCTC GATCTTCTCT GGCCGCTCCT CAGAATGCGC CTCGCCGTTA GCGTGGTGAA CTCCACCCTG ATGGCAACAG AGAACCCGCA TGATCCCTAT GTGACGATCT CGCAGGCCCC CGCATGGCGG TTTCTTGAAG GGCACGACCT GAACGGCGAT CTGATGGCGG CGCGGCTGCG CGCGGCCTGT GGCCTTCCCG TGGTTGAAGG CGCAGATCGG GTCATGGCCT GGCTTGATGA GGCGCGAGGC AGTTTCGCTC CGCTCATGGG GCAGGACCTT ACCGATGTAC CGATGGGATC GCTCTCGGTT GAAAAGAGCC TCTGGCCGCA AAACCCTTTT GACATGCCAC TGGCCGAGGC TGCGCGCGTG GGAGAGGAAT TTAACACCGA AGATCAGATC TGGCTTGGCT ATTACCACGA GCCGCGTCTG ATCTACACCG CGCCTGCATT CCGCAATGGT CCATGGAAAG CCAGCGACCG CCGCACGGTG CATCTGGCTG TCGACGGGTT TGCGCCTGCT GGCACCACGC TCCATGCCCC CCTCGAGGGA GAAGTCTGGG TCGTCGAGAA CCGTGACAGC CATCTCGATT ACGGGGGCGT GATCATCCTG CGTCACAAGA CCCCGGAGGG AGACCCGTTC TACACCCTTT ATGGGCATCT CGACCCCGAG GTTGTCACCC GACTGCAGCC CGGCGACCCC ATCACAAAGG GGGAGGCGTT CTGTCGACTT GGAACAGCTG AGGAGAACGG GGGCTGGGCA CCACATGTGC ATTTCCAACT GGCGCTGAGC TGCGACGGCA TCGAGACCGA TTGGCCTGGC GTGGGATCGC CAGATGACAT GTATCTGTGG CGTGCGCTCT GCCCCAATCC AGCCGCGCTT CTGAATCTGC CTGATGACAA GACCTGCTAT CGTCCCACGG ACAAATCCAC AGTTCTTGAA AAAAGACGCG CGCATTTTGG CGGCAATCTC AGTCTTACAT ATTCCGACCC GGTGATGCTT 
GTGCGGGGGT GGAAACACCA TCTGTTCGAC GAATGGGGTC GACCCTATCT AGACGCCTAC AACAATGTGC CGCATGTGGG CCACGCGCAC CCTCGCATTC AAGCGGTTGC GGCCGATCAG CTGCGCCGTA TGAATTCCAA CACCCGTTAC TTGCACCCGG CGCAGACTGC CTTTGCCGAC AAAGTCCTGT CAAAATTGCC CGATCACTTC GAAGTCTGTT TCTTCGTGAA CTCCGGCACC GAAGCCAATG AGCTGGCGCT GCGCCTAGCG CGAGCACACA CGGGTGCAAA GGGCATGGTC ACGCCGGATC ACGGCTATCA TGGCAACACA ACCGGTGCGA TCGACCTTTC AGCTTACAAA TTCAACAAGC CGGGTGGCGT CGGGCAGGCG GATTGGGTGG AGCTGGTCGA GGTCGCGGAC GACTATCGCG GCAGCTTTCG TCGCGACGAT CCGGACCGCG CTCAGAAATT TGCGGATCTT GTCGACCCGG CGATTGCAAC TCTGAACTCG AAGGGGCACG GCATTGCGGG CTTTATCGCG GAAACATTCC CGTCGGTTGG GGGCCAGATT ATCCCGCCCA AGGGCTACTT GCCTGCAGTA TATGAAAAAA TCCGCGCAGC GGGTGGTGTT TGCATCGCAG ATGAGGTTCA GACCGGCCTC GGACGACTGG GCGAGTATTA CTTTGGCTTT GAGCACCAAG GCGCCCTGCC CGACATCGTG GTAATGGGCA AGCCCATCGG CAATGGCCAT CCGCTCGGGG TGCTGGTCAC AACCAAGGCG ATTGCCGAGA GTTTCGACAA CGGGATCGAG TTCTTCTCGA CCTTTGGGGG CTCCACCCTG TCCTGTCGGA TCGGCAAGGA AGTGCTCGAC ATCGTGGATG ACGAAGGGTT GCAGGAAAAT GCCCGCGCCC GTGGGGCGGA GCTGATCTCA GGGCTCAGAG CCCTCGAGCG CAAATATGCC TGCGTCGGAG ATGTGCGTGG GGTGGGGCTG TTTCTGGGGT TGGAACTGAT CCATGCCGAC GGCTCCGAGG CGACAGAGAT CTGTTCCTAC GTCAAAAACC GGATGCGCGA TCACCGTATC CTGATTGGCA GCGAAGGCCC CAAGGACAAC ATCCTGAAAA TCCGCCCCCC CCTCACCATC GAAGCCGAAG ACGTGGAGAT GCTGGTGAGC GTGCTAGATG AGGTGCTGGC TGAGATCAAT CCGGCTGAGT AA
|
Protein sequence | MDHWASAMEQ HWGIQAELRQ LDGEYDLNFL AETPEGTGYV VKAMRPDCED WLVDMQIRTL DHISAADADL PCPRVIPARS GQKMLRLEDA HGNARLVWVI ERLAGKCYAE AAPKTDALIA QVGQILARTT VSLQDFDHPH LARDFKWDLM QADWVNGALD CLEDGARRAL IADIGDQFTA LKPALAKLPQ QAIHNDANDY NILVNGGAGT GIEPKISGLI DFGDMCRAPR ICDLAIAAAY VVLDHPKPEA ALTALVSGYH AENPLSAPEL DLLWPLLRMR LAVSVVNSTL MATENPHDPY VTISQAPAWR FLEGHDLNGD LMAARLRAAC GLPVVEGADR VMAWLDEARG SFAPLMGQDL TDVPMGSLSV EKSLWPQNPF DMPLAEAARV GEEFNTEDQI WLGYYHEPRL IYTAPAFRNG PWKASDRRTV HLAVDGFAPA GTTLHAPLEG EVWVVENRDS HLDYGGVIIL RHKTPEGDPF YTLYGHLDPE VVTRLQPGDP ITKGEAFCRL GTAEENGGWA PHVHFQLALS CDGIETDWPG VGSPDDMYLW RALCPNPAAL LNLPDDKTCY RPTDKSTVLE KRRAHFGGNL SLTYSDPVML VRGWKHHLFD EWGRPYLDAY NNVPHVGHAH PRIQAVAADQ LRRMNSNTRY LHPAQTAFAD KVLSKLPDHF EVCFFVNSGT EANELALRLA RAHTGAKGMV TPDHGYHGNT TGAIDLSAYK FNKPGGVGQA DWVELVEVAD DYRGSFRRDD PDRAQKFADL VDPAIATLNS KGHGIAGFIA ETFPSVGGQI IPPKGYLPAV YEKIRAAGGV CIADEVQTGL GRLGEYYFGF EHQGALPDIV VMGKPIGNGH PLGVLVTTKA IAESFDNGIE FFSTFGGSTL SCRIGKEVLD IVDDEGLQEN ARARGAELIS GLRALERKYA CVGDVRGVGL FLGLELIHAD GSEATEICSY VKNRMRDHRI LIGSEGPKDN ILKIRPPLTI EAEDVEMLVS VLDEVLAEIN PAE
|
| |
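The GC content field in the Gene Information table can in principle be recomputed from the gene sequence. The following is a minimal sketch of such a calculation, assuming the sequence is given as space-separated blocks of ten bases as in this record; `gc_fraction` is a hypothetical helper name, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (spaces, newlines) between blocks is ignored, so a sequence
    can be pasted in the blocked layout used by the record.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used as a small example.
example = "ATGGATCATT GGGCATCCGC"
example_gc = gc_fraction(example)
print(f"{example_gc:.2f}")
```

Applying the same function to the complete gene sequence would give a value to compare with the reported GC content, which is rounded to a whole percent in the record.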