Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0113 |
Symbol | |
ID | 4078698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 119466 |
End bp | 121451 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638005400 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_612108 |
Protein GI | 99079954 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTCAGA CCGCTGCGCA GACATCGTCT GGCGCCGGGC AGCTTATGTC TGTTCCGGCC CTGCTGGAGA GAAACGCAAC TGTCCATGCC AACCGCCCGG CCTACCGGGA AAAAGAGTTT GGCATCTGGC AAAGTTGGAC CTGGAAAGAA ACCGCCGAGG AAATCGAGGC GCTGGCGCTT GGGTTGATGA ATCTTGGTGT GGCCGAGGGA GATTTCATTG CTATCATCGG ACGCAACCGG CCATCGCTCT ATTGGTCGAT GGTGGCGGCA CAGAGCGTCG GTGCAGTGCC GGTGCCGGTC TATCAGGACT CAAGTGCCGA GGAGATGGCC TATGTGCTGG ATCACTGCGG TGCGGCCTAT GTGATTGCGG GCGATCAGGA ACAGGTCGAC AAGGTGCTGG AGGTTCAGGA CACGCTGACC AACCTCAAGC ATATGATCTA TGTGGACCCC AAGGGTCTGC GCAAATACGA CCACCACCAG CTGCATCAAT ACAGCCACGT GCAGGAACAG GGCCGCGCCG CGCGCGATGA GCTCAGCTCG GATCTTGCGG CGCGTAAGGC GAAGCTCACG TATGATAGCA CCTGCGTGAT GCTCTATACG TCGGGCACCA CGGGGCGCCC CAAGGGCGTT GTGCTGTCCA ATCGCAACGT GATCGAGAGC GCCAAGAACG CCAGCACCTT TGACAAGCTC ACCGAGAATG AAGACATCCT GTCCTACCTG CCAATGGCTT GGGTCGGGGA TTTCATCTTC TCGCTGGGAC AGTCCTACTG GTGCGGGTTC TGCGTCAATT GCCCCGAGAG CGAAGACACC ATGATGACCG ATCTGCGTGA GATCGGCCCA ACTTATTTCT TTGCACCGCC GCGCGTGTTT GAGGGTCAGC TCACCAATGT GATGATCCGC ATGGAAGACG CGGGTAAGCT CAAGCAGAAG ATGTTTCATC ACTTCTTGGC CCATGCCAAG AAAGTCGGCG GCCTGATCCT GGATGGCAAA CCTGTCGGCA TGATGGATCG CCTGAAGTAT CGCCTTGGGG ATCTCTTGGT TTACGGTCCC CTGAAGGATA CGCTTGGCTA TGGTCGCATT CGCGTCGGCT ACACCGCCGG TGAGGCGATT GGACCGGAGA TTTTTGATTT CTACCGCTCG CTGGGGATCA ATCTCAAACA GCTTTATGGT CAGACAGAGG CCACGGTGTT CATTACTGTG CAGCCTGATG GCGAAGTGCG CGCGGATACG GTGGGCGTGC CCGCACCGGA TGTGGAGATC AAGATCGACG ACAAGGGTGA GATCCACTAC CGCTCGCCGG GGACCTTTGT GGAATACTAC AAGAATGCGG AATCCACCGC TTCGACAAAG GACGCCGAGG GCTGGGTCGC CACGGGTGAT GCGGGCTTTA TCGAGGAAAG CTCCGGCCAT CTGCGGATCA TCGACCGCGC CAAGGATGTC GGCAAAATGG CCAGCGGCGC GATGTTTGCG CCCAAGTACG TCGAGAACAA GCTGAAGTTC TATCCTGACA TTCTCGAGGC CGTGCTATTT GGCAACGGCC GGGATCGCTG TGTGGCCTTT ATCAACATCG ACCTCACGGC GGTGGGCAAC TGGGCAGAGC GCAACAATAT TGCCTACGCC TCCTATCAGG AGCTCGCAGG CCACCCGCGC GTGCTGGAGA CCATCCGCAA CCATGTTGAG GCGGTGAATG TCTCAGTCGC GCAGGATGAG ATGCTGTCTG GCTGTCAGGT GCATCGCTTC GTGGTGCTAC ACAAGGAGCT CGATGCCGAT GATGGCGAGA TGACCCGCAC CCGCAAGGTG CGCCGTCGCA TCGTCGAGGA GAAATTCGCC GACATTATTG CCGCGCTTTA TGATGGTTCC GAGCAGATCT CGACCCGGAC CGAAGTGACA TATGAAGACG GTCGCAAAGG CGCGATCAGT GCAACGCTGA CCTGTGTGGA TGCGAAGGTG CAATCGCCCA TGGCGCAGCA GGTGGCAGCG GAATGA
|
Protein sequence | MSQTAAQTSS GAGQLMSVPA LLERNATVHA NRPAYREKEF GIWQSWTWKE TAEEIEALAL GLMNLGVAEG DFIAIIGRNR PSLYWSMVAA QSVGAVPVPV YQDSSAEEMA YVLDHCGAAY VIAGDQEQVD KVLEVQDTLT NLKHMIYVDP KGLRKYDHHQ LHQYSHVQEQ GRAARDELSS DLAARKAKLT YDSTCVMLYT SGTTGRPKGV VLSNRNVIES AKNASTFDKL TENEDILSYL PMAWVGDFIF SLGQSYWCGF CVNCPESEDT MMTDLREIGP TYFFAPPRVF EGQLTNVMIR MEDAGKLKQK MFHHFLAHAK KVGGLILDGK PVGMMDRLKY RLGDLLVYGP LKDTLGYGRI RVGYTAGEAI GPEIFDFYRS LGINLKQLYG QTEATVFITV QPDGEVRADT VGVPAPDVEI KIDDKGEIHY RSPGTFVEYY KNAESTASTK DAEGWVATGD AGFIEESSGH LRIIDRAKDV GKMASGAMFA PKYVENKLKF YPDILEAVLF GNGRDRCVAF INIDLTAVGN WAERNNIAYA SYQELAGHPR VLETIRNHVE AVNVSVAQDE MLSGCQVHRF VVLHKELDAD DGEMTRTRKV RRRIVEEKFA DIIAALYDGS EQISTRTEVT YEDGRKGAIS ATLTCVDAKV QSPMAQQVAA E
|
| |