Gene TM1040_0113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0113 
Symbol 
ID4078698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp119466 
End bp121451 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content58% 
IMG OID638005400 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_612108 
Protein GI99079954 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTCAGA CCGCTGCGCA GACATCGTCT GGCGCCGGGC AGCTTATGTC TGTTCCGGCC 
CTGCTGGAGA GAAACGCAAC TGTCCATGCC AACCGCCCGG CCTACCGGGA AAAAGAGTTT
GGCATCTGGC AAAGTTGGAC CTGGAAAGAA ACCGCCGAGG AAATCGAGGC GCTGGCGCTT
GGGTTGATGA ATCTTGGTGT GGCCGAGGGA GATTTCATTG CTATCATCGG ACGCAACCGG
CCATCGCTCT ATTGGTCGAT GGTGGCGGCA CAGAGCGTCG GTGCAGTGCC GGTGCCGGTC
TATCAGGACT CAAGTGCCGA GGAGATGGCC TATGTGCTGG ATCACTGCGG TGCGGCCTAT
GTGATTGCGG GCGATCAGGA ACAGGTCGAC AAGGTGCTGG AGGTTCAGGA CACGCTGACC
AACCTCAAGC ATATGATCTA TGTGGACCCC AAGGGTCTGC GCAAATACGA CCACCACCAG
CTGCATCAAT ACAGCCACGT GCAGGAACAG GGCCGCGCCG CGCGCGATGA GCTCAGCTCG
GATCTTGCGG CGCGTAAGGC GAAGCTCACG TATGATAGCA CCTGCGTGAT GCTCTATACG
TCGGGCACCA CGGGGCGCCC CAAGGGCGTT GTGCTGTCCA ATCGCAACGT GATCGAGAGC
GCCAAGAACG CCAGCACCTT TGACAAGCTC ACCGAGAATG AAGACATCCT GTCCTACCTG
CCAATGGCTT GGGTCGGGGA TTTCATCTTC TCGCTGGGAC AGTCCTACTG GTGCGGGTTC
TGCGTCAATT GCCCCGAGAG CGAAGACACC ATGATGACCG ATCTGCGTGA GATCGGCCCA
ACTTATTTCT TTGCACCGCC GCGCGTGTTT GAGGGTCAGC TCACCAATGT GATGATCCGC
ATGGAAGACG CGGGTAAGCT CAAGCAGAAG ATGTTTCATC ACTTCTTGGC CCATGCCAAG
AAAGTCGGCG GCCTGATCCT GGATGGCAAA CCTGTCGGCA TGATGGATCG CCTGAAGTAT
CGCCTTGGGG ATCTCTTGGT TTACGGTCCC CTGAAGGATA CGCTTGGCTA TGGTCGCATT
CGCGTCGGCT ACACCGCCGG TGAGGCGATT GGACCGGAGA TTTTTGATTT CTACCGCTCG
CTGGGGATCA ATCTCAAACA GCTTTATGGT CAGACAGAGG CCACGGTGTT CATTACTGTG
CAGCCTGATG GCGAAGTGCG CGCGGATACG GTGGGCGTGC CCGCACCGGA TGTGGAGATC
AAGATCGACG ACAAGGGTGA GATCCACTAC CGCTCGCCGG GGACCTTTGT GGAATACTAC
AAGAATGCGG AATCCACCGC TTCGACAAAG GACGCCGAGG GCTGGGTCGC CACGGGTGAT
GCGGGCTTTA TCGAGGAAAG CTCCGGCCAT CTGCGGATCA TCGACCGCGC CAAGGATGTC
GGCAAAATGG CCAGCGGCGC GATGTTTGCG CCCAAGTACG TCGAGAACAA GCTGAAGTTC
TATCCTGACA TTCTCGAGGC CGTGCTATTT GGCAACGGCC GGGATCGCTG TGTGGCCTTT
ATCAACATCG ACCTCACGGC GGTGGGCAAC TGGGCAGAGC GCAACAATAT TGCCTACGCC
TCCTATCAGG AGCTCGCAGG CCACCCGCGC GTGCTGGAGA CCATCCGCAA CCATGTTGAG
GCGGTGAATG TCTCAGTCGC GCAGGATGAG ATGCTGTCTG GCTGTCAGGT GCATCGCTTC
GTGGTGCTAC ACAAGGAGCT CGATGCCGAT GATGGCGAGA TGACCCGCAC CCGCAAGGTG
CGCCGTCGCA TCGTCGAGGA GAAATTCGCC GACATTATTG CCGCGCTTTA TGATGGTTCC
GAGCAGATCT CGACCCGGAC CGAAGTGACA TATGAAGACG GTCGCAAAGG CGCGATCAGT
GCAACGCTGA CCTGTGTGGA TGCGAAGGTG CAATCGCCCA TGGCGCAGCA GGTGGCAGCG
GAATGA
 
Protein sequence
MSQTAAQTSS GAGQLMSVPA LLERNATVHA NRPAYREKEF GIWQSWTWKE TAEEIEALAL 
GLMNLGVAEG DFIAIIGRNR PSLYWSMVAA QSVGAVPVPV YQDSSAEEMA YVLDHCGAAY
VIAGDQEQVD KVLEVQDTLT NLKHMIYVDP KGLRKYDHHQ LHQYSHVQEQ GRAARDELSS
DLAARKAKLT YDSTCVMLYT SGTTGRPKGV VLSNRNVIES AKNASTFDKL TENEDILSYL
PMAWVGDFIF SLGQSYWCGF CVNCPESEDT MMTDLREIGP TYFFAPPRVF EGQLTNVMIR
MEDAGKLKQK MFHHFLAHAK KVGGLILDGK PVGMMDRLKY RLGDLLVYGP LKDTLGYGRI
RVGYTAGEAI GPEIFDFYRS LGINLKQLYG QTEATVFITV QPDGEVRADT VGVPAPDVEI
KIDDKGEIHY RSPGTFVEYY KNAESTASTK DAEGWVATGD AGFIEESSGH LRIIDRAKDV
GKMASGAMFA PKYVENKLKF YPDILEAVLF GNGRDRCVAF INIDLTAVGN WAERNNIAYA
SYQELAGHPR VLETIRNHVE AVNVSVAQDE MLSGCQVHRF VVLHKELDAD DGEMTRTRKV
RRRIVEEKFA DIIAALYDGS EQISTRTEVT YEDGRKGAIS ATLTCVDAKV QSPMAQQVAA
E