Gene TM1040_3387 details


Gene Information

Locus tag: TM1040_3387
Symbol:
ID: 4075287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 404374
End bp: 407385
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 59%
IMG OID: 638004896
Product: hypothetical protein
Protein accession: YP_611621
Protein GI: 99078363
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases; [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID:
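
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 404374
end_bp = 407385

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in non-overlapping triplets (codons).
codons, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa (leftover bases: {leftover})")
```

Under those assumptions the computed values agree with the 3012 bp and 1003 aa listed in the record.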


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.276595
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.86306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCATT GGGCATCCGC TATGGAGCAG CACTGGGGCA TTCAAGCGGA GCTGAGGCAG 
CTTGATGGCG AATATGACCT CAACTTCCTC GCCGAAACCC CGGAGGGCAC GGGATATGTT
GTCAAAGCCA TGCGCCCTGA TTGTGAAGAC TGGCTCGTGG ATATGCAAAT CCGCACCCTG
GATCATATCT CAGCAGCAGA TGCCGATCTT CCCTGCCCCC GTGTCATCCC TGCGCGCTCT
GGCCAGAAAA TGCTGCGCCT CGAAGATGCG CATGGCAACG CGCGTCTGGT TTGGGTGATC
GAGCGCCTTG CTGGCAAATG TTATGCAGAG GCCGCCCCCA AAACCGACGC CTTGATTGCG
CAGGTTGGGC AGATTTTGGC TCGCACAACG GTTTCGCTCC AAGACTTTGA CCATCCTCAT
CTTGCGCGGG ATTTCAAATG GGATCTGATG CAGGCCGATT GGGTGAATGG CGCGCTCGAT
TGCCTTGAAG ATGGCGCGCG CAGAGCTTTG ATCGCGGACA TTGGTGATCA GTTTACGGCG
CTCAAGCCAG CACTTGCAAA GCTGCCGCAG CAGGCGATCC ACAATGATGC CAACGACTAC
AATATCCTCG TGAACGGCGG CGCGGGGACC GGTATTGAGC CCAAGATCTC GGGACTCATT
GATTTTGGCG ACATGTGTCG CGCGCCGCGC ATCTGTGATC TTGCGATTGC AGCGGCGTAT
GTGGTGCTCG ACCACCCCAA GCCCGAAGCG GCGCTGACAG CCCTGGTGTC GGGCTATCAC
GCCGAAAACC CGCTGAGCGC GCCTGAGCTC GATCTTCTCT GGCCGCTCCT CAGAATGCGC
CTCGCCGTTA GCGTGGTGAA CTCCACCCTG ATGGCAACAG AGAACCCGCA TGATCCCTAT
GTGACGATCT CGCAGGCCCC CGCATGGCGG TTTCTTGAAG GGCACGACCT GAACGGCGAT
CTGATGGCGG CGCGGCTGCG CGCGGCCTGT GGCCTTCCCG TGGTTGAAGG CGCAGATCGG
GTCATGGCCT GGCTTGATGA GGCGCGAGGC AGTTTCGCTC CGCTCATGGG GCAGGACCTT
ACCGATGTAC CGATGGGATC GCTCTCGGTT GAAAAGAGCC TCTGGCCGCA AAACCCTTTT
GACATGCCAC TGGCCGAGGC TGCGCGCGTG GGAGAGGAAT TTAACACCGA AGATCAGATC
TGGCTTGGCT ATTACCACGA GCCGCGTCTG ATCTACACCG CGCCTGCATT CCGCAATGGT
CCATGGAAAG CCAGCGACCG CCGCACGGTG CATCTGGCTG TCGACGGGTT TGCGCCTGCT
GGCACCACGC TCCATGCCCC CCTCGAGGGA GAAGTCTGGG TCGTCGAGAA CCGTGACAGC
CATCTCGATT ACGGGGGCGT GATCATCCTG CGTCACAAGA CCCCGGAGGG AGACCCGTTC
TACACCCTTT ATGGGCATCT CGACCCCGAG GTTGTCACCC GACTGCAGCC CGGCGACCCC
ATCACAAAGG GGGAGGCGTT CTGTCGACTT GGAACAGCTG AGGAGAACGG GGGCTGGGCA
CCACATGTGC ATTTCCAACT GGCGCTGAGC TGCGACGGCA TCGAGACCGA TTGGCCTGGC
GTGGGATCGC CAGATGACAT GTATCTGTGG CGTGCGCTCT GCCCCAATCC AGCCGCGCTT
CTGAATCTGC CTGATGACAA GACCTGCTAT CGTCCCACGG ACAAATCCAC AGTTCTTGAA
AAAAGACGCG CGCATTTTGG CGGCAATCTC AGTCTTACAT ATTCCGACCC GGTGATGCTT
GTGCGGGGGT GGAAACACCA TCTGTTCGAC GAATGGGGTC GACCCTATCT AGACGCCTAC
AACAATGTGC CGCATGTGGG CCACGCGCAC CCTCGCATTC AAGCGGTTGC GGCCGATCAG
CTGCGCCGTA TGAATTCCAA CACCCGTTAC TTGCACCCGG CGCAGACTGC CTTTGCCGAC
AAAGTCCTGT CAAAATTGCC CGATCACTTC GAAGTCTGTT TCTTCGTGAA CTCCGGCACC
GAAGCCAATG AGCTGGCGCT GCGCCTAGCG CGAGCACACA CGGGTGCAAA GGGCATGGTC
ACGCCGGATC ACGGCTATCA TGGCAACACA ACCGGTGCGA TCGACCTTTC AGCTTACAAA
TTCAACAAGC CGGGTGGCGT CGGGCAGGCG GATTGGGTGG AGCTGGTCGA GGTCGCGGAC
GACTATCGCG GCAGCTTTCG TCGCGACGAT CCGGACCGCG CTCAGAAATT TGCGGATCTT
GTCGACCCGG CGATTGCAAC TCTGAACTCG AAGGGGCACG GCATTGCGGG CTTTATCGCG
GAAACATTCC CGTCGGTTGG GGGCCAGATT ATCCCGCCCA AGGGCTACTT GCCTGCAGTA
TATGAAAAAA TCCGCGCAGC GGGTGGTGTT TGCATCGCAG ATGAGGTTCA GACCGGCCTC
GGACGACTGG GCGAGTATTA CTTTGGCTTT GAGCACCAAG GCGCCCTGCC CGACATCGTG
GTAATGGGCA AGCCCATCGG CAATGGCCAT CCGCTCGGGG TGCTGGTCAC AACCAAGGCG
ATTGCCGAGA GTTTCGACAA CGGGATCGAG TTCTTCTCGA CCTTTGGGGG CTCCACCCTG
TCCTGTCGGA TCGGCAAGGA AGTGCTCGAC ATCGTGGATG ACGAAGGGTT GCAGGAAAAT
GCCCGCGCCC GTGGGGCGGA GCTGATCTCA GGGCTCAGAG CCCTCGAGCG CAAATATGCC
TGCGTCGGAG ATGTGCGTGG GGTGGGGCTG TTTCTGGGGT TGGAACTGAT CCATGCCGAC
GGCTCCGAGG CGACAGAGAT CTGTTCCTAC GTCAAAAACC GGATGCGCGA TCACCGTATC
CTGATTGGCA GCGAAGGCCC CAAGGACAAC ATCCTGAAAA TCCGCCCCCC CCTCACCATC
GAAGCCGAAG ACGTGGAGAT GCTGGTGAGC GTGCTAGATG AGGTGCTGGC TGAGATCAAT
CCGGCTGAGT AA
 
Protein sequence
MDHWASAMEQ HWGIQAELRQ LDGEYDLNFL AETPEGTGYV VKAMRPDCED WLVDMQIRTL 
DHISAADADL PCPRVIPARS GQKMLRLEDA HGNARLVWVI ERLAGKCYAE AAPKTDALIA
QVGQILARTT VSLQDFDHPH LARDFKWDLM QADWVNGALD CLEDGARRAL IADIGDQFTA
LKPALAKLPQ QAIHNDANDY NILVNGGAGT GIEPKISGLI DFGDMCRAPR ICDLAIAAAY
VVLDHPKPEA ALTALVSGYH AENPLSAPEL DLLWPLLRMR LAVSVVNSTL MATENPHDPY
VTISQAPAWR FLEGHDLNGD LMAARLRAAC GLPVVEGADR VMAWLDEARG SFAPLMGQDL
TDVPMGSLSV EKSLWPQNPF DMPLAEAARV GEEFNTEDQI WLGYYHEPRL IYTAPAFRNG
PWKASDRRTV HLAVDGFAPA GTTLHAPLEG EVWVVENRDS HLDYGGVIIL RHKTPEGDPF
YTLYGHLDPE VVTRLQPGDP ITKGEAFCRL GTAEENGGWA PHVHFQLALS CDGIETDWPG
VGSPDDMYLW RALCPNPAAL LNLPDDKTCY RPTDKSTVLE KRRAHFGGNL SLTYSDPVML
VRGWKHHLFD EWGRPYLDAY NNVPHVGHAH PRIQAVAADQ LRRMNSNTRY LHPAQTAFAD
KVLSKLPDHF EVCFFVNSGT EANELALRLA RAHTGAKGMV TPDHGYHGNT TGAIDLSAYK
FNKPGGVGQA DWVELVEVAD DYRGSFRRDD PDRAQKFADL VDPAIATLNS KGHGIAGFIA
ETFPSVGGQI IPPKGYLPAV YEKIRAAGGV CIADEVQTGL GRLGEYYFGF EHQGALPDIV
VMGKPIGNGH PLGVLVTTKA IAESFDNGIE FFSTFGGSTL SCRIGKEVLD IVDDEGLQEN
ARARGAELIS GLRALERKYA CVGDVRGVGL FLGLELIHAD GSEATEICSY VKNRMRDHRI
LIGSEGPKDN ILKIRPPLTI EAEDVEMLVS VLDEVLAEIN PAE
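
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the code below strips that layout and translates DNA codon by codon. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in which codons may act as starts). The helper names `clean` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown in this record.
first_line = "ATGGATCATT GGGCATCCGC TATGGAGCAG CACTGGGGCA TTCAAGCGGA GCTGAGGCAG"
last_line = "CCGGCTGAGT AA"

print(translate(first_line))
print(translate(last_line))
```

The first line translates to the first twenty residues of the protein sequence (MDHWASAMEQ HWGIQAELRQ), and the last line gives its final three residues (PAE) followed by the TAA stop codon.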