Gene TM1040_3466 details


Gene Information

Locus tag: TM1040_3466
Symbol:
ID: 4075100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 489983
End bp: 491395
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 63%
IMG OID: 638004975
Product: aromatic-L-amino-acid decarboxylase
Protein accession: YP_611700
Protein GI: 99078442
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0076] Glutamate decarboxylase and related PLP-dependent proteins
TIGRFAM ID:
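
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names gene_length and protein_length are hypothetical helpers.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(489983, 491395)
print(length)                  # 1413
print(protein_length(length))  # 470
```

Under those assumptions the results agree with the Gene Length (1413 bp) and Protein Length (470 aa) fields.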


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.811665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTGGA CCGATTTTGC ATCCTGGGGC CGCAAGGTCG CCGATTGGGC GCAGGACTAT 
CACCTCACCG TGGGCGAGCG CCCGGTGCGC GCGCAGACCA AACCCGGTGA TATGCTGACC
GCCCTGCCCG AAACACCGCC GGAACAGGGC GAGGGGATGG AGACGATCTT TGCCGATTTT
GAGGAAAAGG TGATGCCCGG CATCACCCAT TGGCAGCACC CGCGCTTCTT TGCCTATTTC
GCCTCCAACG CCGCCGCGCC CTCGGTGCTG GCGGAATTCC TGACCTCCGC CATCGCGCCG
CAATGTATGC TCTGGCAGAC CTCTCCGGCG GCGACCGAGA TGGAGACCCG GATGATGGAC
TGGCTGCGCC AATCCCTTGG CCTGCCGAGC GAATTTCGCG GCGTCATTCA GGACAGCGCC
TCTTCGGCGA CCCTTGCTGC GGTGCTCACC ATGCGCGAAC GGGCCTTGAA CTGGCAGGGC
AATCAACAGG GGCTCGCGGG CCAGCCGACC TTGCGGATCT ATTGTTCGTC CGAGGTGCAC
ACCTCGGTGG ACCGTGCGAT CTGGGTTGCG GGCATCGGTC AGGCCAACCT CGTGCGCGTA
CCCATCAAGG GCGACTGGCG CGGCATGGAT CCCGCAGCGC TTGAGACTGC GATCCAGTCG
GACAAGGCCG CAGGCCTTCA ACCCGCTGGC GTCATCCTCT GTGTGGGCGG CACCGGCACC
GGCGCCACGG ACCCCATCGC GGACTGCATC AAGGTGGCGC AGAATCATGG GCTTTATACC
CATGTGGACG CGGCATGGGC CGGTTCCGCG ATGATCTGCC CGGAGTTCCG CGCGTATTGG
CCGGGCATTG AGGGCGCAGA CAGCATCGTT TTCAATCCGC ATAAATGGCT CGGCGCGCAG
TTTGATTGCT CGGCGCATTT CCTTAAGAAC GCGGACGATC TGGTGCGCAC CCTCGCCATC
AGCCCGGAAT ACCTCAAGAC CCACGGTCAC GACGGCATCA TCAACTATTC CGAATGGTCG
GTCCCCCTGG GCCGCCGCTT CCGCGCGCTA AAGATCTGGT TCCTGATCCG CACCTATGGC
CTTGAAGGAT TGCGCCAACG CATCCGCAAT CACGTTGCAT GGTCGCGCCA GCTTCACGAC
GCGCTTGCGC AAGAGCCGGA TTTTGAAATC GTCACCCCGC CGATGTGGTC GCTCTGGACC
TTCCGCTACG CGCCTGATGG GGCCACGGAT CTCGATGCGC TGAACCTCGA ACTCGTGAAC
AGGATCAACG ACGACGGCCG CATCTACCTC ACCCAGACCC GCGTGGACGG CGTGCTGGTG
ATCCGGTTTC AGGCGGGCGC GTTTGAAACC ACCGAGGCTG ACATCATGCT CGCCCATGAT
GTGATCACAG AAATCGCAAG AGGACTGACC TGA
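
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before any analysis. The following is a minimal sketch, assuming plain A/C/G/T text. The helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGAACTGGA CCGATTTTGC ATCCTGGGGC CGCAAGGTCG CCGATTGGGC GCAGGACTAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq) * 100))  # GC percentage for this line only
```

Passing the full sequence text to the same two helpers gives a value that can be compared with the GC content field in the record.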
 
Protein sequence
MNWTDFASWG RKVADWAQDY HLTVGERPVR AQTKPGDMLT ALPETPPEQG EGMETIFADF 
EEKVMPGITH WQHPRFFAYF ASNAAAPSVL AEFLTSAIAP QCMLWQTSPA ATEMETRMMD
WLRQSLGLPS EFRGVIQDSA SSATLAAVLT MRERALNWQG NQQGLAGQPT LRIYCSSEVH
TSVDRAIWVA GIGQANLVRV PIKGDWRGMD PAALETAIQS DKAAGLQPAG VILCVGGTGT
GATDPIADCI KVAQNHGLYT HVDAAWAGSA MICPEFRAYW PGIEGADSIV FNPHKWLGAQ
FDCSAHFLKN ADDLVRTLAI SPEYLKTHGH DGIINYSEWS VPLGRRFRAL KIWFLIRTYG
LEGLRQRIRN HVAWSRQLHD ALAQEPDFEI VTPPMWSLWT FRYAPDGATD LDALNLELVN
RINDDGRIYL TQTRVDGVLV IRFQAGAFET TEADIMLAHD VITEIARGLT