Gene TM1040_1442 details

Gene Information

Locus tag: TM1040_1442
Symbol:
ID: 4078072
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1540223
End bp: 1541452
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 60%
IMG OID: 638006753
Product: aminotransferase, class V
Protein accession: YP_613437
Protein GI: 99081283
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID:
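
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1540223
end_bp = 1541452

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.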


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.259917
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.109823
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCG CCATGGATAT CGAATTTGTC CGCAAACAAT TTCCGGCATT TGAACAGCCC 
TCTTTGCAGG GTCAGGCCTT CTTTGAGAAT GCAGGCGGCT CTTACACCTG CCGGCAGGTG
ATCGACCGCC TGTTTCGATT TTACACCGAG CATAAGGTCC AGCCCTACGC GCCTTATGCC
GCCTCCGAGG CAGCCGGCGC CGAGATGGAT GAAGCGCGCA GTCGTCTGGC GGCGATGCTG
GGTGTCTCTG CGCAGGATCT GAGCTTCGGT CCTTCGACAA CCCAGAATAC CTATGTGCTG
GCGCAGGCCT TCCGGGGCTT CTTGAAGCCG GGCGAGAAGA TCATCGTTAC CAATCAGGAC
CATGAGGCCA ACTCCGGCCC GTGGCGACGC TTGGCCGACG AGGGCATCGA GGTGCTGGAG
TGGCAGATCG ATCCTGCCAC CGGCCATCTT GAACCGAGCG CGCTGGAGGA TCTCTTGGAC
GAGAGCGTGC GGCTGGTCTG TTTTCCCCAT TGCTCCAATG TGGTGGGCGA GATCAATCCG
GTCACGGAAA TCACTGCGCT GGCCCATGCT GCGGGGGCTT TTGTCTGCGT TGATGGCGTC
TCTTACGCGC CGCATGGTTT GCCGAACGTG GGTGAACTGG GGCCGGATAT CTATTTGTTC
TCCGCCTATA AAACCTATGG CCCTCATCAA GGGATCATGG TGATCAATCC CGCTCTTGCC
GAGCTTTTGC CCAATCAGGC GCACTACTTT AACGGGGATG TTCCTTACAA ACGCTTCACC
CCCGCCGGAC CCGATCACGC GCAGGTCGCC GCCTGTGCGG GAATGGTCGA CTACTTCGAG
GCGCTCGCCG AACATCACAA CGCACCTGAG ATCACAGGCA CAGGGGCGGG CGCCTTTGTG
CACGATCTGA TGCGCGAGCA GGAGATCTCC TTGTTGCAGC CGCTCCTGGA TGCGGTGAAG
GGGCGAAACG ATGTGCGCTT GCTTGGCCCC GCGAACGCCA AAGAGCGTGC CCCGACCGTC
GCGCTTGCGC TTGGGCGGGC AGCAGAGCCC GTTGCCAAGC AATTGGCAGA GCTTGGGATC
ATGGCGGGGG GCAGCGACTT TTACGCAGTG CGTGCGCTCA GGGCAATGGG GGTCGACCCC
GCGCAGGGCG TGCTGCGGCT GAGTTTTACT CACTATACCG ATCAATTGGA GGTGACAGCG
CTGATCGAGG CCCTAGATCG CGTCCTGTAA
 
Protein sequence
MKRAMDIEFV RKQFPAFEQP SLQGQAFFEN AGGSYTCRQV IDRLFRFYTE HKVQPYAPYA 
ASEAAGAEMD EARSRLAAML GVSAQDLSFG PSTTQNTYVL AQAFRGFLKP GEKIIVTNQD
HEANSGPWRR LADEGIEVLE WQIDPATGHL EPSALEDLLD ESVRLVCFPH CSNVVGEINP
VTEITALAHA AGAFVCVDGV SYAPHGLPNV GELGPDIYLF SAYKTYGPHQ GIMVINPALA
ELLPNQAHYF NGDVPYKRFT PAGPDHAQVA ACAGMVDYFE ALAEHHNAPE ITGTGAGAFV
HDLMREQEIS LLQPLLDAVK GRNDVRLLGP ANAKERAPTV ALALGRAAEP VAKQLAELGI
MAGGSDFYAV RALRAMGVDP AQGVLRLSFT HYTDQLEVTA LIEALDRVL
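
As a rough consistency check between the two sequences, the following sketch strips the block spacing, computes a GC fraction, and translates codons until the first stop. It is a minimal illustration, not the method used to produce this record. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAACGCG CCATGGATAT CGAATTTGTC CGCAAACAAT TTCCGGCATT TGAACAGCCC"
print(translate(first_line))
print(round(gc_fraction(first_line), 3))
```

Translating the first 60 bases yields the first 20 residues of the protein sequence (MKRAMDIEFV RKQFPAFEQP). Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.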