Gene TM1040_2393 details

Gene Information

Locus tag: TM1040_2393
Symbol:
ID: 4076719
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2516031
End bp: 2517155
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 62%
IMG OID: 638007715
Product: glycine cleavage system T protein
Protein accession: YP_614387
Protein GI: 99082233
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
TIGRFAM ID: [TIGR00528] glycine cleavage system T protein
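
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
gene_length = gene_length_bp(2516031, 2517155)
protein_length = protein_length_aa(gene_length)
print(gene_length, protein_length)
```

With the values from this record, the computed lengths agree with the listed 1125 bp and 374 aa.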


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.673157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGATC TGAAGACCAC TCCCCTTACT GAATTGCACG AGAGCCTCGG GGCCAAGATG 
GTGCCCTTTG CTGGCTACTC CATGCCTGTG CAATACCCGC TCGGGATCAT GAAAGAACAC
ACGCACACCC GCGAGAAAGC CGGGCTCTTT GATGTGAGCC ACATGGGGCA AGTGATTCTG
CGTGGAGAGA GCTATGAGGC CTTGGCCGCT GCGTTTGAAA AGCTGGTGCC GATGGATGTA
CTCGGGCTGT CCGAGGGGCG CCAACGCTAC GGGCTGTTCA CCAACGATAC GGGCGGCATC
GAGGACGACC TGATGTTCGC CAACCGCGGT GATCACCTGT TTGTGGTTGT GAACGCAGCC
TGCAAGGACG CTGACATCGC CCGCATGAAG GCCGCGCTTG AACCCGAAGT AACAGTCGAG
CCTGTCACCG ATCGCGCGCT TCTGGCCCTT CAGGGCCCTG CCGCTGAGGC GGCGCTGGAG
GCGTTGGTGC CTGGCGTTGC GGCGATGAAG TTCATGGATG TGGCAACCTT CGCATATGAA
GGCGGAGAGC TCTGGATCTC CCGCTCTGGT TATACCGGAG AGGATGGCTA TGAGATCTCC
GTCGCCGAAG CCGGCGCAGA GGCCTTTGCC AAGGCTCTGC TGGCACATGC CGACGTTGAG
GCCATTGGCC TCGGCGCACG AGATTCCCTG CGCCTTGAGG GCGGGCTTTG CCTTTATGGA
CACGACATCG ACACTGAGAC CCGCCCGTTT GAGGCCGCTC TTGGCTGGGC GATCCAAAAA
GTACGCCGCC CGGGTGGCGA CCGCGCGGGC GGCTTTCCCG GTGCGGATGC GATCTTTGCT
GATCTCGGTG GCAAGGCCCC GCGCAAACGT GTGGGTCTGA AGCCCGAGGG CCGTGCGCCG
ATGCGCGAGG GTGTTGTGCT TTACGCAAGT GCCGAAGGTG GCGACCCTAT CGGCACCATC
ACGTCTGGTG GCTTTGGTCC GACCGTTGGC GGCCCGGTCG CCATGGGCTA CGTCACCGCA
GAGCATGCTG CTCTGGACAC ACAGGTCTTT GGCGAGCTGC GCGGCAAGCG CGTGCCCGTC
ACTGTTGCAA AGCTGCCCTT CGTGGCGGCC AACTTCAAAC GCTAA
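
The GC content field in the record can be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in the space-separated blocks of ten bases shown above. The helper names `clean` and `gc_fraction` are hypothetical. Only the first row of the sequence is used here to keep the example short; the full sequence would be passed in the same way.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "ATGTCGGATC TGAAGACCAC TCCCCTTACT GAATTGCACG AGAGCCTCGG GGCCAAGATG"
print(f"{gc_fraction(first_row):.0%}")
```

The value for a single row will differ from the whole-gene figure of 62% reported in the record, since GC content varies along the sequence.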
 
Protein sequence
MSDLKTTPLT ELHESLGAKM VPFAGYSMPV QYPLGIMKEH THTREKAGLF DVSHMGQVIL 
RGESYEALAA AFEKLVPMDV LGLSEGRQRY GLFTNDTGGI EDDLMFANRG DHLFVVVNAA
CKDADIARMK AALEPEVTVE PVTDRALLAL QGPAAEAALE ALVPGVAAMK FMDVATFAYE
GGELWISRSG YTGEDGYEIS VAEAGAEAFA KALLAHADVE AIGLGARDSL RLEGGLCLYG
HDIDTETRPF EAALGWAIQK VRRPGGDRAG GFPGADAIFA DLGGKAPRKR VGLKPEGRAP
MREGVVLYAS AEGGDPIGTI TSGGFGPTVG GPVAMGYVTA EHAALDTQVF GELRGKRVPV
TVAKLPFVAA NFKR
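
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration and not the database's own method. It assumes the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in the start codons it permits). The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last rows of the gene sequence are translated here; both begin on a codon boundary because every full row holds 60 bases.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGTCGGATC TGAAGACCAC TCCCCTTACT GAATTGCACG AGAGCCTCGG GGCCAAGATG"
last_row = "ACTGTTGCAA AGCTGCCCTT CGTGGCGGCC AACTTCAAAC GCTAA"

print(translate(first_row))  # MSDLKTTPLTELHESLGAKM
print(translate(last_row))   # TVAKLPFVAANFKR
```

Both results match the corresponding ends of the protein sequence listed above, and the final codon TAA is the stop codon that is excluded from the 374 aa protein length.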