Gene TM1040_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1946 
Symbol 
ID4076897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2050013 
End bp2051275 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content63% 
IMG OID638007262 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_613941 
Protein GI99081787 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0479651 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCC TCATCCTCGG CAGCGGCGGG CGGGAACATG CACTTGCCTG GGCGGTGATG 
CAGAACCCCA AATGCGACCG GCTCATTGTG GCGCCGGGCA ATGCGGGTAT CGCCCAGATC
GCCGACTGCG CCAGCCTCAA CGCCGAGGAT GGCGGCGCGG TTGTGACATT CGCCGAGGAA
AACGCCATTG ATTTTGTGAT CGTCGGCCCC GAGGCGCCTT TGGCCGCCGG TGTGGCAGAC
CGGTTGCGCG ATGCGGGCAT TCTGGTCTTT GGTCCCTCTG AGGCCGCCGC CCGGCTGGAG
GCTTCCAAAA GCTTCACCAA GGAGATCTGC GACGCGGCCA ATGCGCCCAC TGCGGGCTAT
GGCCACTTCA CTGATGCCGA GGCGGCCAAG GCCCATGTCC GTGCCAACGG CGCGCCGATT
GTGGTCAAGG CCGATGGTCT GGCCGCAGGC AAGGGCGTGA TCGTGGCGAT GGACGAGCAG
ACTGCGCTCG ATGCCATCGA CGATATGTTC GGCGGTGCCT TTGGTGGGGC GGGCGCAGAG
GTTGTCATCG AGGAATTCAT GGAAGGTGAA GAGGCATCGC TCTTTGTGCT CTGTGATGGT
GAGGAAATCC TGTCCATCGG TACCGCACAG GACCACAAGC GCGTCGGCGA AGGCGACACT
GGCCTAAATA CCGGCGGCAT GGGGGCTTAT TCTCCTGCAC CGGTTCTGAG CGCCGAGGTT
GAAGCCAAGG CCATGGAAGA GATCGTGAAG CCCACCATGC GGGTGATGGC CGAGCGTGGC
ATGCCCTACC AAGGCGTGCT CTATGCAGGC CTGATGATCA AGGACGGCCA GCCGCGTCTG
GTGGAATATA ACGTCCGCTT TGGCGATCCC GAATGTCAGG TGCTGATGAT GCGCCTTGGC
GCGCAGGCCC TGGACCTGAT GCAAGCCGCA GCCGAAGGTC GCCTTGCGGA CGCCCGCGTC
AACTGGGCTG ATGACCACGC GATCACGGTG GTGATGGCTG CGGCAGGCTA TCCGGGAAGC
TATGAAAAAG GCAGCGAGAT CAAGGGCCTT GATGCTCTGC CCGAAGACAG CATGAATATG
GTCTTTCACG CAGGGACCAA GGCCGATGGC GACAAGATCC TCGCCAATGG TGGCCGGGTG
CTGAATGTGA CTGCACGGGG CGAGAGCCTC TCTGAGGCGC GCGATCGCGC CTATGCCATG
GTCGATCAGA TCGACTGGCC CGAGGGCTTC GTGCGCCGCG ACATCGGCTG GCGCGCGCTT
TGA
 
Protein sequence
MNILILGSGG REHALAWAVM QNPKCDRLIV APGNAGIAQI ADCASLNAED GGAVVTFAEE 
NAIDFVIVGP EAPLAAGVAD RLRDAGILVF GPSEAAARLE ASKSFTKEIC DAANAPTAGY
GHFTDAEAAK AHVRANGAPI VVKADGLAAG KGVIVAMDEQ TALDAIDDMF GGAFGGAGAE
VVIEEFMEGE EASLFVLCDG EEILSIGTAQ DHKRVGEGDT GLNTGGMGAY SPAPVLSAEV
EAKAMEEIVK PTMRVMAERG MPYQGVLYAG LMIKDGQPRL VEYNVRFGDP ECQVLMMRLG
AQALDLMQAA AEGRLADARV NWADDHAITV VMAAAGYPGS YEKGSEIKGL DALPEDSMNM
VFHAGTKADG DKILANGGRV LNVTARGESL SEARDRAYAM VDQIDWPEGF VRRDIGWRAL