Gene TM1040_1838 details


Gene Information

Locus tag: TM1040_1838
Symbol:
ID: 4077863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1938162
End bp: 1939529
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 62%
IMG OID: 638007154
Product: L-glutamine synthetase
Protein accession: YP_613833
Protein GI: 99081679
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0174] Glutamine synthetase
TIGRFAM ID:
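
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function name span_length is illustrative and not part of any database API.

```python
# Minimal consistency check for the record's coordinates and lengths.
# Assumption: start/end are 1-based, inclusive positions on the replicon.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end_bp - start_bp + 1

# Values copied from the Gene Information fields.
start_bp, end_bp = 1938162, 1939529

gene_length = span_length(start_bp, end_bp)

# A CDS is read in codons of 3 bp; the last codon is assumed to be
# a stop codon, so it contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1368 bp) and Protein Length (455 aa).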


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0277584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.270137
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACCC CGACGCGCTG GCAGGAAAAA CTGCCCGAAG CTGCCACGAC CTATCTGGAG 
GGCCGTCGTC TCGACGAGGT TGAATGCGTT ATCTCGGACT TGCCGGGCAT CGCCCGGGGC
AAGGCGGTGC CAGCGTCGAA ATTTGCCAAG CAGGACTATT TTTACCTGCC CGACAGCATC
TTTTATCAGA CCATCACCGG GGATTGGGCC GAAGCCGCCG ACGAGGACGG CTGGATCGAA
AAGGACATGC TGCTGCGCCC GGACATGAGC ACCGCAACCG CCGCCCCCTG GACCGGCGAC
TGGACTCTGC AGGTCATCCA CGATGCCTAT GACCGCGACG GCAACCCGAT CCCCTTCAGC
CCGCGCAACG TGCTGAAACA TGTGGTGAGC CTCTATGAGG CGCAGGGCTG GAAACCCGTG
GTGGCGCCGG AGATGGAATT CTATCTCGTG GCCCGCAACG TCGACCCCGC GCGCGACATC
CAGCCCATGA TGGGCCGCTC TGGCCGTCCG GCGGCGGCGC GTCAGGCCTA TTCGATGACA
GCGGTGGACG AATTTGGCCC CGTCATCGAC GACATCTATG ATTTTGCCGA GGCGCAAGGC
TTTGAAATCG ACGGCATCAC CCAGGAAGGC GGTGCCGGGC AGCTGGAAAT CAACCTGCGC
CATGGCTCGC CGGTGAAGCT AGCCGATGAG GTGTTTTACT TCAAACGCCT GATCCGCGAG
GCCGCGCTGC GCCATGATTG CTTTGCCACC TTCATGGCAA AACCGATCGC TGATGAGCCG
GGCTCTGCCA TGCATATCCA CCACTCGGTG ATCGACATCG AGAGCGGCGA GAACATCTTT
TCCGGTCCCC AAGGTGGTGA AACGGATGCG TTTTATCACT TTATCGGCGG GCTGCAGAAC
CACCTTCCTG CCGGTCTCGC GGTGATGGCG CCCTATGTGA ATTCCTATCG CCGCTATGTG
AAGAACCACG CCGCGCCGAT CAATCTGGAA TGGGCGCGCG ACAACCGCAC CACCGGCATT
CGGGTGCCGC TCTCCAGTTC CGCCTCGCGC CGGGTTGAAA ACCGCATCGC CGGGATGGAT
TGCAACCCCT ATCTCGGTAT CGCGGTATCG CTGGCCTGCG GCTACCTCGG CCTGATGGAA
GAGCGCCGCC CCACCCGCCA GTTCAAAGGC GATGCCTATG AGGGCGAAGG TGACTTTCCG
CAGGTCATGG GTCAGGCGCT CGATCTCTTT GACGAGTCCA AGGCGCTTCA CGAGGTGCTC
GGCCCCGAAT TCGCCCGGGT CTACAGCACG GTGAAGCGCG CAGAATACGA AGAGTTCCTG
CAAGTGATCT CGCCGTGGGA GCGTGAGCAC CTGTTGCTCA ACGTCTGA
 
Protein sequence
MPTPTRWQEK LPEAATTYLE GRRLDEVECV ISDLPGIARG KAVPASKFAK QDYFYLPDSI 
FYQTITGDWA EAADEDGWIE KDMLLRPDMS TATAAPWTGD WTLQVIHDAY DRDGNPIPFS
PRNVLKHVVS LYEAQGWKPV VAPEMEFYLV ARNVDPARDI QPMMGRSGRP AAARQAYSMT
AVDEFGPVID DIYDFAEAQG FEIDGITQEG GAGQLEINLR HGSPVKLADE VFYFKRLIRE
AALRHDCFAT FMAKPIADEP GSAMHIHHSV IDIESGENIF SGPQGGETDA FYHFIGGLQN
HLPAGLAVMA PYVNSYRRYV KNHAAPINLE WARDNRTTGI RVPLSSSASR RVENRIAGMD
CNPYLGIAVS LACGYLGLME ERRPTRQFKG DAYEGEGDFP QVMGQALDLF DESKALHEVL
GPEFARVYST VKRAEYEEFL QVISPWEREH LLLNV
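
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names clean_sequence and gc_fraction are hypothetical, and the demo uses only the first two blocks of the gene sequence rather than the full record.

```python
# Sketch: join blocked sequence text and compute its GC fraction.
# Helper names are illustrative, not taken from any library.

def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)

# First two ten-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGCCGACCC CGACGCGCTG"
cleaned = clean_sequence(first_blocks)

print(len(cleaned), gc_fraction(cleaned))
```

Pasting the whole gene sequence into clean_sequence would allow its length and GC fraction to be compared with the Gene Length and GC content fields.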