Gene TM1040_1375 details


Gene Information

Locus tag: TM1040_1375
Symbol: guaA
ID: 4075868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1468257
End bp: 1469819
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 59%
IMG OID: 638006685
Product: GMP synthase
Protein accession: YP_613370
Protein GI: 99081216
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
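
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields.
start_bp = 1468257
end_bp = 1469819
gene_length_bp = 1563
protein_length_aa = 520


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(length_bp):
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks agree with the record: the span covers 1563 bp, and 521 codons minus the stop codon give 520 amino acids.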


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.497878
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAGA CAGCCCATGA CCGCCTTTTA ATTATAGACT TCGGCAGCCA GGTAACGCAG 
CTGATTGCGC GCCGCCTGCG CGAGTTGAAC GTCTATTGTG AAATCCACCC CTATCAGAAT
GTCACCATGG ACTTCGTGCG CGAGTTCGCG CCCAAGGCGG TGATCTTCTC TGGTGGTCCC
GACAGCGTGA CGCGCGAAGG CTCTCCCCGC GCGCCGCAAG AGATTTTTGA CTACGGCGTG
CCGATCCTTG GCATCTGTTA TGGCCAGCAA GTGATGATGC ATCAGCTTGG CGGCACTGTT
CAATCCGGCC ATGGAACCGC CGAATTTGGC CGCGCCTATG TGACACCCAC CGAAGAGCGC
ATCGACATGC TCAGCGGCTG GTTCCTGGAT CAGACGGAAC AGGTCTGGAT GAGCCACGGC
GACCATGTCT CTGAAATCGC ACCCGGTTTC AAGGTCTACG GCACCTCGCC CAACGCGCCA
TTTGCGATCA CGGCGGATCT GGAGCGCAAC TTCTACGCTG TTCAGTTCCA CCCCGAGGTT
CATCACACTC CCAACGGCAA GACGCTCTAT GAAAACTTCG TGCGTCTGGC CGGGTTCAGC
GGTGACTGGA CCATGGGCGC CTATCGCGAG CAGATGGTCG AAACCATCCG CGAGCAGGTC
GGCGACAAGA AAGTCATCTG TGCCCTTTCG GGTGGCGTCG ACAGTTCCGT TGCGGCGGCT
CTGATCCACG AGGCGATCGG CGATCAGCTG ACCTGTGTGT TTGTGGACCA TGGTCTTCTG
CGCAAGAACG AGGCGGAAGA AGTCGTCGGC ATGTTCCGCG ACCACATGAA CCTGCAGGTC
ATCCACGCGG ATGAGACCGA GCTCTTTCTC GGTGAGCTAG AAGGTCAGTC CGACCCCGAA
ACCAAGCGCA AGATTATCGG CAAGCTGTTC ATCGACGTGT TCCAGAAATA CGCCGATCAG
ATCGAAGGCG CGGAGTTCCT GGCCCAAGGT ACGCTCTACC CGGATGTCAT CGAGTCGGTC
TCGTTCTCTG GTGGCCCTTC GGTGACGATC AAGTCGCACC ACAACGTCGG TGGTCTGCCC
GAAAAAATGG GCCTGAAACT GGTGGAGCCG CTGCGCGAGC TCTTCAAGGA CGAAGTGCGC
GCGCTCGGGC GTGAACTTGG CCTGCCCGAC AGCTTTATTG GACGGCACCC CTTCCCCGGA
CCGGGTCTGG CGATCCGTTG CCCCGGCGAG ATCACCCGCG ACAAGCTGGA CATCCTGCGC
GAAGCGGATG CCATCTATAT CGACCAGATC CGCAAACACG GTCTTTATGA TGAGATCTGG
CAGGCCTTTG TGGCGATCCT GCCGGTGCGC ACCGTGGGCG TGATGGGCGA CGGTCGCACC
TATGACTACG CCTGTGCCCT GCGCGCGGTC ACCTCGGTCG ATGGGATGAC GGCGGATTAC
TACCCGTTCA GCCATGAGTT CCTTGGTGAG ACCGCAACGC GGATCATCAA TGAAGTCAAA
GGCATCAACC GTTGCACCTA TGACATCACC TCAAAGCCTC CGGGCACGAT CGAGTGGGAA
TGA
 
Protein sequence
MTETAHDRLL IIDFGSQVTQ LIARRLRELN VYCEIHPYQN VTMDFVREFA PKAVIFSGGP 
DSVTREGSPR APQEIFDYGV PILGICYGQQ VMMHQLGGTV QSGHGTAEFG RAYVTPTEER
IDMLSGWFLD QTEQVWMSHG DHVSEIAPGF KVYGTSPNAP FAITADLERN FYAVQFHPEV
HHTPNGKTLY ENFVRLAGFS GDWTMGAYRE QMVETIREQV GDKKVICALS GGVDSSVAAA
LIHEAIGDQL TCVFVDHGLL RKNEAEEVVG MFRDHMNLQV IHADETELFL GELEGQSDPE
TKRKIIGKLF IDVFQKYADQ IEGAEFLAQG TLYPDVIESV SFSGGPSVTI KSHHNVGGLP
EKMGLKLVEP LRELFKDEVR ALGRELGLPD SFIGRHPFPG PGLAIRCPGE ITRDKLDILR
EADAIYIDQI RKHGLYDEIW QAFVAILPVR TVGVMGDGRT YDYACALRAV TSVDGMTADY
YPFSHEFLGE TATRIINEVK GINRCTYDIT SKPPGTIEWE
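
The gene and protein sequences are printed in space-separated blocks of ten, so the spaces have to be stripped before the two can be compared. The sketch below translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence. It assumes the standard codon assignments, which translation table 11 shares for all 64 codons (the tables differ in permitted start codons); the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_prefix = clean(
    "ATGACCGAGA CAGCCCATGA CCGCCTTTTA ATTATAGACT TCGGCAGCCA GGTAACGCAG"
)
protein_prefix = clean("MTETAHDRLL IIDFGSQVTQ")

if __name__ == "__main__":
    print(translate(gene_prefix) == protein_prefix)
```

The same two helpers can be applied to the full listings by pasting in all of the sequence lines; only the prefix is used here to keep the example short.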