Gene TM1040_2883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2883 
Symbol 
ID4076417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3052183 
End bp3053406 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content57% 
IMG OID638008212 
Productargininosuccinate synthase 
Protein accessionYP_614877 
Protein GI99082723 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGC CCAAGAAAGT TGTGCTGGCC TACTCCGGTG GCCTTGATAC TTCGATCATC 
CTGAAATGGT TGCAGACCGA ATACGGCTGC GAGGTTGTCA CCTTTACTGC CGATCTCGGC
CAGGGCGAGG AACTGGAGCC TGCACGAAAA AAAGCAGAGC TTCTCGGCAT TAAACCCGAA
AACATCTTTA TCGAGGACAT CCGCGAGGAG TTCGTCCGCG ATTTCGTCTT CCCGATGTTC
CGTGCCAATG CGGTCTACGA AGGTCTCTAC CTGCTGGGCA CCTCGATTGC GCGCCCGCTG
ATCTCCAAAC GTCTTGTCGA GATCGCAGAG GCAACTGGCG CTGATGCCGT GTCCCATGGT
GCAACCGGCA AGGGCAACGA CCAGGTCCGG TTCGAGCTGT CCGCCTATGC TCTGAACCCC
GACATTAAGG TGATTGCACC TTGGCGCGAG TGGGATCTGA CCTCGCGCAC CAAACTGCTG
GAATTTGCCG AGGCAAACCA GATCCCGATT GCAAAGGACA AGCGTGGCGA GGCGCCTTTC
TCGGTCGACG CAAACCTGCT GCACACCTCC TCTGAGGGTA AGGTTCTGGA AGATCCGGCA
GAAATGGCGC CCGATTACGT CTATCAGCGT ACCGTAAACC CGGAAGATGC GCCCAACGAG
CCCGAGTTCA TCGAAATCAC CTTTGAAAAA GGCGATGCGG TTGCGATCAA CGGCGAGGCC
ATGTCGCCTG CGACGATCCT GACCAAGCTC AACGAATATG GCCGCAAGCA CGGCATTGGC
CGTCTGGATT TCGTCGAGAA CCGCTTTGTC GGCATGAAAT CCCGCGGCAT CTACGAGGCC
CCGGGCGGCG ACATCCTGCT CGAAGCACAC CGTGGCATTG AACAGATCAC CCTCGATAGC
GGCGCGGGCC ATCTCAAGGA CTCGATCATG CCGCGTTATG CAGAGCTGAT CTATAATGGC
TTCTGGTACT CTCCGGAGCG TGAAATGCTG CAGGCCCTGA TTGATGAGAG CCAGAAGCAC
GTCACCGGCA CCGTACGCGT AAAGCTTTAC AAAGGCTCCG CAAAAACGGT TGGTCGCTGG
TCCGAACACT CGCTCTATTC CGAGGCACAT GTGACCTTTG AAGAAGACGC GGGCGCCTAC
GATCAAAAAG ACGCGCAGGG CTTCATCCAG CTCAACGCAT TGCGTCTGAA GCTTTTGGCA
GCGCGCAACC GTCGAGTAAA ATAA
 
Protein sequence
MSAPKKVVLA YSGGLDTSII LKWLQTEYGC EVVTFTADLG QGEELEPARK KAELLGIKPE 
NIFIEDIREE FVRDFVFPMF RANAVYEGLY LLGTSIARPL ISKRLVEIAE ATGADAVSHG
ATGKGNDQVR FELSAYALNP DIKVIAPWRE WDLTSRTKLL EFAEANQIPI AKDKRGEAPF
SVDANLLHTS SEGKVLEDPA EMAPDYVYQR TVNPEDAPNE PEFIEITFEK GDAVAINGEA
MSPATILTKL NEYGRKHGIG RLDFVENRFV GMKSRGIYEA PGGDILLEAH RGIEQITLDS
GAGHLKDSIM PRYAELIYNG FWYSPEREML QALIDESQKH VTGTVRVKLY KGSAKTVGRW
SEHSLYSEAH VTFEEDAGAY DQKDAQGFIQ LNALRLKLLA ARNRRVK