Gene Information |
Locus tag | TM1040_0939 |
Symbol | |
ID | 4077333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1002122 |
End bp | 1003069 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006242 |
Product | agmatinase |
Protein accession | YP_612934 |
Protein GI | 99080780 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01230] agmatinase |
|
|
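The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length_bp = gene_length(1002122, 1003069)
print(length_bp)                  # 948, matching "Gene Length"
print(protein_length(length_bp))  # 315, matching "Protein Length"
```

Under these assumptions the computed values agree with the reported gene length of 948 bp and protein length of 315 aa.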
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.220122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGT TCAATCAACC CATCAGCGGC AATGATCTGG CGCGGTTCTC GGGGCCGGGC ACCTTTATGC GCCTGCCGCA GGCCACCTCG CTAGAGGGTC TGGATGTGGC GGTGCTGGGT GTGCCGATGG ATATTGGCAC CTCCTGGCGT TCCGGGACGC GGTTCGGGCC CAAGCAGATC CGCGCCGAGA GCGCGATGAT CCGCCCGTAC AACATGGCCA CCGGGGCTGC GCCTTTTGAT GCGCTCAACA TTGCCGATAT TGGCGATCTG GCGATCAATA CGTTCAACCT CGCGGAGTCG CTGCGCATCA TCGAAGACAG CTATGACGCG ATCCTCGGTT CGGGCGTGCT TCCCTTTGCG ATGGGGGGCG ATCACTCGAT CACCCTGCCG ATCCTCAGGG CGATGGCGCG CAGGTATGGC CCGGTTGCGG TGATCCATGT GGATGCCCAT GCGGACGTCA ATGACGAGAT GTTTGGCGAG CGCGAAACCC ACGGCACCGT GTTCCGCCGC GCCTATGAAG AGGGGCTTCT GGAGGCGGAT AAAGTTTATC AGATTGGTCT GCGCGGTACC GGCTATGGAC CGGATGATTT CAAAGAGCCG CAGAGCTGGG GCTTTCAGCA TTTTGTCGCC TCCGAGCTCT GGAACCGATC GCTGCACAAT ATGGGGGCGG AAATCCGTCG CGATATTGGC GGGCGTCCGG TCTATATCTC CTATGATATC GACAGTCTTG ATCCCGCCTT CGCGCCGGGC ACCGGCACGC CGGAAATTGG CGGGCTGACC ACCATGCAGG CGCTGGAGCT GATCCGCGCC TTCAAGGGGC TCAATGTCGT GGGATGTGAT CTTGTCGAAG TCTCGCCGCC CTACGACCCT TCGGGGAATA CGGCGCTTGT AGCAGCCAAT CTGATCTATG AGATGCTGTG CATCCTGCCA GGACGGGATA GGGCGTAA
|
Protein sequence | MSEFNQPISG NDLARFSGPG TFMRLPQATS LEGLDVAVLG VPMDIGTSWR SGTRFGPKQI RAESAMIRPY NMATGAAPFD ALNIADIGDL AINTFNLAES LRIIEDSYDA ILGSGVLPFA MGGDHSITLP ILRAMARRYG PVAVIHVDAH ADVNDEMFGE RETHGTVFRR AYEEGLLEAD KVYQIGLRGT GYGPDDFKEP QSWGFQHFVA SELWNRSLHN MGAEIRRDIG GRPVYISYDI DSLDPAFAPG TGTPEIGGLT TMQALELIRA FKGLNVVGCD LVEVSPPYDP SGNTALVAAN LIYEMLCILP GRDRA
|
| |
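The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, assuming plain Python with no external libraries, of how the blocks might be joined and how a GC fraction comparable to the "GC content" field could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Join the space-separated display blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two display blocks of the gene sequence, used as a small example.
example = "ATGAGCGAGT TCAATCAACC"
print(clean_sequence(example))  # ATGAGCGAGTTCAATCAACC
print(gc_fraction(example))     # 0.45 (9 of 20 bases are G or C)
```

Passing the full gene sequence to `gc_fraction` and rounding to a whole percentage should give a value that can be compared with the GC content reported in the record.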