Gene Information |
Locus tag | TM1040_0938 |
Symbol | |
ID | 4077566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1001097 |
End bp | 1002125 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006241 |
Product | agmatinase |
Protein accession | YP_612933 |
Protein GI | 99080779 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01230] agmatinase |
|
|
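The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical, chosen for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the trailing stop codon,
    if present, does not contribute a residue.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(1001097, 1002125)
protein = expected_protein_length_aa(length)
print(length, protein)
```

Under these assumptions the computed values agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.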
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.738745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.133276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGCC CGCCTGCATT GGCTAGGACG TCAGCAGAAC CCGAGAACAG GGAGCAGTCC AAAATGGCGC TGGAAGACGC AAAAACGCAG GTAGATCAGG CATTTACGCG CGAGGACCTT AAGGGTCTCA GCTTTGAGAA CACCTTTGGA GGGGCCACGT CGTTTCTGCG GCGTCGCTAC ACCAAGGATC TGACCCACGC GGATATCGCC GTGACGGGTA TTCCGTTTGA TCAGGCGGTG ACAAACCGCC CCGGCACACG GCTGGGTCCG CGCGCGGTGC GCGAGGCCTC GGCCCTGCAA AGCCCCGATG CGCCTTATGG CTGGGATATT TGCCCTGCAA GCGAGCTGGC AATCGTGGAC TACGGCGATC TGGCCTTTGA CTATGCCAAT GTCCCGGCGT TTCCCGACAC GCTCACGGAT CATATCCGGG GCATTCTGGC CACCGATACA GCATCGGTGG CGATTGGCGG CGACCATTAT GTGAGCTTCC CGATCCTCAA GGCCTATGCC GAGAAATACG GGCCTGTTTC GCTCTTGCAT TTCGATGCTC ACAGCGACAC CTGGGCGGAT GATGATTTCA GCCGCGTGGA TCATGGCACG ATGTTCTACA AAGCGGTGAA ATCCGGCATC ATTGACCCGG CTACCTCGGT GCAGGTGGGC ATTCGCACCA CCAATGAGGA CAATCTGGGC GTGCCGACTA TCGATGCCCC CACGGTGCAT GAGATCGGCC CGGTCGAGAC CGCGCGCCGC ATCCGCGAGG TGCTGGGCGA CCGGCCAGTC TATCTGACCT TTGACATTGA CTGCCTCGAT CCGGCCTATG CGCCGGGCAC TGGCACCCCG GTCTGGGGTG GCCTCACCTC TGCGCAGGCA GAGGCGATCC TCAAGGGGTT GCGCGGCATC AACATTCTGG GCGGCGACGT CGTCGAAGTC TCGCCCCCCT TTGACACCAC CGGCGCCACC GCCATCGCGG GCGCCCATGT GGCCATGAGT ATTATCTGCC TTCTGGGCTG GAGGATGACA GGACGATGA
|
Protein sequence | MTSPPALART SAEPENREQS KMALEDAKTQ VDQAFTREDL KGLSFENTFG GATSFLRRRY TKDLTHADIA VTGIPFDQAV TNRPGTRLGP RAVREASALQ SPDAPYGWDI CPASELAIVD YGDLAFDYAN VPAFPDTLTD HIRGILATDT ASVAIGGDHY VSFPILKAYA EKYGPVSLLH FDAHSDTWAD DDFSRVDHGT MFYKAVKSGI IDPATSVQVG IRTTNEDNLG VPTIDAPTVH EIGPVETARR IREVLGDRPV YLTFDIDCLD PAYAPGTGTP VWGGLTSAQA EAILKGLRGI NILGGDVVEV SPPFDTTGAT AIAGAHVAMS IICLLGWRMT GR
|
| |
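The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC fraction calculation; the helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, not the full CDS.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence from the record (illustration only).
first_blocks = "ATGACAAGCC CGCCTGCATT GGCTAGGACG"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence would allow a comparison against the Gene Length and GC content fields of the record.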