Gene Information |
Locus tag | Sare_3987 |
Symbol | |
ID | 5706662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4534834 |
End bp | 4536021 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641273412 |
Product | arginase/agmatinase/formiminoglutamase |
Protein accession | YP_001538768 |
Protein GI | 159039515 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family |
TIGRFAM ID | |
|
|
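The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; the record does not state either convention, so both are assumptions.

```python
# Values copied from the record above.
start_bp = 4534834
end_bp = 4536021

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1188, matching "Gene Length"
print(protein_length)  # 395, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1188 bp and protein length of 395 aa.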
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000739402 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGCGCC GGATCGCCGT CCTCGACGCG CCCACCAACC TCGGTCTGCG TCCGCCCACC TCGACCTCGG TGCCAGGCTG CGGCAAGGCG CCGGGGGCGC TGCGTGACCA CGGCCTGCTC GCCCGGCTCC GAGCCCGTGA TGCCGGCTGC CTGACCCCGT CCCGGTACGA CCCCGGTGAC TGGCGGCCCG GCGATGGCGT CTGCCACGCC TGGGAGATCG CCAGCTATTC GGTGGCGCTC GCTGACCGGA TCGGAGCGGT CATGGACAGC GGCGAGTTCC CGCTGGTGCT CGGTGGGGAC TGTTCGGTCC TGCTCGGTTC GGCGCTGGCC ATGCACCGTC TCGGTGAGGC GGTGGGCGGC CGGGTCGGAC TCGTCTACGT GGACGGGCAT TCGGACTTTC GGCACCCCGG CAACGCCTCC TACGTGGGAG CGGCGGCCGG TGAGGGGCTG GCCCTGGTCA CCGGCCGAGG TCAGATCGAC CTGACCGCCA TCGAAGGCCG GCGGCCGTAC TTCCGAGACA TCGACGTGGC GGTGCTGGGC ATCCGTGCGC AGGATGACTA CCGGCTGGAC CTTCAGGCCG CCGGGATCAC CACCCGACCG GTTCCGGCGC TGCGCGCCGA GGGCGCGGCT CGTACGGCGC AGTGGGCGCA CGAGCAGCTC GCCCACTGCG CCGGCTACTG GCTGCATGTT GACGTGGACG TGCTGGACCC AGCCGTGATG CCCGCCGTTG ACGCTCCCGA CCCCGGCGGA ATCGCCTTCG CCGAACTGGA GATCCTGCTC GCCGGCCTGG TCGACACCCC GCACTGCCTC GGCGTCGAGG TGACCGTTTT CGATCCTGAC TACGACCCGG ACGGGGCGTA CGCCGCCGAG ATCGTCAACA CCCTGGTCGC CGGGCTCCGT CCGGTCACCG TGCCGGGCTC CGTGTCGCCC CGGCTGCGTG CAGCTGCCTC GCCACGTCCG ACCCCGGCGC GCCGACAGAG CCTGGAACGG CCGGTCGCCG TCGCCGACCC GGGAGGGGTA ACCGGCGCGG AGAAGCAGCC GGATCCGCCC GCCGTCCCGG GTGCGCGTAC CGTTGCCGCC GCGGACGAGG GTGGGGATCT CGGACCGTCG ACCGACTCGG CCCCGCCCAT CGACCGGCTG GCGCAGGTCC CCGGCAGCCT CCGGACTACT GGGCTCTCTG CACCCTGA
|
Protein sequence | MMRRIAVLDA PTNLGLRPPT STSVPGCGKA PGALRDHGLL ARLRARDAGC LTPSRYDPGD WRPGDGVCHA WEIASYSVAL ADRIGAVMDS GEFPLVLGGD CSVLLGSALA MHRLGEAVGG RVGLVYVDGH SDFRHPGNAS YVGAAAGEGL ALVTGRGQID LTAIEGRRPY FRDIDVAVLG IRAQDDYRLD LQAAGITTRP VPALRAEGAA RTAQWAHEQL AHCAGYWLHV DVDVLDPAVM PAVDAPDPGG IAFAELEILL AGLVDTPHCL GVEVTVFDPD YDPDGAYAAE IVNTLVAGLR PVTVPGSVSP RLRAAASPRP TPARRQSLER PVAVADPGGV TGAEKQPDPP AVPGARTVAA ADEGGDLGPS TDSAPPIDRL AQVPGSLRTT GLSAP
|
| |
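The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the strand is listed as "-"), so no reverse complement is applied. The helper names `clean`, `gc_content`, and `translate` are hypothetical. Translation table 11 shares its codon-to-amino-acid assignments with the standard code and differs in which start codons it permits; since this gene starts with ATG, the standard assignments are used here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that split the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence above.
head = "ATGATGCGCC GGATCGCCGT CCTCGACGCG"
tail = "GGGCTCTCTG CACCCTGA"

print(translate(head))  # MMRRIAVLDA, the first ten residues of the protein
print(translate(tail))  # GLSAP, the last five residues of the protein
```

Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the 73% listed in the record; the excerpts here are only meant to show that the two sequence fields are consistent with each other.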