Gene Syncc9605_1082 details


Gene Information

Locus tag: Syncc9605_1082
Symbol:
ID: 3737794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9605
Kingdom: Bacteria
Replicon accession: NC_007516
Strand:
Start bp: 1020343
End bp: 1021533
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 59%
IMG OID: 637775673
Product: agmatinase
Protein accession: YP_381394
Protein GI: 78212615
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01229] arginase; [TIGR01230] agmatinase
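
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1020343, 1021533)
    print(length)                     # 1191, matching the Gene Length field
    print(protein_length_aa(length))  # 396, matching the Protein Length field
```

Under these assumptions, the Start bp, End bp, Gene Length, and Protein Length fields agree with one another.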


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.193569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCCC CATCGGACCC TTCAGGGGCC TTCCAGCGTT CGTATCCCAG CGAAGGCATG 
CAGGCACTCG AGAAAGAACG CAAGCTTCCA CTCACTGGCT GGCAGCAAGA AGTTGACCAG
GCCAAACGCT TCGGGCTTGA AGCCGCCGAA AGCATTGTTG ACCGCAACAT CTCCACCTTC
TCTAGAGGCG AGCTGCCGCA TTTCGCCGGC ATCAACACCT TCATGAAGGC GCCCTATTTA
GAAGATGTGA ACCAGGTGGG CAACTACGAC GTCGCCATCG TTGGTGTACC CCACGACTGC
GGCACCACCT ACCGGCCCGG AACGCGCTTC GGCCCCCAGG GGATCCGACG AATATCAGCG
CTTTACACCC CTTACAACTA CGAAATGGGT GTCGACCTGC GTGAACAGAT CACCCTCTGC
GATGTGGGTG ACATCTTCAC GATCCCGGCC AACAACGAAA AGAGCTTCGA TCAGATCTCC
AAAGGCATCG CCCACGTCTT CTCGAGCGGC ACCTTCCCGA TCATCCTCGG TGGCGACCAC
TCGATCGGTT TCCCCACGGT GCGTGGGGTG TGTCGCCATC TCGGCGACAA AAAAGTGGGA
ATCATCCATT TCGATCGCCA CGTCGACACC CAGGAGATCG ACCTTGATGA GCGGATGCAC
ACCTGCCCTT GGTTCCATGC CACAAACATG GCCAACGCCC CGGCAGAAAA CCTGGTGCAG
CTGGGCATTG GTGGTTGGCA AGTGCCTCGC GAGGGCGTCA AGGTCTGCAG GGAGCGGGGC
ACCAATGTGC TCACGGTGAC CGACATCACT GAAATGGGGC TGGAAGCCGC AGCCCAATAC
GCCATTGAAC GAGCCACCGA TGGCACGGAC TGCGTCTACA TCTCCTTCGA CATTGACTGC
ATCGATGCCG GCTTCGTGCC GGGAACTGGC TGGCCTGAGC CCGGTGGCTT GATGCCGCGA
GAAGCGCTCA AGCTGCTCGA GCTGATCGTG CGCAACGTTC CCGTCTGCGG CCTGGAAATC
GTTGAGGTTT CACCTCCCTA CGACATCAGT GACATGACCT CCCTGATGGC CACCCGGGTT
ATTTGCGACA CCATGGCCCA CCTTGTGGTG AGCGGTCAGT TACCCCGCAA AGAGAAGCCG
GAGTGGATCA GCGACACCTG CAACATGAAC GTTGATCAGA AGTGGAGATA G
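
A GC content figure such as the one in the Gene Information fields can be recomputed from a sequence listed in this space-separated block layout. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G, and T bases; the function name is hypothetical, and the rounding rule the database applies is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among A/C/G/T characters; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence, pasted with its block spacing intact.
    first_line = "ATGTCATCCC CATCGGACCC TTCAGGGGCC TTCCAGCGTT CGTATCCCAG CGAAGGCATG"
    print(f"{gc_content(first_line):.1f}%")
```

Passing the full gene sequence instead of the first line gives the value to compare against the GC content field.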
 
Protein sequence
MSSPSDPSGA FQRSYPSEGM QALEKERKLP LTGWQQEVDQ AKRFGLEAAE SIVDRNISTF 
SRGELPHFAG INTFMKAPYL EDVNQVGNYD VAIVGVPHDC GTTYRPGTRF GPQGIRRISA
LYTPYNYEMG VDLREQITLC DVGDIFTIPA NNEKSFDQIS KGIAHVFSSG TFPIILGGDH
SIGFPTVRGV CRHLGDKKVG IIHFDRHVDT QEIDLDERMH TCPWFHATNM ANAPAENLVQ
LGIGGWQVPR EGVKVCRERG TNVLTVTDIT EMGLEAAAQY AIERATDGTD CVYISFDIDC
IDAGFVPGTG WPEPGGLMPR EALKLLELIV RNVPVCGLEI VEVSPPYDIS DMTSLMATRV
ICDTMAHLVV SGQLPRKEKP EWISDTCNMN VDQKWR
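
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which codons may initiate translation). The `translate` helper is illustrative, not the database's own method, and it assumes the input starts in frame.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.upper().split())  # drop the block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence.
    print(translate("ATGTCATCCC CATCGGACCC TTCAGGGGCC"))  # MSSPSDPSGA
    # Last 12 bases of the gene sequence, ending in the TAG stop codon.
    print(translate("AAGTGGAGATAG"))                      # KWR
```

Both fragments reproduce the matching ends of the listed protein sequence, and the full gene sequence can be passed to `translate` in the same way.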