Gene PCC8801_3660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3660 
Symbol 
ID7102914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3820573 
End bp3821961 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content47% 
IMG OID643476675 
Productargininosuccinate lyase 
Protein accessionYP_002373778 
Protein GI218248407 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCAAAC AAAAAACGTG GAGCGATCGC TTTGAAGGGA GTCTCCATCC AGCTATTGTT 
GAATTTAACG CCAGTATTGG CTTCGATATC GAACTAATCG AATACGATCT CACCGGATCG
ATTGCCCATG CCCAAATGTT AGCCCATACG GGCATTATTT CCCCAGAAGA AGCCCAAAAA
CTGACCCAAG GCTTAGAACA GATTCGCCAA GAATACCGCC AGGGAGAGTT TAAACCAGGA
ATTGATCAAG AAGATGTTCA CTTTGCCGTC GAAAGGCGAT TGACCGAAAT TGTCGGAGAT
GTGGGCAAAA AACTCCATAC CGCGCGATCG CGTAACGATC AGGTGGGAAC CGATATTCGG
CTCTATCTAA GGGATCAAAT TAGCCAAATT CGCGCCCAAT TACGGGAATT TCAGCAAGTC
TTAGTAAACC ACGCTGAGAA CCATATCGAA ACCCTGATCC CTGGCTATAC CCACCTGCAA
CGGGCACAAC CGATTAGTTT AGCCCACCAT CTCCTCGCCT ATTTCCAAAT GGCACAACGG
GACTGGGAAC GCTTAGGCGA AATTTACGCC AGAACCAATA TTTCACCCCT AGGATGCGGG
GCGTTAGCGG GGACGACTTT TCCTATTGAT CGCCATTACA GTGCCGAATT ATTGCAATTT
CAAGGGGTCT ATGGCAACAG TTTAGATGGG GTCAGCGATC GGGATTTTGC CATTGAATTC
CTCAACGCAG CCAGTCTGAT TATGGTTCAT TTAAGCCGTT TAAGTGAAGA AATGATCCTC
TGGTCTTCCC ATGAGTTTAG TTTTATCAGT CTAACGGATA GCTGCGCCAC GGGATCGAGT
ATTATGCCCC AAAAGAAAAA CCCGGATGTT CCTGAATTAG TCCGGGGAAA GGCTGGTCGC
GTTTTTGGCC ACTTGCAAGG GATGTTAGTC TTAATGAAAG GGCTACCTTT AGCGTACAAT
AAAGATCTGC AAGAGGACAA AGAAGCCATT TTTGATGGTG TCAAAACCGT GAAAGTCTGT
TTGGAGGCCA TGACCATTCT CTTAGCTGAA GGCATTAAAT TCCGCGAAGA ACGCCTAGCC
GAAGCCGTAT CCGAGGACTT TTCCAATGCA ACGGATGTAG CGGATTATTT AGCCGCTAAA
GGGATTCCTT TTCGGGAGGC CTATAATTTA GTTGGGAAGG TGGTTAAAAC CAGTTCGGCG
GCCGGTAAGT TACTCAAGGA TTTATCCTTA GAAGAATGGC AAGCGTTACA TCCAGCCTTT
GAAGCCGATA TTTACGATGC GATCGCCCCT AAACAGGTGG TAGCAGCGCG TAATAGCTAT
GGGGGAACTG GTTTTGAACA AATCCGTCAA GCCATTACAA GGGCAAAGGC TCAATTAGAG
TCCTCTTAA
 
Protein sequence
MTKQKTWSDR FEGSLHPAIV EFNASIGFDI ELIEYDLTGS IAHAQMLAHT GIISPEEAQK 
LTQGLEQIRQ EYRQGEFKPG IDQEDVHFAV ERRLTEIVGD VGKKLHTARS RNDQVGTDIR
LYLRDQISQI RAQLREFQQV LVNHAENHIE TLIPGYTHLQ RAQPISLAHH LLAYFQMAQR
DWERLGEIYA RTNISPLGCG ALAGTTFPID RHYSAELLQF QGVYGNSLDG VSDRDFAIEF
LNAASLIMVH LSRLSEEMIL WSSHEFSFIS LTDSCATGSS IMPQKKNPDV PELVRGKAGR
VFGHLQGMLV LMKGLPLAYN KDLQEDKEAI FDGVKTVKVC LEAMTILLAE GIKFREERLA
EAVSEDFSNA TDVADYLAAK GIPFREAYNL VGKVVKTSSA AGKLLKDLSL EEWQALHPAF
EADIYDAIAP KQVVAARNSY GGTGFEQIRQ AITRAKAQLE SS