Gene P9303_00121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_00121 
SymbolargH 
ID4776025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp15728 
End bp17140 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content51% 
IMG OID640085511 
Productargininosuccinate lyase 
Protein accessionYP_001016034 
Protein GI124021727 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGTG GAGTAACCGG TGGTTCTGCA GAAGGCTGGA GCAAAAGGTT TGAAGAGGGA 
TTGCATCCGG TTATTGAGCG GTTCAATGCT TCCATCAGCT TTGACATCAC TCTGCTCCAA
GAAGATTTAG ATGGGTCTAT CGCTCATGCA CGCATGCTTG GCGAATGTGG AGTGATCAGT
CTTGAAGAGG CCGCTCAGCT TGAGGGCGGC CTTGAGAAAA TCCGCTCAGA GGCAGCGGCA
GGAGAGTTTC AGCCAGGACT TGTTGATGAA GATGTTCATT TCGCTGTTGA ACGGCGCCTG
ATCGCTTTAT TAGGGCCTGT CGGCAAGAAA CTTCATACCG GCCGAAGTCG CAATGATCAG
GTAGGCACGG ATCTGCGCTT GTGGTTACGG CGTCGCTTGG ATGATTTGGA TTGTGAACTG
GAGCGGTTTC AGAATGCTCT TTTGACACAG GCAGAATCCC ATCGTCAAAC TTTGATTCCC
GGCTACACCC ACTTACAGCG TGCCCAGCCG CTTTGCTTGG CCCATCATCT TTTGGCTTAT
ATCGAGATGA TCCAAAGGGA TAGAGATCGG CTTAAAGATG TTCGCGGACG AGTCAATATT
TCGCCGCTTG GTGCGGCTGC CCTTGCAGGA ACATCAGTGC CTATTGATCG CCAAAATACA
GCAGCAGCAC TTGGCTTTGA GTGTATTTAC GCGAATAGTC TGGATGCGGT AAGTGATCGC
GATTTTGCTG TTGAGTACAC AGCGGCGGCT TCGTTGGTCA TGGTCCATCT CAGTCGCCTG
GCGGAGGAGG TGATTTTCTG GGCGTCTGAG GAATTTGCAT TTGTGCGTCT CAGCGACCGT
TGTGCCACTG GCAGCAGCTT GATGCCTCAG AAGAAGAATC CTGATGTGCC TGAATTGGTA
CGAGGCAAAT GTGGTCGTGT GTTTGGTCAC CTTCAGGGTC TCCTGACCAT GATTAAGGGA
TTGCCATTGG CTTACAACAA GGATTTTCAG GAGGATAAGG AAGCTCTCTT TGACACCGTT
CGCACTACTA AAGATTGTGT GGAAGCCATG TCGATTTTGA TGGAGCAAGG GTTGGAGTTT
TGTTCGGAAC GCTTGGCAGC TGCCGTTGAA TCCGATTTTT CCAATGCCAC TGATGTGGCT
GACTACCTGG TTGCTAAAGG AGTGCCCTTT CGGGAGGCCT ACCAATTGGT GGGTGCGGTT
GTGAAGCGTT GTTTAGACGA AGGGATTTTG CTTTGTGATC TAAGCCTTGA ACAATGGCAG
GAATTTCACT CCGCTATTGC TGAGGATTTG CATGAAGCAT TGGCTCCAAA GCGGGTTGTG
GCTGTGCGGA TTAGTGAAGG TGGCACTGGT TTTGATCGAG TTGAGGAGCA ATTGCGTCAC
TGGCGTAGCC GCCTTGATTC CGGAGTGTCA TGA
 
Protein sequence
MAGGVTGGSA EGWSKRFEEG LHPVIERFNA SISFDITLLQ EDLDGSIAHA RMLGECGVIS 
LEEAAQLEGG LEKIRSEAAA GEFQPGLVDE DVHFAVERRL IALLGPVGKK LHTGRSRNDQ
VGTDLRLWLR RRLDDLDCEL ERFQNALLTQ AESHRQTLIP GYTHLQRAQP LCLAHHLLAY
IEMIQRDRDR LKDVRGRVNI SPLGAAALAG TSVPIDRQNT AAALGFECIY ANSLDAVSDR
DFAVEYTAAA SLVMVHLSRL AEEVIFWASE EFAFVRLSDR CATGSSLMPQ KKNPDVPELV
RGKCGRVFGH LQGLLTMIKG LPLAYNKDFQ EDKEALFDTV RTTKDCVEAM SILMEQGLEF
CSERLAAAVE SDFSNATDVA DYLVAKGVPF REAYQLVGAV VKRCLDEGIL LCDLSLEQWQ
EFHSAIAEDL HEALAPKRVV AVRISEGGTG FDRVEEQLRH WRSRLDSGVS