Gene P9211_00121 details


Gene Information

Locus tag: P9211_00121
Symbol: argH
ID: 5731751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 14906
End bp: 16294
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 37%
IMG OID: 641284354
Product: argininosuccinate lyase
Protein accession: YP_001549897
Protein GI: 159902553
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
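
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 14906
end_bp = 16294
reported_gene_length = 1389      # bp
reported_protein_length = 462    # aa

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under those assumptions the computed values agree with the reported gene length and protein length.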


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.669318
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00196346
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGGGAAAAC CTTGGAGCGA TCGATTCGAA GTAGGCCTTC ATCCTTTTAT AGAGAGTTTT 
AATGCTTCTA TAAAGTTTGA TTTTCTTCTT TTGCAAGAAG ATCTTGATGG ATCAATAGCT
CATGCAAGGA TGCTAGGCAA AACAGGCATA ATTAATGCTG ATGAAGCATC TCAACTTGAA
AAAGGCTTAA ACCAGATTCG TTTAGAAGCA TCCCAAGGTG TTTTTAATGC TGATCAACCT
GCTGAGGATG TTCATTTCGC TGTTGAGAAC AGATTAATAG AACTTTTAGG ACCGCTTGGA
AAGAAACTGC ATACTGGTAG AAGTAGAAAT GATCAAATAG CTACAGATAT AAGATTATGG
TTGCGACGAA AAATTGATGA AATCAATTTT GATTTAGAAA ATATTCAAAA GATTTTGTTG
GGCCATGCAG AAAAGAATTT GTATACACTT ATTCCTGGAT ATACGCATTT GCAAAGAGCT
CAACCTGTTT CATTAGCTCA TCATTTACTT GCATATCTTG AAATGTTTCA GAGAGATAGA
GATCGTTTGG TCGAAGTCAA AAGTCGAGTT AATACCTCTC CTTTGGGAGC AGCTGCCTTA
GCAGGAACTT CTTTACCAAT TGACAGGCTA TATACAGCAG ATCAATTAAA TTTTACTAGT
ATTTATTCCA ATAGTTTAGA TGCAGTAAGT GATCGTGATT TTGCAGTTGA ATTTATTGCT
GCATCTTCAT TAATTATGGT TCACTTAAGC CGATTATCAG AAGAAATAAT TTTTTGGTCT
AGCGAAGAAT TTTCGTTTGT AAAATTAACC GATCGATGCG CAACTGGTAG CAGCATAATG
CCTCAAAAAA AGAATCCTGA TGTACCTGAA CTTGTTAGAG GAAAGTCAGG AAGAGTCTTT
GGCCATCTCC AAGCTTTGTT GGTCATGCTC AAAGGTCTGC CTCTTGCATA TAACAAAGAT
TTTCAGGAAG ATAAAGAGGC TCTTTTTGAC ACAGTAGTGA CTGTTAGAAA TTCTCTTCAA
GCAATGTCTA TTCTCTTAGA AGAGGGTTTG GAGTTTTCTT TAGATCGCCT GGGATCAGCC
GTGGAATCGG ATTTTTCTAA TGCAACTGAT GTGGCAGATT ATTTAGTTTC TAAAGAAGTC
CCTTTTAGAG AGGCTTATCA GATTGTTGGA CGTTTAGTAA AGCTTTGTAT GAAAGAAGGT
ATTTTGCTTA AGGATCTTTC TTTTGATCAA TGGCAGGATA TGCACCCTGC TTTTGATCAG
GATATATATA AAAGGTTAAC TCCAGAACAT GTAGTCGCCT CGAGGATTAG TCAAGGCGGA
ACAGGCTTTG CTCAAGTGTC TGCACAGTTG GAAAATTGGC AAAATCAGTT TTCTTCTTTG
AAAGAATGA
 
Protein sequence
MGKPWSDRFE VGLHPFIESF NASIKFDFLL LQEDLDGSIA HARMLGKTGI INADEASQLE 
KGLNQIRLEA SQGVFNADQP AEDVHFAVEN RLIELLGPLG KKLHTGRSRN DQIATDIRLW
LRRKIDEINF DLENIQKILL GHAEKNLYTL IPGYTHLQRA QPVSLAHHLL AYLEMFQRDR
DRLVEVKSRV NTSPLGAAAL AGTSLPIDRL YTADQLNFTS IYSNSLDAVS DRDFAVEFIA
ASSLIMVHLS RLSEEIIFWS SEEFSFVKLT DRCATGSSIM PQKKNPDVPE LVRGKSGRVF
GHLQALLVML KGLPLAYNKD FQEDKEALFD TVVTVRNSLQ AMSILLEEGL EFSLDRLGSA
VESDFSNATD VADYLVSKEV PFREAYQIVG RLVKLCMKEG ILLKDLSFDQ WQDMHPAFDQ
DIYKRLTPEH VVASRISQGG TGFAQVSAQL ENWQNQFSSL KE
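
The gene sequence begins with TTG, yet the protein sequence begins with M. That is consistent with translation table 11, in which TTG can act as an initiator codon and is then read as methionine. The following minimal sketch shows this on the first 30 nucleotides of the gene sequence. The function name `translate_cds` and the constants are hypothetical, the amino acid assignments are those of the standard genetic code (which table 11 shares), and the initiator set is the one listed for NCBI table 11; treat it as an illustration, not as the method used to produce this record.

```python
# Build the codon table in TCAG order (standard code, also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiator codons of translation table 11; all are read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence in this record.
first_block = "TTGGGAAAAC CTTGGAGCGA TCGATTCGAA"
print(translate_cds(first_block))  # MGKPWSDRFE
```

The result matches the first ten residues of the protein sequence listed above.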