Gene NATL1_00111 details


Gene Information

Locus tag: NATL1_00111
Symbol: argH
ID: 4781267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 14783
End bp: 16174
Gene Length: 1392 bp
Protein Length: 463 aa
Translation table: 11
GC content: 35%
IMG OID: 640083274
Product: argininosuccinate lyase
Protein accession: YP_001013840
Protein GI: 124024724
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
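
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 14783
end_bp = 16174

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 nt; the last codon is assumed to be the stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1392
print(protein_length)  # 463
```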


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.642056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAAAAAG CATTGAGCAA AACTTGGAGT GACAGATTTG ATAAAGGACT TAATCCTTTT 
ATAGAAAAAT TTAATGCTTC AATCGAGTTT GATATTTGTT TATTAGAGGA AGATTTGGAT
GGATCAATTG CCCATGCACG TATGCTTGGA ATTCAAGGGA TTATTACCAA GGAAGAGGCG
CTTAGATTAG AGAATGGTCT TCAACAGATT CGAAAAGAGG CTTCTGATGG CTTATTTCAG
CCTGTCATTG CAGATGAAGA TGTGCATTTT GCAGTAGAAA AAAAATTAAT AGACTTGATA
GGCCCAGTAG GGAAAAAACT ACATACTGGT CGTAGTCGTA ATGATCAAGT TGGAACAGAT
CTGAGATTAT GGCTAAGAAA ACGTATTGAT GAAATTGATA TGGATTTGGT ACGTCTTCAG
AAATCTCTTT TTTTATTAGC AGAGGAAAAT CTGTATACGC TTATTCCTGG TTATACGCAT
TTACAAAGAG CCCAACCTTT GTCTCTGGCG CATCACTTGT TGGCATATAT TGAGATGGCA
CAAAGAGATA GAAATAGATT AAAAGATGTA AGAAAACGAG TGAATATTTC TCCACTAGGA
GCAGCTGCTT TAGCTGGCAC ATCGATTTCT ATAAGCAGAA AGATTACTTC TTCAGAATTA
CACTTTCAAG GTATTTATTC TAATAGTTTA GATGCTGTAA GTGATAGAGA CTTTGTCGTA
GAATTTTTAG GAGCTTCATC GTTAATTATG GCTCATTTAA GTAGATTATC TGAAGAAGTA
ATTTTGTGGG CATCTGAAGA ATTTGCCTTT ATTCAATTAA CCGACCGATG TGCTACTGGA
AGTAGTCTTA TGCCTCAAAA AAAGAATCCT GATGTACCTG AACTTGTTCG AGGCAAGTCA
GGAAGAGTAT TTGGACATTT ACAAGCTATG CTGACTATGA TTAAGGGATT ACCTTTAGCT
TACAACAAAG ATTTTCAAGA AGACAAAGAA GCTATCTTTG ATAGTGTTAA AACAGTTAAG
AATTCTTTGA TTGCCATATC AATTTTGTTT GAAGAGGGTT TAATTTTTAG AAAAGAAAGA
CTTAATCAAG CTGTTTCCTC AGATTTTTCA AATGCGACTG ATGTCGCTGA TTATTTAGTG
GCTAAGGACA TACCTTTCCG AGAGGCTTAT CAATTAGTTG GGCGAATTGT AAAAACTTCC
TTGGAGGAGG GGATTTTATT AAAAGATTTT CCTTTAGAAA GATGGAAAAC ATTTCATAAA
TTTTTTGAAA AAGATATTTA TGAAAAGCTT TTGCCTTCGA GTGTAGTTGA GTCTCGTTTG
AGTGCTGGTG GAACTGGATT TGAGAGAGTT CAAGAACAGC TTCTTTCTTG GCGAGAAAAA
TTATTTAATT AA
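
The GC content reported for this gene can be recomputed from the sequence text. The sketch below is a minimal, assumed implementation: `gc_percent` is a hypothetical helper, not part of the source database, and the example runs only on the first 60 nt of the gene sequence; the full sequence can be pasted in the same way.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the 10-nt block separators and line breaks used in this
    record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, copied verbatim from the record.
first_line = "GTGGAAAAAG CATTGAGCAA AACTTGGAGT GACAGATTTG ATAAAGGACT TAATCCTTTT"
print(round(gc_percent(first_line), 1))
```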
 
Protein sequence
MEKALSKTWS DRFDKGLNPF IEKFNASIEF DICLLEEDLD GSIAHARMLG IQGIITKEEA 
LRLENGLQQI RKEASDGLFQ PVIADEDVHF AVEKKLIDLI GPVGKKLHTG RSRNDQVGTD
LRLWLRKRID EIDMDLVRLQ KSLFLLAEEN LYTLIPGYTH LQRAQPLSLA HHLLAYIEMA
QRDRNRLKDV RKRVNISPLG AAALAGTSIS ISRKITSSEL HFQGIYSNSL DAVSDRDFVV
EFLGASSLIM AHLSRLSEEV ILWASEEFAF IQLTDRCATG SSLMPQKKNP DVPELVRGKS
GRVFGHLQAM LTMIKGLPLA YNKDFQEDKE AIFDSVKTVK NSLIAISILF EEGLIFRKER
LNQAVSSDFS NATDVADYLV AKDIPFREAY QLVGRIVKTS LEEGILLKDF PLERWKTFHK
FFEKDIYEKL LPSSVVESRL SAGGTGFERV QEQLLSWREK LFN
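
The gene begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 nt of the gene. It is a simplified illustration, not the database's own pipeline: `translate` is a hypothetical helper, and the set of start codons is deliberately restricted to three common ones, whereas table 11 permits more.

```python
# Amino-acid assignments in NCBI order (first, second, third base each cycling
# through T, C, A, G). Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and stopping at '*'."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence in this record.
prefix = "GTGGAAAAAG CATTGAGCAA AACTTGGAGT"
print(translate(prefix))  # MEKALSKTWS
```

The output matches the first ten residues of the protein sequence listed above.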