Gene NATL1_21221 details


Gene Information

Locus tag: NATL1_21221
Symbol: asd
ID: 4780944
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1779439
End bp: 1780476
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 38%
IMG OID: 640085419
Product: aspartate semialdehyde dehydrogenase
Protein accession: YP_001015942
Protein GI: 124026827
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0136] Aspartate-semialdehyde dehydrogenase
TIGRFAM ID: [TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms)
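
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1779439
end_bp = 1780476

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1038
print(protein_length)  # 345
```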


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAACCAA AATCTCTTCT TCCAAATAGA CCACTTACAG TTGCGATTCT TGGATCGAGT 
GGAGCTGTAG GAAAGGAATT ATTGGCTTTG CTTGAAGAGA GATCATTCCC TGTTGGAAAA
TTAATTTTAC TAGCCTCCCA TCGCTCAGCA GGGGACATTC AGAGTTTTTG TGGAACTGAG
GTGAAAGTCG AAGAGACAAC TAAAGATAGT TTTAAAAATG TAGATTTAAT TTTGGCTTCA
GCTGGTAGTT CAATATCTCG TAAATGGAAA GATATAATTC AAAGTTCTGG AGCTTTGATG
ATTGATAACT CTAGTGCATT TCGAATGGAC GAGAATGTTC CTTTGATCGT TCCAGAAGTT
AATCCTTCTG ATGCTATTAA GCACAAAGGT GTAATTTCTA ATCCGAATTG CACGACTATT
TTATTAGCTT TGGTTCTATC TCCATTGACT AGGCATATCC CCATTAAGAG GATCGTTGTA
TCAACCTATC AATCCGCTAG TGGTGCAGGT GCAATGGCAA TGGAAGAATT AGAAAAATTA
AGCCAACAAG TTTTAGATGG AATAACTCCA AAAAGTAAAG TTCTTCCGTA TTCTTTGGCG
TTTAATCTTT TTTTGCATAA TTCCCCTTTG CAATCGAATA ATTATTGCGA GGAAGAGATG
AAAATGGTTA ATGAAACCCG AAAGATTCTT GGAGATCCCG AACTTTCATT GACCGCAACA
TGTGTAAGAG TCCCAGTCTT AAGAGCCCAT TCTGAATCGG TCAATATTGA ATTTACAAAG
CCTTTTCCAG TAAAAGAAGC TTATAAGATT TTGGAGGCTT CACCGGGAGT TGAAATACTT
GAGGACTTGG AAAATAATCG TTTTCCGATG CCTCAAGATG TAACTGGCAG AGATCCAATA
TCAGTTGGTC GAATAAGGCA AGATATAAGT AACCCGAATG CTTTGGAACT ATGGTTATGT
GGAGATCAAA TCCGCAAGGG TGCAGCACTT AATGCAGTGC AGATTGCTGA ACTTCTACTG
CCAAAGAATT ATGAATAA
 
Protein sequence
MKPKSLLPNR PLTVAILGSS GAVGKELLAL LEERSFPVGK LILLASHRSA GDIQSFCGTE 
VKVEETTKDS FKNVDLILAS AGSSISRKWK DIIQSSGALM IDNSSAFRMD ENVPLIVPEV
NPSDAIKHKG VISNPNCTTI LLALVLSPLT RHIPIKRIVV STYQSASGAG AMAMEELEKL
SQQVLDGITP KSKVLPYSLA FNLFLHNSPL QSNNYCEEEM KMVNETRKIL GDPELSLTAT
CVRVPVLRAH SESVNIEFTK PFPVKEAYKI LEASPGVEIL EDLENNRFPM PQDVTGRDPI
SVGRIRQDIS NPNALELWLC GDQIRKGAAL NAVQIAELLL PKNYE
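
The gene sequence begins with TTG while the protein sequence begins with M. This follows from translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first and last few codons of the gene sequence. The function name `translate_cds` and the reduced `START_CODONS` set are assumptions made for illustration (table 11 permits further start codons), and no particular library is implied.

```python
from itertools import product

# Amino acid assignments for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 initiation codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initiation codon as M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate_cds("TTGAAACCAA AATCTCTTCT TCCAAATAGA"))  # MKPKSLLPNR

# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate_cds("CCAAAGAATT ATGAATAA"))  # PKNYE
```

Passing the full 1038-base gene sequence to the same function would be the way to compare the result against the listed protein sequence.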