Gene NATL1_04581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04581 
Symbol 
ID4779692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp419750 
End bp420736 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content42% 
IMG OID640083735 
ProductO-acetylserine (thiol)-lyase A 
Protein accessionYP_001014287 
Protein GI124025171 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.795061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGAA TCTATGAAGA CAATAGTTTC GCTATTGGTC ATACCCCATT AGTCAAGCTT 
AATTCAATTA CAAAAAACGC AAAAGCAACA GTATTGGCAA AGATAGAAGG CCGTAACCCC
GCCTACAGCG TTAAGTGCAG AATAGGAGCG AACATGATCT GGGATGCTGA GAAGAAAGGT
TTACTCAATA AAGATAAAGT CATTATTGAA CCAACTTCTG GAAATACTGG AATAGCCCTT
GCTTACACAG CTGCTGCTCG AGGTTACAAG TTAATCCTCA CGATGCCTGA GTCCATGTCA
ATTGAACGTC GGAGGATGAT GGCTGTTCTT GGTGCTGAAT TAATACTAAC TGAAGCAGCA
AAAGGAATGC CTGGTGCCAT TGCAAAAGCG AAGGAGATAG CTGATGGCGA CCCTCAAAAA
TACTTCATGC CAGGCCAATT TGATAATCCA GCTAATCCAG AAATACATTT CAAAACAACA
GGTCCTGAAA TTTGGGATGA TACCGATGGC CAAATTGATG TTTTAGTCTC TGGCGTCGGG
ACAGGTGGAA CAATAACAGG TGTCTCTCGT TTTATAAAAC AAGAAAAAAA TCATTCATTA
TTATCTGTTG CCGTTGAGCC CACTCATAGC CCAGTAATAA CACAAACTCT CAATGGAGAA
GAGGTAAAGC CCGGCCCTCA CAAGATTCAA GGGATCGGAG CTGGGTTTAT CCCTAAAAAT
CTTGACTTAT CAGTAGTCGA TCAAGTTGAG CAAGTCTCTA ATGATGAGTC TATTGCAATG
GCATTACGTT TAGCACAAGA AGAAGGTCTA CTAGTTGGAA TCTCATGTGG TGCCGCTGCA
GCTGTGGCTT TAAGACTTGC CGAAAAAGAA GAGTTCGCAG GGAAGACAAT CGTTGTGGTT
CTTCCTGACC TTGCAGAAAG ATATATCTCA TCCGTAATGT TTGAGAATGT TCCTACAGGC
GTTATTAAAG AACCATCATT AGCTTGA
 
Protein sequence
MSRIYEDNSF AIGHTPLVKL NSITKNAKAT VLAKIEGRNP AYSVKCRIGA NMIWDAEKKG 
LLNKDKVIIE PTSGNTGIAL AYTAAARGYK LILTMPESMS IERRRMMAVL GAELILTEAA
KGMPGAIAKA KEIADGDPQK YFMPGQFDNP ANPEIHFKTT GPEIWDDTDG QIDVLVSGVG
TGGTITGVSR FIKQEKNHSL LSVAVEPTHS PVITQTLNGE EVKPGPHKIQ GIGAGFIPKN
LDLSVVDQVE QVSNDESIAM ALRLAQEEGL LVGISCGAAA AVALRLAEKE EFAGKTIVVV
LPDLAERYIS SVMFENVPTG VIKEPSLA