Gene A9601_09531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_09531 
SymbolilvA 
ID4717662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp820171 
End bp821712 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content32% 
IMG OID640078666 
Productthreonine dehydratase 
Protein accessionYP_001009344 
Protein GI123968486 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01124] threonine ammonia-lyase, biosynthetic, long form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATT ATTTTGAAAA AATACTTCAA GCTGAAGTCT ATGAAGTTGC AAAAAAAACA 
CCACTAGAGA AAGCTCATAA TTTAAGTAAC ACACTTAATA ATGAAGTTTT TCTAAAAAGA
GAAGATCTTC AGGATGTATT TTCATTCAAA ATAAGAGGTG CATATAACAA AATGAGTAAG
CTAACTAATT CACAGCTTGC TCAGGGAGTA ATTACTTCTA GTGCTGGAAA TCATGCCCAA
GGGGTTGCAC TTAGTGCCCT TAAGTTAAAT TGCCAAGCAA CCATATTAAT GCCCGTTACC
ACACCTATAG TAAAAGTTAA TGCAGTAAAA AGTTTAAAAG CAAAAGTTAT ATTATATGGT
GACAACTATG ATGAAACATA CAAAGAGGCA ATAAGGATTA GCCAAGAAAG AAATTTATGC
TTTATTCATC CCTTTGATGA TCCAGAAGTA ATAGCAGGAC AAGGAACTAT AGCTATAGAA
CTTGAACAGC AGCTTAAGGA AAAACCTTAT GCAATTTATA TTGCTGTAGG TGGTGGGGGA
TTGATATCAG GAATATCCAT ATACGTTAAA AAAATATGGC CAGAAGTAAA AATAATTGGT
GTAGAACCTG AAGATGCTGA CGCTATGACT AAATCATTGG AAGAAGAAAA AATTGTGGAA
CTACCTTCTG TAGGTCAATT TGCAGATGGA GTAGCGGTAA AAAAAATTGG TAAAAATACT
TTTGATATTG GTAGAAAATA TATAGATAAG ATGATTAGGG TTAATACTGA CGAAATCTGT
GCTGCTATAA AAGATGTTTT TGAGGATACT AGATCCATAT TAGAGCCCGC AGGGGCCTTA
TCAATAGCGG GAATGAAAAA AGATATTTTA AATTCGAATC ATTCAAATAG AAAAATGGTT
GCGATTGCAT GTGGTGCAAA TATGAATTTT GAGAGGCTTA GATTTGTAGC AGAAAGAGCA
GAACTTGGAG AGTGCAAAGA AGTAATGATG GCTGTTGAAA TTCCTGAACG TGCTGGTAGT
CTAATTGATT TTTGTAAGTT ACTTGATAAT AGAAATCTAA CAGAATTTAG CTATAGGATG
TCGAATTCTA AGAATGCACA GATATTTGTA GGGGTTCAAG TCTATGGTTT AAATGATAAA
AAAAATTTAT TAAATGTATT TAGAAATTCT GAGTACTCAT TTATTGACAT AAGTGATGAT
GAATTATCTA AAAATCATCT CAGACATATG GTAGGTGGAA GATTACCAAG GGATTTTAAA
GAGATGGAAT ATAAAAACTT TATTGAGCTT TTATACAGAT TTGAGTTTCC TGAAAGGCCT
GGCGCATTAA TAAACTTCTT AAATAATATG AAATCTAATT GGTCTATAAG CGTATTTCAC
TACAGGAATT ATGGAGCTGA TGTAGGGAAA ATTGTCATTG GAGTTTTGAT CGATAAAAAT
GAGATTTTAG AGTGGAATAA ATTTGTAAAA ATTCTAGGTT ATAAATTCTG GGATGAAACT
CAAAACGATA CATATAGATT GTTCCTTGGT GCATCAGATT AA
 
Protein sequence
MNDYFEKILQ AEVYEVAKKT PLEKAHNLSN TLNNEVFLKR EDLQDVFSFK IRGAYNKMSK 
LTNSQLAQGV ITSSAGNHAQ GVALSALKLN CQATILMPVT TPIVKVNAVK SLKAKVILYG
DNYDETYKEA IRISQERNLC FIHPFDDPEV IAGQGTIAIE LEQQLKEKPY AIYIAVGGGG
LISGISIYVK KIWPEVKIIG VEPEDADAMT KSLEEEKIVE LPSVGQFADG VAVKKIGKNT
FDIGRKYIDK MIRVNTDEIC AAIKDVFEDT RSILEPAGAL SIAGMKKDIL NSNHSNRKMV
AIACGANMNF ERLRFVAERA ELGECKEVMM AVEIPERAGS LIDFCKLLDN RNLTEFSYRM
SNSKNAQIFV GVQVYGLNDK KNLLNVFRNS EYSFIDISDD ELSKNHLRHM VGGRLPRDFK
EMEYKNFIEL LYRFEFPERP GALINFLNNM KSNWSISVFH YRNYGADVGK IVIGVLIDKN
EILEWNKFVK ILGYKFWDET QNDTYRLFLG ASD