Gene A9601_11561 details

Gene Information

Locus tag: A9601_11561
Symbol: thrA
ID: 4717869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 971451
End bp: 972752
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 32%
IMG OID: 640078871
Product: homoserine dehydrogenase
Protein accession: YP_001009547
Protein GI: 123968689
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
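
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative and not part of the record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that adds no residue.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(971451, 972752)
print(length)                  # 1302, matching the Gene Length field
print(protein_length(length))  # 433, matching the Protein Length field
```

Under these assumptions the reported gene length and protein length agree with the start and end positions.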


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAT GCAAAATTGG TATTGTAGGT TTTGGAACTG TAGGTTCAGG GATTTATAAA 
ATATTAACTT CTGAACTCGA TTCACATCCA ATTCTAGAAG AAATAGAAAT TTCAAAAATA
GCAGTTAAAG ATCTTGATAA AAAAAGAGAT ATTGAGCTAG ATAATAATTT ATTAACTGAT
GATCCATTTA AATTAATTAA TGACCCATCT ATAGATGTTA TTGTTGAAGT AATGGGTGGG
GTTGATTTAG CGAAAGAAAT TATTCTGCAA TCATTAAAAT TAGGTAAATC TGTAGTTACC
GCAAACAAAG CAGTTATTGC AAGATATGGA GAAGAAATAT ATAAAACTGC ATCTAAAGAA
GGTGTCTATA TATTGTCAGA AGCAGCTGTT TGCGGGGGGA TTCCAATAAT TGAACCCTTA
AAAAGATCAT TAAAAAGTAA CAGTATAAAA AAAATGGTTG GGATAATAAA TGGCACAACA
AATTTTATTC TTTCAAAGAT GGCAAATGAA AAAGCTGATT ATAAGGAAAC TTTAAAATTG
GCCCAAAGCC TTGGTTACGC AGAATTTGAT CCAACTGCAG ATGTTGAGGG GCATGATGCT
GCTGATAAGA TTTCAATCCT TAGTGAACTC GCATTTGGAG GGAAAATCAA AAGAGAGGAG
ATACATTCTG AGGGTATCAG TAAAATTAAT CTCAAGGATA TCGAATATGC CAATAAATTA
GGATTTGAAA TAAAACTTTT AGCGCTCTCT GAAAGGGGAC AAATTAATAG TAATGATTCA
CTCGCTTTAA ATATTTGGGT AGGACCTTCT TTGATTCCAA AATCTCATCC ATTGTCAACA
GTTAAGGGAG TTAACAATGC CTTGTTGATT GAGGCTGATC CTCTTGGAGA AATAATGTTA
TATGGTCCAG GTGCAGGGAG TGGTCCAACT GCAGCATCAG TAGTATCAGA TATATTAAAT
CTGCATGCCG CCTCAGTAAA AAATAATAAT TCAATCGATC CATTATTATC TTTTGATTTC
TGGAGAAACT GCCATATCAT AGGATCTTCG CAAATAAACA AAAAAAATTA CCTTAGAATT
ATTTGTCTTG ATAGTCCAGG TGTAATAGGA AAGATTGGAG ATATTTTTGG AAAGAATAAT
GTATCAATCG AATCAATTGT TCAACTTGAT GCGAGTGAGG ACAAAGCTGA AATTGTCGTT
ATTACTCATG AGGTGAATAA TGGAGATTTT GAGAGATCGA AAAATGAAAT AAATTCGTTA
AATGAAGTAA AAATTATTGC AAGTCAATTA AGTTGTATTT AA
 
Protein sequence
MRKCKIGIVG FGTVGSGIYK ILTSELDSHP ILEEIEISKI AVKDLDKKRD IELDNNLLTD 
DPFKLINDPS IDVIVEVMGG VDLAKEIILQ SLKLGKSVVT ANKAVIARYG EEIYKTASKE
GVYILSEAAV CGGIPIIEPL KRSLKSNSIK KMVGIINGTT NFILSKMANE KADYKETLKL
AQSLGYAEFD PTADVEGHDA ADKISILSEL AFGGKIKREE IHSEGISKIN LKDIEYANKL
GFEIKLLALS ERGQINSNDS LALNIWVGPS LIPKSHPLST VKGVNNALLI EADPLGEIML
YGPGAGSGPT AASVVSDILN LHAASVKNNN SIDPLLSFDF WRNCHIIGSS QINKKNYLRI
ICLDSPGVIG KIGDIFGKNN VSIESIVQLD ASEDKAEIVV ITHEVNNGDF ERSKNEINSL
NEVKIIASQL SCI
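
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits.

```python
# Codon table in the conventional TCAG ordering.
# Assumption: table 11 codon-to-amino-acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAGAAAAT GCAAAATTGG TATTGTAGGT"
print(translate(first_blocks))  # MRKCKIGIVG
```

Passing the full gene sequence to `translate` and `gc_content` would allow the result to be compared with the protein sequence and the GC content field of the record.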