Gene P9515_11421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_11421 
SymbolthrA 
ID4719035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp994977 
End bp996278 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content31% 
IMG OID640080823 
Producthomoserine dehydrogenase 
Protein accessionYP_001011456 
Protein GI123966375 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.209412 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAT GTAAAATTGG TATTATAGGT TTTGGAACTG TAGGGACTGG TATTTATAGA 
ATTTTGAAAT CGAGAAATGA TGTTCATCCT ATTTTAAAAG ATATAGAAAT TGTTGGAATA
GCTGTTAAAA ATATTAATAA AAATAGAAAT ATTGAATTAG AAAATAATCT ATTAATAGAT
GATCCTAATA AATTAATTAA CGACTCTAAT ATTGACGTAA TTGTAGAAGT AATGGGAGGT
ACGGATATGG CCAAAGATAT TATTTTAAAA TCTCTAAAAG CTCGAAAATC TGTAGTTACA
GCGAACAAAG CAGTCATCGC AAGATATGGG AGTGAAATAT ATGCTACGGC TGCAAAGTAT
GGGGTTTATG TTTTAACAGA AGCAGCAGTT TGCGGAGGGA TCCCAATAAT TGAACCTTTA
AAAAGATCTC TTAAAAGTAA TCAAATAAAT AAAATAGTTG GGATAATAAA CGGCACAACA
AATTTTATTC TTTCAAAAAT GACAAATGAA AAAGCTGATT ATGAGGAAAC ATTAAAACTA
GCTCAAAAAC TAGGGTTTGC AGAATTTGAT CCAACTGCTG ATGTTGAAGG TCATGATGCA
GCTGATAAGA TTTCAATTCT TTCTGAACTG GCATTTGGAG GAAGAATTAA TAGAGATTCA
ATTAGTACGG AAGGCATTAA TCAAATAAAT ATTAAAGATA TTGAATATGC AAATAAACTT
GGTTTTGAAA TAAAACTCTT AGCCTTCTCA GAGAGAAATA CTATCAATAA CAAAAATTCT
CTTTCCTTAA ATATTTGGGT AGGACCCTCT TTAATTCCTA AATATCATCC TTTAGCAAAG
GTTGACGGTG TAAATAATGC CATTCTTATA GAGTCATATC CACTTGGAGA AATAATGTTA
TACGGCCCAG GTGCGGGCAG TGGTCCAACG GCAGCTTCTG TAGTTTCTGA TATATTAAAT
CTTCAGGCTT CATTATCAAA AAATTTACCT ACAAAAGATC CATTATTATC TTTTAATTTT
TGGAGAGACT GTCACATAAT AGATTTCAAT CAAATATCAA AAAAGAATTA TCTCAGAATT
ATTTGTTTAG ATTCTCCTGG TGTAATTGGT AAGATCGGAG ATTTATTTGG GAAGAATGAT
GTCTCAATAG AATCAATAGT TCAACTAGAT GCAAGTGAGA ATAAAGCTGA AATAGTTGTT
ATCACTCATG AAGTGTCAAA TGGTAACTTT GAAAAATCAA AGGAGGGGAT AAGAGCCCTT
CCTGAAGTTG AATTAATTGC CAGCCAATTA AGTTGTATTT GA
 
Protein sequence
MGKCKIGIIG FGTVGTGIYR ILKSRNDVHP ILKDIEIVGI AVKNINKNRN IELENNLLID 
DPNKLINDSN IDVIVEVMGG TDMAKDIILK SLKARKSVVT ANKAVIARYG SEIYATAAKY
GVYVLTEAAV CGGIPIIEPL KRSLKSNQIN KIVGIINGTT NFILSKMTNE KADYEETLKL
AQKLGFAEFD PTADVEGHDA ADKISILSEL AFGGRINRDS ISTEGINQIN IKDIEYANKL
GFEIKLLAFS ERNTINNKNS LSLNIWVGPS LIPKYHPLAK VDGVNNAILI ESYPLGEIML
YGPGAGSGPT AASVVSDILN LQASLSKNLP TKDPLLSFNF WRDCHIIDFN QISKKNYLRI
ICLDSPGVIG KIGDLFGKND VSIESIVQLD ASENKAEIVV ITHEVSNGNF EKSKEGIRAL
PEVELIASQL SCI