Gene P9303_08881 details

Gene Information

Locus tag: P9303_08881
Symbol: thrA
ID: 4776348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 805183
End bp: 806499
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 56%
IMG OID: 640086397
Product: homoserine dehydrogenase
Protein accession: YP_001016904
Protein GI: 124022597
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
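
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the 1317 bp gene length includes the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


gene_len = feature_length(805183, 806499)
print(gene_len, protein_length(gene_len))
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of this record.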


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGA AGATAGGAGT TGGTCTCCTG GGGCTTGGAA CAGTAGGCGC CGGTGTTGCA 
GGCATCCTCC AAGCCCCCGA AGGGCGGCAT CCTCTGGTGG CAGAGCTTGA ACTGGTGCGT
GTAGCGGTTC GAAATCTGCA ACGACCTCGC TCGATTGAAC TTCCCGCATC CTTGCTGACC
AACAACCCAC AGGCTGTTGT TGATGATCCC TCTGTGCAAG TGGTTGTTGA AGTCATGGGA
GGGATCGAAC CAGCCCGAAC CCTGATCATG CGAGCCATTG CCGCCGGCAA AGCAGTTGTT
ACAGCTAACA AAGCCGTAAT TGCAAGACAT GGCGAAGAGA TTGCAGCTGC TGCAGCTGCT
GCAGGGGTAT ACGTACTGAT CGAAGCTGCC GTCGGCGGAG GCATCCCGAT CATCGAGCCA
CTTAAGCAAT CACTAGGTGG CAACCGGATC GAACGCGTGA GCGGCATCAT CAACGGCACC
ACCAACTACA TCCTCAGTCG CATGGCACAG GAAGGGGTGG CCTATGACGA CGTGCTCAAG
ACCGCCCAGG ATCTTGGCTA TGCAGAGGCA GATCCAGCGG CCGACGTTGA GGGCTTCGAT
GCAGCAGACA AGATTGCCAT CCTCAGTGGA CTGGCCTTCG GTGGACCTGT CAACCGTGAC
TCAATTCCCA CCCAAGGCAT CAACAAGCTT CAAAGCCGCG ATGTGGACTA CGCCAAACAG
CTTGGCTACA GGGTGAAATT ATTGGCTGTC GCCGAACGTC TCAACTCAGA CGCTCAGACC
AGTCAGTCCT TGCCCTTAGC TGTAAGGGTG CAACCAACAA TGGTGCCTCT AGACCACCCG
CTTGCAGGAG TAAATGGCGT GAACAACGCC ATCCTGGTTG AGGGCGATCC GATCGGCCGC
GTGATGTTTT ATGGCCCAGG AGCAGGTTCT GGACCCACCG CCTCTGCTGT GGTAGCTGAC
ATCCTCAACA TCGCTGGCAT ACGCCAACTA GGTGAAGTCC ACGGCAGCCT CGATCCCCTT
CTCGCCGCAA GCAGTTGGCG TTCCTGTCAC CTCGTTGATC CAAGTGCCAT CCGTCAGCGC
AACTATGTGC GCTTCAATGC AGAGGACACA CCAGGCGTGA TCGGTCGGAT CGGTAGCTGC
TTCGGCGATC GTGCAATTTC AATTCAATCA ATCGTGCAAT TCGATGCCTC CGATGCCGGC
GCAGAAATCG TTGTAATTAC CCATGAAATA AGTCAAGGCC AGATGCAAGA TGCCCTTACT
GCAATCACCT CTATGGCTGA GGTCAAAGGA CTTGCCGCCC ATCTCAGCTG CCTTTAA
 
Protein sequence
MGKKIGVGLL GLGTVGAGVA GILQAPEGRH PLVAELELVR VAVRNLQRPR SIELPASLLT 
NNPQAVVDDP SVQVVVEVMG GIEPARTLIM RAIAAGKAVV TANKAVIARH GEEIAAAAAA
AGVYVLIEAA VGGGIPIIEP LKQSLGGNRI ERVSGIINGT TNYILSRMAQ EGVAYDDVLK
TAQDLGYAEA DPAADVEGFD AADKIAILSG LAFGGPVNRD SIPTQGINKL QSRDVDYAKQ
LGYRVKLLAV AERLNSDAQT SQSLPLAVRV QPTMVPLDHP LAGVNGVNNA ILVEGDPIGR
VMFYGPGAGS GPTASAVVAD ILNIAGIRQL GEVHGSLDPL LAASSWRSCH LVDPSAIRQR
NYVRFNAEDT PGVIGRIGSC FGDRAISIQS IVQFDASDAG AEIVVITHEI SQGQMQDALT
AITSMAEVKG LAAHLSCL
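
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any further analysis. The following is a minimal sketch of that step, together with a simple GC fraction calculation of the kind that could be compared with the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGGGCAAGA AGATAGGAGT TGGTCTCCTG GGGCTTGGAA CAGTAGGCGC CGGTGTTGCA"
seq = join_blocks(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence text to `join_blocks` instead of `first_line` would give a string whose length can be checked against the Gene Length field.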