Gene P9303_18581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18581 
SymbolthrB 
ID4775974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1620202 
End bp1621152 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content57% 
IMG OID640087367 
Producthomoserine kinase 
Protein accessionYP_001017865 
Protein GI124023558 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0083] Homoserine kinase 
TIGRFAM ID[TIGR00191] homoserine kinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.566516 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAGC CGAGTATCGG CCAAAAAATT GTGGTTGACG TACCGTCCAC CACCGCCAAC 
CTTGGTCCTG GCTTCGACTG CCTTGGTGCT GCCCTTGACC TCAACAACCG TTTTGCCATG
CGGCGGATCG AAGGGGACAG CGGACGCTTT GAACTCATTA TTGAAGGCAA TGAGGGAAGC
CACCTACGCG GTGGGCCCAA CAACCTGATT TATCGCGCCG CCCAGAGGGT GTGGAAAGCC
GCAGGGCTCG AACCGGTGGG ACTAGAAGCC AAAGTGAGGC TTGCGGTACC CCCCGCAAGA
GGCCTAGGAA GCAGTGCCAG TGCCATCGTG GCAGGGTTGG TTGGAGCCAA TGCCTTGGTG
GGCGAACCTC TCAGCAAAGA AAAACTGTTG GAACTGGCCA TTGACATTGA GGGACATCCC
GACAATGTGG TGCCTTCTCT TCTAGGAGGT CTTTGCTTGA CTGCCAAGGC AGCCTCGCAA
CGCTGGCGGG TTGTTCGTTG TGTCTGGATC AATTCCGTGA AAGTCGTTGT AGCAATCCCC
TCTATTCGCC TAAGCACGAG CGAGGCAAGG CGCGCCATGC CTAAAGACAT TCCGATCAGC
GATGCCGTAG AAAACCTTGG TGCCCTTACG CTCCTGCTGC AGGGACTGCG GACAGGAAAC
GGCGACCTGA TTACAGACGG GATGCACGAT CGATTGCATG AGCCCTATCG CTGGCCATTA
ATCAAAGGTG GTTTGGATGT TCGCGATGCG GCTCTGAATG CCGGGGCCTG GGGTTGTGCC
ATCAGCGGAG CTGGCCCCAG CGTGCTGGCT CTGTGCCCGG AGGATAAAGG GCAAGCAGTC
AGTCAGGCAA TGGTAAAAGC TTGGGAGGCC GAGGGTGTAG CAAGTAGGGC ACCACTGCTT
AGCATTCAGA CAGGAGGGAG CCACTGGCAA CCTCAAATTG AGGATGAGTA G
 
Protein sequence
MAQPSIGQKI VVDVPSTTAN LGPGFDCLGA ALDLNNRFAM RRIEGDSGRF ELIIEGNEGS 
HLRGGPNNLI YRAAQRVWKA AGLEPVGLEA KVRLAVPPAR GLGSSASAIV AGLVGANALV
GEPLSKEKLL ELAIDIEGHP DNVVPSLLGG LCLTAKAASQ RWRVVRCVWI NSVKVVVAIP
SIRLSTSEAR RAMPKDIPIS DAVENLGALT LLLQGLRTGN GDLITDGMHD RLHEPYRWPL
IKGGLDVRDA ALNAGAWGCA ISGAGPSVLA LCPEDKGQAV SQAMVKAWEA EGVASRAPLL
SIQTGGSHWQ PQIEDE