Gene P9303_01441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_01441 
Symbol 
ID4776592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp158274 
End bp160001 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content48% 
IMG OID640085643 
Productthiamine pyrophosphate-requiring enzyme 
Protein accessionYP_001016164 
Protein GI124021857 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCTCC TAAATTCAGC CAGGCAGACG ATGACTTGTG TGCCTGTGGT TCACGAAGTT 
GCAGCAGTTG TGGCTGCTGA GTATCACAAT GCAACCGTCT CATCTGGCAA GGCAGCCAAG
CCAAGGGGGC GAGCCATTGC TCTGGTTACA GCAGGCCCGG GATTAACCAA TGCTCTGACG
GGCCTCGCTG GGGCTTGGTT AGAAAGCCGT GAGCTACTTC TTCTTGGTGG TCAGGCGAAA
GTGGCGGATC TTGCTTCGTC TGGTTTGCGC CAATTGGGCA TTCAGGAGGT GAATGGTGTT
GCCATGGCCG CTCCCGTTTG CAAAAGGAGC CTTTGCTTGC GAGAACCATT GGAACAACAA
GCATTTTTTG CTGAAGTCTC TTCAGGCTGG CAGGCTCGTC CTGGCCCTGT ATTCCTAGAG
TTTCCGCTTG ATGTTCAAGC ATTGCAAGTT CCTGAAGCTT GGGTCGCTGG CATCGATGAA
GCAAATATCA ACAAGGCTGA TGACGCTGCT CCTGTCATCG ACAGTGACAT GGTTCATTCA
TTAGCAGCTT CTATAGCCGC GGCTGAGCGT CCTGTCTTGC TGCTTGGAGG GGGGATCTCT
TCTGTAACGG CTCAGCAGTT AGAGCCTCAG CTTGCTTCGC TTGGTTTGCC AGTAATGACG
ACGTGGAATG GAGCCGATCG CTATGGCGCA GAGCATTCCA ATTATTTTGG CAGGCCCAAT
ACCTGGGGAC AGCGCTATAG CAATCTGCTG ATTCAGCAAT CAGATTTTTT GGTTGCGATC
GGCAGCCGGC TTGGGCTCCA GCAGACAGGC TTTAATTGGC AAGAATTTGT GCCAGTTGGG
AAAGTTATTC AGGTTGACAT TGATCCAGCT GAGCTGGCTA AGCCCAATCC AAAACTTGAT
CTAGCCATTG AAGCGAATGC TAATGATTTT ATACAACAGT TGCTTAGTTT TGACTTAGGT
AGCCACCCTG ATTGGCTTGC TTACTGTTCT GATGTTAAAA CCAGATTGCC GATTTCTGAA
GCCTGCAATA TCACACCTGT TGGTTATCTC AATCCCTTTG AAATGGTAAT CAAGCTATCC
ATGCTTTGCA ATGCTTCTGA TCACATTGTT CCTTGTAGTA GTGGAGGAGC ATTCACTGTA
ATGATGCAGG CTTTTGAGCT TCAGCAGGGT CAGACAATGA TCACCGATAA GGGATTAGCG
AGCATGGGGT ATGGCTTATC AGGTGCTATC GGAACATCAA TTGCTGACCC TGATGTACGT
ACAGTGCTAG TTGAGGGCGA TGGTGGATTC ACCCAGAACC TTCAGGAGTT AGCAACTGTG
GCCGTGAATA ATCTCAATCT AAAGATGTTC CTGTTTTGCA ATAATGGTTA TGCATCGATC
AGGATGACGC AAAAGAATTA CTTTGATGGT GCTTATATGG GTTGTGATGT TTCTTCGGGT
TTAGGCTTCC CTGATTGGTC TAAACTTGCC GAGGCTTATG GGATTGATTG CTTTGAGCTA
GGAGAGGCTT GGTGGGATGC TGAACGATTT GACCATTTGT GGAATCACCA AGGCCCTGCT
CTGTTTTTGG TTCCATTGCA TCCTGAACAG ACATATTCTC CTAAGATCGC TAGTCGCATT
AGTGCTAATG GCGGCATGGA ATCAAATCCC TTGCATCGAA TGAGTCCGGA TTTAGATCAA
GAGCTTGAGG ATTTCGTGAC ACGCTTTATT CCAAAAAAAG CGTCTTAA
 
Protein sequence
MHLLNSARQT MTCVPVVHEV AAVVAAEYHN ATVSSGKAAK PRGRAIALVT AGPGLTNALT 
GLAGAWLESR ELLLLGGQAK VADLASSGLR QLGIQEVNGV AMAAPVCKRS LCLREPLEQQ
AFFAEVSSGW QARPGPVFLE FPLDVQALQV PEAWVAGIDE ANINKADDAA PVIDSDMVHS
LAASIAAAER PVLLLGGGIS SVTAQQLEPQ LASLGLPVMT TWNGADRYGA EHSNYFGRPN
TWGQRYSNLL IQQSDFLVAI GSRLGLQQTG FNWQEFVPVG KVIQVDIDPA ELAKPNPKLD
LAIEANANDF IQQLLSFDLG SHPDWLAYCS DVKTRLPISE ACNITPVGYL NPFEMVIKLS
MLCNASDHIV PCSSGGAFTV MMQAFELQQG QTMITDKGLA SMGYGLSGAI GTSIADPDVR
TVLVEGDGGF TQNLQELATV AVNNLNLKMF LFCNNGYASI RMTQKNYFDG AYMGCDVSSG
LGFPDWSKLA EAYGIDCFEL GEAWWDAERF DHLWNHQGPA LFLVPLHPEQ TYSPKIASRI
SANGGMESNP LHRMSPDLDQ ELEDFVTRFI PKKAS