Gene P9303_18731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18731 
Symbol 
ID4777651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1634743 
End bp1635900 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content60% 
IMG OID640087382 
ProductNAD binding site 
Protein accessionYP_001017880 
Protein GI124023573 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.413657 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAGT CGTGGGATGT ACTGGTGGTT GGGGCCGGTC CTGCAGGGGG GCTAGCCGCC 
CTGGATTGCG CAAGGCGAGG ATTAAGGGTG TTACTGGTGG AAAAACGCGC TTTCCCCCGC
TGGAAAGTGT GTGGTTGCTG CTTTAACAAA CAGGCTCAGG CGACTCTCGC ATCTCTGGGA
CAGAACGATC TAATCATTGA TCGCGGCGGC GTACGACTGC AAACCCTTCG CCTCGGCCTG
AATGGTCGTC AAACATCTCT AGCCATTCCC GATGGTTTTG CTCTCTCCAG GGAACAGTTC
GATCAGGCAC TGATGGACGC GGTCGTTGAA GCTGGAGCTT CCGTTCGCTG TCAAATGAGC
GCCGGGGTCG AAGAAGTCCA ACCAGGCTGG CGGATTGTGC GGCTGAAGGA TCAGCGCAGC
GGTCAGCAGA ACCTCGTCAG GGCGCGTGTC GTGCTTGTCG CAGCTGGGCT TGCCCAGCGT
TGCTTACCCG AGCAGGATGC TGGCATCACC AGGATCCGTA GTCGTTCCAG GGTTGGAGCC
GGTTGTGTTC TTGCTGATGA TGAGAACCAC TACACCGCAG GCGCCATTCA CATGGCGATC
GGTGAACGTG GTTATGTGGG TCTGGTGCGC CGAGAGGACG GTTTACTCAA TGTGGCAGCC
GCTTTCGATC GGCAGGCGCT AGCCCATGGG CAAGGAGCGG CCGGAGCTAC CCAGGATGTG
CTTATGCAGG CTGGTTTCCC ACCACCTGCG GCTCTGAGAC AGGGGCAATG GCAGCTGACG
CCAGCTCTTA GTCGCGGCGC TGAGGTTGTT GCCGGAGAGC GCTTTCTGGT GATGGGTGAC
GCAGCGGGTT ATGTGGAGCC ATTTACAGGA GAAGGAATTG CCTGGGCCCT CACTGCAGGT
GCCGTGGTGG CACCATTTGT TCAAGAAGGC CTCCTGCGCT GGAGCCATGA TCTGGAGAAG
CGCTGGACGC GAGAGCTGAA GCTGCGGATC GGTCGTCGCC AGCGGATCTG TCGCACTCTG
GCCATGGTGC TGAGGCAGCC AAAGCCGACG AGGGCATTGT TTGAACTGAG CAGCCGCTGG
CCGGCACTGT CTGAAACGAT TGTTGCCAGC CTGAACCATG TGACCCTCCC CTCCGCCGGA
AGTCAGCAAT GCCTCTGA
 
Protein sequence
MQESWDVLVV GAGPAGGLAA LDCARRGLRV LLVEKRAFPR WKVCGCCFNK QAQATLASLG 
QNDLIIDRGG VRLQTLRLGL NGRQTSLAIP DGFALSREQF DQALMDAVVE AGASVRCQMS
AGVEEVQPGW RIVRLKDQRS GQQNLVRARV VLVAAGLAQR CLPEQDAGIT RIRSRSRVGA
GCVLADDENH YTAGAIHMAI GERGYVGLVR REDGLLNVAA AFDRQALAHG QGAAGATQDV
LMQAGFPPPA ALRQGQWQLT PALSRGAEVV AGERFLVMGD AAGYVEPFTG EGIAWALTAG
AVVAPFVQEG LLRWSHDLEK RWTRELKLRI GRRQRICRTL AMVLRQPKPT RALFELSSRW
PALSETIVAS LNHVTLPSAG SQQCL