Gene P9303_09171 details


Gene Information

Locus tag: P9303_09171
Symbol:
ID: 4778730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 833380
End bp: 834948
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 55%
IMG OID: 640086426
Product: hypothetical protein
Protein accession: YP_001016933
Protein GI: 124022626
COG category: [S] Function unknown
COG ID: [COG1543] Uncharacterized conserved protein
TIGRFAM ID:
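
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and do not come from the record.

```python
# Coordinates copied from the record above.
start_bp = 833380
end_bp = 834948


def gene_length(start, end):
    """Length in bp of a 1-based, inclusive coordinate range (assumption)."""
    return end - start + 1


def protein_length(gene_len, stop_codons=1):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - stop_codons


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints: 1569 522
```

Under these assumptions the result agrees with the listed Gene Length (1569 bp) and Protein Length (522 aa).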


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.127848
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCAAAG GTGCTCTGGC CCTGGTTCTC CATGCCCATC TTCCATATGT GCGATCGGCA 
GAGCCCGGTT CATTGGAAGA AGACTGGTTT TTTCAAGCCC TAATTGAGTG CTATTTACCT
CTTCTCCAGG TTCTGGAAGA AGCTGCCGCA GCACCCAACC AACACCCCAG GCTCACGATC
AGCCTTTCTC CGACCCTGCT CTCCCTACTC AGTGACGACG ATCTAAAACA CCGATTCCCC
GCCTGGCTCG CTGTTCGTCT GGACCTCCTA ACCCAAACGG CCTCAGACCT ACAACCTGCA
GCGGACCACC TCGCTGAGAT CATCCAACGA AATCTGCATC AATGGTTGGC ATGCGAAGGC
GATCTCATTG GTCGCTTTGC CCAACTTCAA AGGTCAAAGG TTGTCGACCT GCTCACCTGC
GGAGCCACCC ATGGCTACAT GCCATTACTG AGGGAGCACC CTGAGGCAGT GCGTGGTCAG
TTGCGCACTG CCGTACGCGA GCACCACCGC CTACTAGGAG AGCAGCCCCT GGGAATCTGG
CTGCCGGAAT GTGCCTACTA CGAAGGGCTG GACCGTTGGA TACTCGATGC TGGACTGCGC
TACACCGTGC TCGACGGGCA TGGCCTCCTT CATGCCACTC CTCGTCCGCG TTATGGCGTA
TATGCCCCAA TCTGTAGCCG AAATGGCGTC GCCTTCTTCG GCAGAGACAG TGATGCCACG
CTTCCTGTCT GGTCAGCCCA GCAAGGGTAT CCAGGCGACC CTTATTACCG TGAATTTCAT
AGGGATCTTG GTTGGGACCT ACCAATCGAA CAACTCCATG ACATCGGGCT AAAGGAGCCC
AGACCCTTAG GGCTGAAACT GCATCGAGTG ACAGACCAAA GGTCACCCCT TGATGCCAAA
GAAGTTTATG AACCCGCCAT AGCTTGCGCA CTTACCAAAG AACACGCTCA GCTCTACTTA
AAGGGTCGCC GTATCCAACT CGATCAATTA ACTAACACCA TGGCCATCGA GCCATTGTTA
GTGGCTCCCT TCGACGCAGA GCTCTTTGGA CACTGGTGGT TTGAGGGGCC AACTTTCCTT
GCCGAAATCT TCCGCCAGGC CAGCAAGGAG CAGGTGGATT TCACCAGGCT TCGAGACGTG
CTCACATCAA ACCCCCAACT CCAACTTTGT GAACCATCTC CCTCAAGCTG GGGGCAAGGT
GGCTACCACG ACTATTGGCT CAATGACAGC AATGCATGGG TGGTTCCTGA GTGGAGCCGA
GCCGGAAAAG CAATGATGGA GAGATGCAGC CTAGGAGTGG CCCGAGAATC CGACCTACGG
CTGCTGCAAC AGGCTGCTCG AGAACTTCTC CTGGCCCAGT CCTCCGACTG GAGTTTCATT
CTGCGAGCAG GCACAACCAC GGAGCTGGCG AAGGAACGCA TCCATCGCCA CCTCAACCGT
TTCTGGCAAT TAATGCAGGC CATCAATGAC AAGCAACATC TGCCCGAAGA CTTGCTGATC
ACACTCGAAT CGGAAGATGG CCTCTTCCCA TTCATTCAAG CGACGGACTG GGCTCGCATT
CGTGACTAG
 
Protein sequence
MAKGALALVL HAHLPYVRSA EPGSLEEDWF FQALIECYLP LLQVLEEAAA APNQHPRLTI 
SLSPTLLSLL SDDDLKHRFP AWLAVRLDLL TQTASDLQPA ADHLAEIIQR NLHQWLACEG
DLIGRFAQLQ RSKVVDLLTC GATHGYMPLL REHPEAVRGQ LRTAVREHHR LLGEQPLGIW
LPECAYYEGL DRWILDAGLR YTVLDGHGLL HATPRPRYGV YAPICSRNGV AFFGRDSDAT
LPVWSAQQGY PGDPYYREFH RDLGWDLPIE QLHDIGLKEP RPLGLKLHRV TDQRSPLDAK
EVYEPAIACA LTKEHAQLYL KGRRIQLDQL TNTMAIEPLL VAPFDAELFG HWWFEGPTFL
AEIFRQASKE QVDFTRLRDV LTSNPQLQLC EPSPSSWGQG GYHDYWLNDS NAWVVPEWSR
AGKAMMERCS LGVARESDLR LLQQAARELL LAQSSDWSFI LRAGTTTELA KERIHRHLNR
FWQLMQAIND KQHLPEDLLI TLESEDGLFP FIQATDWARI RD