Gene P9303_25661 details


Gene Information

Locus tag: P9303_25661
Symbol:
ID: 4777914
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2260466
End bp: 2262691
Gene Length: 2226 bp
Protein Length: 741 aa
Translation table: 11
GC content: 51%
IMG OID: 640088087
Product: hypothetical protein
Protein accession: YP_001018562
Protein GI: 124024255
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
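
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2260466, 2262691)
print(length)                  # 2226
print(protein_length(length))  # 741
```

Under these assumptions the computed values match the Gene Length (2226 bp) and Protein Length (741 aa) fields of this record.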


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACT TTGTGGTTCT TTCCACTGCA GATTGGGATC ATCCGCTATG GACTAATAAG 
CAGCATGTTG CTGTCTCCTT GGCTGCTGCC GGTCACCGAG TTTTGTATGT TGATTCTCTT
GGTTTGCGTG CACCTCGTGT TGGGGCTGTC GATCGAGGCC GCATCCTGCG GCGTCTTGGT
CGAGTGTTGC GTCCGCCTCG GCGTGTAGGC GAGGCCCTTT GGGTGTGGTC ACCTCTTGTG
TTGCCCGGCG GGACGGCTGG ATTTGCTTTG ATTTTGAATC GGCAATTATT AACTCTGGGG
CTGCGTCTTG CCTTGCTCTG GCTGCGCTTC CAGCAACCAA TCCTTTGGAC TTACAACCCG
CTGACATGCC GCTATTTGGC GCTGGGTAGT TTTGGTGGAA GCATTTATCA CTGCGTCGAT
CGCATCCAGG CTCAACCAGG TATGCCGGCG GAAAGGATCA GCGCAAGCGA GCGGCAACTT
TGCCGAGCCG TGGATGTTGT CTTTACCACT TCCCCGGATC TACAGGCTGA CCTGGAAAAG
ATTCATCCTC ATACTCATTT TTTCGGCAAC GTCGCTGATC AGCAACATTT TGGACAGGCA
TTGAGTGGAA CATTGCCGTG TCCTCCTGCT CTCAATGATC TCCCTAGGCC TCGCTTGCTA
TTCATCGGCG CCATTGATGC CTACAAGCTG AATTTGCCGA TGCTTGAGAT CTTGGCGGAG
CGCCACCCTG AGTGGACCTA TGTCTTTGTT GGTCCTGTGG GGGAGGCAGA TCCCGCAACG
GATGTTTCCA ATTTGCTCAC GTTCTCCAAT GTTCACTTCG TTGGAGCTCA GCCTTATAGC
GATTTGCCTT CTTGGCTTGC TCACTGTGAT GTCGCCTTGC TGCCATTGCG GCACAACAGC
TACACCCGTC ATATGTTCCC GATGAAGTTC TTCGAGTATT TGGCGTCTGG GAAACCAGTC
GTTGCCACTG CGATTCCTGC TCTTCGTCCC CACGCTGTAG CTGCACATCT TTGTGAGTCT
GAAGCGGACT CCTTTGAAGT GGCAATTGCC AAGGCGCTTG CTTCTGAAGG ACCGGCTTTG
ACGGAGCGAC TCGCTGTAGC GGCAGAACAC ACCTATGAGG TGCGAACAGC AGCCATGTTG
TCGGTGTTGA ACGAGCTGGG AATTTTGCCT GACGCTCGAG CCAGTGGTGT TTCCTCTGGA
AGGGCCAGAG TCAGAGTGCG TCGTCTTCGT CACTATTGGC ATGAATGGCT CCTGTCGCAG
CTCGCGACTT CCTTGGCTGC TGGGCTGGAT CGAATTGGTG CCCATCACAA TGCCTTGGAG
ATGTTGCAAG CTTTTAGACA CCGTTGGCCA CTCAATCTGC CTGTACTACG TGCCTTGATT
CCACGTTCAG TGCAAGCAGG TGATTTCAAT TATGCGCTTG AAGTCATGGA AGATCTTTGG
ATTAATTACG GCCAGATTTC TTACTTGCGC AAATTACTTT TTCGTCGGGG TTCGCGTCCT
GAAGATCTAC AACAGCAGAT TGCTTTGTTC GAAACTTTAG CCAGAAGTGT TCGGCTGCCA
TTGACCTACC GATGTTATTC CCGAGTCGTG CTGGCCTATC GGATTGTTGA GAGTGGGGAT
CAAGTCAGAA TGCGTGAATC AGCGGTTGCT TTGCAGTCGT TTGTTGTTCA ACTTGAAAGT
GATCCAGGCA CAAGACTATG TCGGCGAGGA AATCGCTCGA ATCGAGCAAA GTTATTGATT
TCTTGCTATT CAACACTCAC GCGCTTGTAT TTAGCGCTTG GCGATCGAAA GTCACTGGCG
GCTATTGGAC AAAAAGCAGC GGAGTTTATG GATGGGTTCG ATCTGAATGC GATTGATAGA
GATACTTCCT TCCGTTTGAC TCGCAATCTG ATGCGATGCC TCACAATCGA TGTCCTTGAA
GCTTGGCGTT TGGGGGATCA ATCCCTTTAT CAGAGGGCGA GGCAAAGGCT TGTTCTGGTT
GTGGATCATT GTCATCAATC CATCCATGAT GAAAGCAATG CGCAAGAGGA TCATCGAGGT
TTTGCCAAGG CTCTTCTTGA AGAGGTTGAT AGCCTGGAGC CAATGATTAC TGGTCCCAGT
CATGATCCAC AAAGGATTCA CGAATTATTG CGATTAATGG TTAAGAATAA GGGATTATCA
CTTGATGGAG TGTTGCCCTT ATTCCCTGAG TATCTAGATA CTAAAGTGGC TGAGGTCTGT
CAATGA
 
Protein sequence
MADFVVLSTA DWDHPLWTNK QHVAVSLAAA GHRVLYVDSL GLRAPRVGAV DRGRILRRLG 
RVLRPPRRVG EALWVWSPLV LPGGTAGFAL ILNRQLLTLG LRLALLWLRF QQPILWTYNP
LTCRYLALGS FGGSIYHCVD RIQAQPGMPA ERISASERQL CRAVDVVFTT SPDLQADLEK
IHPHTHFFGN VADQQHFGQA LSGTLPCPPA LNDLPRPRLL FIGAIDAYKL NLPMLEILAE
RHPEWTYVFV GPVGEADPAT DVSNLLTFSN VHFVGAQPYS DLPSWLAHCD VALLPLRHNS
YTRHMFPMKF FEYLASGKPV VATAIPALRP HAVAAHLCES EADSFEVAIA KALASEGPAL
TERLAVAAEH TYEVRTAAML SVLNELGILP DARASGVSSG RARVRVRRLR HYWHEWLLSQ
LATSLAAGLD RIGAHHNALE MLQAFRHRWP LNLPVLRALI PRSVQAGDFN YALEVMEDLW
INYGQISYLR KLLFRRGSRP EDLQQQIALF ETLARSVRLP LTYRCYSRVV LAYRIVESGD
QVRMRESAVA LQSFVVQLES DPGTRLCRRG NRSNRAKLLI SCYSTLTRLY LALGDRKSLA
AIGQKAAEFM DGFDLNAIDR DTSFRLTRNL MRCLTIDVLE AWRLGDQSLY QRARQRLVLV
VDHCHQSIHD ESNAQEDHRG FAKALLEEVD SLEPMITGPS HDPQRIHELL RLMVKNKGLS
LDGVLPLFPE YLDTKVAEVC Q
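
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the Python below strips the 10-character block spacing, translates codon by codon until the first stop, and computes GC content. The codon table is written out in the compact TCAG ordering used for NCBI genetic codes; tables 1 and 11 share the same amino acid assignments and differ only in permitted start codons, which this sketch ignores. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Amino acid assignments in TCAG order (shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCCGACT TTGTGGTTCT TTCCACTGCA"))  # MADFVVLSTA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed here, and `gc_percent` can be compared against the GC content field, subject to rounding.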