Gene P9211_17931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_17931 
Symbol 
ID5730831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1620978 
End bp1622126 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content41% 
IMG OID641286179 
ProductSqdX 
Protein accessionYP_001551678 
Protein GI159904334 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.335649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGATTG CTTTTTTTAC TGAAACTTTT CTTCCTAAGG TTGATGGAAT AGTGACTCGG 
CTAACGAAAA CACTTGAACA TCTTTCCAAA GCAGGGGATG AAGTAATAGT TTTTTGCCCT
GAAGGATGTC CTGATGAATA CATGGGAGCC AAAATGATTG GTGTCCCTGC AATGCCCTTA
CCTCTTTACC CAGAACTAAA ACTTGGTCTT CCTGGCCCTG CAGTATCTGA AGCTCTTGAA
AACTTAAAGC CTGATTTAAT ACATGTAGTT AATCCAGCCG TTCTGGGGTT AGGAGGTATT
TGGCTTGCAA AGAGTAATAA CATCCCGCTC GTAGCGAGTT ATCACACACA TCTTCCTAAA
TACCTTGAAC ACTATGGAAT GGGGATGCTT GAGCCACTTC TTTGGGAATT ACTAAAAGCC
GCACATAATC AGGCAATTTT AAATCTTTGT ACCTCAACAG CAATGGTGAA GGAGCTGAGT
GATAAAGGAA TTCAAAATAC AGCCTTATGG CAAAGAGGAG TCGACACAGA AACCTTCAAC
CCAGAATTAA GAAGTGATGA AATGCGCCAA AAATTACTCG GAAAACATAG TGATACTGGT
GAATTACTAA TCTATGTCGG AAGATTGTCA GCAGAGAAGC AAATCGAACG CATAAAACCT
GTTTTAGAAG CTTTGCCCAA TACCCGTTTG GCATTGGTTG GAGATGGCCC CTACAGACAA
CAATTAGAGA AAATATTTGA AAACACTGCT ACAACATTCG TTGGTTACCT TTCGGGAAAA
GAGTTAGCTG GAGCCTATGC ATCAGGAGAT GCATTTTTAT TTCCTTCCAG TACAGAAACC
CTAGGCTTAG TACTCCTTGA AGCAATGGCT GCAGGATGCC CTGTAGTCGG AGCGAACAAA
GGGGGTATTC CAGATATTAT TAATGATGGT CAAAATGGCT GTTTATATGA TCCTGATGGG
GCGAATGGAG GGGCCACAAG CCTTATAAAC GCAACTAAGA AATTACTAGG TAATGAAATT
GAAAGACAAT CAATGAGAAA TGCAGCGAGA ATAGAAGCAG AAAAATGGGG TTGGTCTAGC
GCAACTACTC AACTAAGAGA TTTTTATCGA GCAATTCTTG AAAAACAATC CAACAAAATA
GCCGCTTAA
 
Protein sequence
MKIAFFTETF LPKVDGIVTR LTKTLEHLSK AGDEVIVFCP EGCPDEYMGA KMIGVPAMPL 
PLYPELKLGL PGPAVSEALE NLKPDLIHVV NPAVLGLGGI WLAKSNNIPL VASYHTHLPK
YLEHYGMGML EPLLWELLKA AHNQAILNLC TSTAMVKELS DKGIQNTALW QRGVDTETFN
PELRSDEMRQ KLLGKHSDTG ELLIYVGRLS AEKQIERIKP VLEALPNTRL ALVGDGPYRQ
QLEKIFENTA TTFVGYLSGK ELAGAYASGD AFLFPSSTET LGLVLLEAMA AGCPVVGANK
GGIPDIINDG QNGCLYDPDG ANGGATSLIN ATKKLLGNEI ERQSMRNAAR IEAEKWGWSS
ATTQLRDFYR AILEKQSNKI AA