Gene P9301_16881 details

Gene Information

Locus tag: P9301_16881
Symbol:
ID: 4911854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1420499
End bp: 1421590
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 29%
IMG OID: 640161286
Product: glycosyl transferase family protein
Protein accession: YP_001091912
Protein GI: 126697026
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
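
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1420499
END_BP = 1421590
GENE_LENGTH_BP = 1092
PROTEIN_LENGTH_AA = 363


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the listed Gene Length and Protein Length.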


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATCTG TTAATACTTG GAATTTAACT AATAATAAAC TTCATCAATT ATTTAAAGAC 
AATAACGAAT TTATTTCTAT TAAAGTCCGT GGTAATACTT GGGAGCCAAT CACTAGATGG
CTAAGATTAG ATTCAAGAAT TTTTAGAGAA ACTACTAGCA AAGCAAGAAT AACTTTATGC
GATATCGAAT CTCTAGCAGA AATTTATAAC TACAGATCAA TTAGATGGAA AGCAAAAAAA
CTAACTCCTA TTCCCACTAA AGTAATACCA CAATCTATTA AAAATATTTT TCGCAAAATA
CCAATCATAA AACAACTCGC CTATGAACTC GAAATAGTTT TTTATAAATA CAGTGAAAAT
ATTTCTGAAC ATTTAATATC AATAGTGATT CCTGCAAGAA ATGAAGCTGG TAATAAAGAA
CTATTAATTA ACGCTTTAAA TAAATTCAAA AATATACCAA ATAAGTTAGA AATTATATTT
GTTGAAGGTA ATAGCAATGA TAATACATAT GACATTTTGA AAGAATTAAA AGAAAATTTC
TCAAATTTCT TCAAGATATT TCTTTTAAAA CAAACTTCTA AAGGGAAGAA AAATGCAGTC
GTGGAAGGGT TTAATATTTC TTCAGGTGAG ACTCTCGCCA TAATTGATTC TGATTTTACA
GTAGATATTG ATGACAGTAT TGCAGCAATT ATGGAATCAA CCAAAAATAA AAATATACTT
ATTAATTGCG CCCGCACAAC TTTTCCAATG GAAAAAGATG CGATGAGATG GGCAAATTAT
ATAGGAAATA GACTTTTCGC AATTTTTCTA TCAATTCTAA TAAATAAGCC CGTATCAGAT
TCACTCTGTG GAACAAAAGT TTTTTCAAGA AAATTCTTTA AACTTATGAA ACAAAACGGA
AGTTGGGATT CCAAGTCTGA CCCATTTGGA GACTTTACAA TAATATTTGA AGCTGCGAAA
AATAACATTA AAATACTAAA TTATCCTGTT AGATATTACG CTAGAAAATC AGGCGCACCA
AATATATCTA GATGGATAGA TGGATTAAAA CTGCTCAAAG TATGCTGGAT TTATATGATT
TCTGATATCT AG
 
Protein sequence
MRSVNTWNLT NNKLHQLFKD NNEFISIKVR GNTWEPITRW LRLDSRIFRE TTSKARITLC 
DIESLAEIYN YRSIRWKAKK LTPIPTKVIP QSIKNIFRKI PIIKQLAYEL EIVFYKYSEN
ISEHLISIVI PARNEAGNKE LLINALNKFK NIPNKLEIIF VEGNSNDNTY DILKELKENF
SNFFKIFLLK QTSKGKKNAV VEGFNISSGE TLAIIDSDFT VDIDDSIAAI MESTKNKNIL
INCARTTFPM EKDAMRWANY IGNRLFAIFL SILINKPVSD SLCGTKVFSR KFFKLMKQNG
SWDSKSDPFG DFTIIFEAAK NNIKILNYPV RYYARKSGAP NISRWIDGLK LLKVCWIYMI
SDI
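
For readers who want to reproduce the protein sequence or the GC content from the gene sequence, a minimal pure-Python sketch follows. It is an illustration under stated assumptions, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is the standard one (whose codon-to-amino-acid assignments translation table 11 shares), and alternative start codons are not handled.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGAGATCTG TTAATACTTG GAATTTAACT"))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields of the record.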