Gene P9211_14251 details


Gene Information

Locus tag: P9211_14251
Symbol:
ID: 5730330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1284709
End bp: 1286310
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 39%
IMG OID: 641285802
Product: glycosyltransferase
Protein accession: YP_001551310
Protein GI: 159903966
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
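
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the 1602 bp reported here) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1284709, 1286310)
print(length)                     # 1602
print(protein_length_aa(length))  # 533
```

Both results agree with the Gene Length and Protein Length fields in this record.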


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.007378
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGAA TCACTCTCAT TACAGATGCA CCTTTGATCG GGTTATTAGT TTTATGGTTT 
ATTGCTTTTT GCCTTTCTTT ATATGGACTA GGAGGTGTTC CATTAAGAGA TTTTGATGAA
GCCACAGTGG CTAGAGTTGC ATATGAATTA AGTCAAAGCA ATGGTGCTGA CCAACTACTC
CCAACAATTT GGTCAAACCC TTATTTAAAC AAACCTCCTG GCCTTCATTG GGCAATAGCC
AAGTTGATTA ATTTGAATAA TTTTTTCTCA TCAAATCAAC TTCCATCAGA ATTCCTAATA
AGACTTGCAC CTGCATCTTT ATCCACATTA GTTATTCCAT TGGGTGGTCT AATTCAATGG
AAGTTGCGAC CAAAGGAACC AATAACAACT CTTGCAACCA GCACTATTCT CCTGACATTG
CTGCCAATAA TTAGACATGG GCGACTAGCA ATGCTTGATG GGACACAATT GACGGCTATC
GCCTTGTTTT GGCTATTACT GGTGTCTATC GATAATTCTC CAAGAGACAA TATTAGGTTT
TTAATGTCTG GCCTAATTGC CAGCTTCATG CTCTTACTAA AAGCTCCTTT ATTAATTCCA
ATTATTTGTG CAGCTGTGTT GCCAATGCTA ATGGAAAGGG TCCTATGGGA ATGGTCATGG
GGCAAATGGT TCTGCATAGG ACTGGTTCCA GGAAGTTTAT GGCATATATG GCATGCCATT
AATCGTGGAA ATGAGGCATT AAATCTTTGG CTAGGTGATG GTGCGTCAAG AGTTATTTTC
GATTCCGGGC AAGGGAGTGA TCTTGGCTTT TTGGTTCCCT TAATTGAATT AATTGAAGGA
GGCTGGCCTT GGCTGGTTCT ATTACCCTTT GCAACTTACC TGGCGTGGCA TGAGCGTAAA
AGCAAGTGGG GCAAGTGGGT TATAGGTACA TCCTTAATTC TTATAATCTC TATTTTGCCT
TTAAAGACTC AATTACCTTG GTATTCCCAT CCGCTGTGGT TACCATTTGC TTTACTTTGT
GGGCCAGCAG TTTCTGAATT GATCAAGAAA AGTAAAAGTA TTTTTCACCG AAACCTTTTA
CTGCGACTAA TACCATATTC GTTCTTATTA CTAGGGTCTT TAGTGCTTTC TTTTTCTCTT
CTAAGCTTTG CTGGGCTAAT TAAAGGTTTT GAGTCATATT TACTTATTTC GATTCCAGTT
GGTTCAGGCT GGCTAGTGGG AGGTTATTTA CTAAATCACA AGAAAATGAA AATAAGGCAA
CTGGCAATAT CCTCTTTAGC TTTAGGTAAC TTAGTAGGTC TATTTATTCT AATGGGGTCC
CCAAACTGGA TATGGGAGCT AAATGAAACT TGGAATGCCA AGCCTGTAGG TGAAATGATT
AGAAAAGCCA ATCCAGGAGA AGTAGTAATG GAAGGAAGTA ACGAGAGACC AAGCTTAAAT
TGGTATGCAG AGCAAAGAAT AGTAAATAAA AGTAGTAGGC AATCTGATCA ATGGTTGCTA
ACTTCTAACC AGAGAAAAAG TAAATATCTG ATTGAAAAAG AAAAATGCCA AGAGAAAGGT
TCAGAAGGTA AATGGATGCT AATATATTGC AAAAGTAAAT AA
 
Protein sequence
MKRITLITDA PLIGLLVLWF IAFCLSLYGL GGVPLRDFDE ATVARVAYEL SQSNGADQLL 
PTIWSNPYLN KPPGLHWAIA KLINLNNFFS SNQLPSEFLI RLAPASLSTL VIPLGGLIQW
KLRPKEPITT LATSTILLTL LPIIRHGRLA MLDGTQLTAI ALFWLLLVSI DNSPRDNIRF
LMSGLIASFM LLLKAPLLIP IICAAVLPML MERVLWEWSW GKWFCIGLVP GSLWHIWHAI
NRGNEALNLW LGDGASRVIF DSGQGSDLGF LVPLIELIEG GWPWLVLLPF ATYLAWHERK
SKWGKWVIGT SLILIISILP LKTQLPWYSH PLWLPFALLC GPAVSELIKK SKSIFHRNLL
LRLIPYSFLL LGSLVLSFSL LSFAGLIKGF ESYLLISIPV GSGWLVGGYL LNHKKMKIRQ
LAISSLALGN LVGLFILMGS PNWIWELNET WNAKPVGEMI RKANPGEVVM EGSNERPSLN
WYAEQRIVNK SSRQSDQWLL TSNQRKSKYL IEKEKCQEKG SEGKWMLIYC KSK
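
The gene and protein sequences above can be checked against each other, and the GC content field recomputed, with a short script. This is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), and it assumes the sequence is given 5' to 3' on the coding strand, as it appears here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built from the usual T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGAAAAGAA TCACTCTCAT TACAGATGCA"
print(translate(first_blocks))  # MKRITLITDA
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content field in this record. Alternative start codons, which table 11 translates as methionine at the first position, are not handled here; this gene begins with ATG, so the case does not arise.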