Gene P9211_14261 details


Gene Information

Locus tag: P9211_14261
Symbol:
ID: 5731027
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1286369
End bp: 1288036
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 31%
IMG OID: 641285803
Product: glycosyltransferase
Protein accession: YP_001551311
Protein GI: 159903967
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
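
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(1286369, 1288036)
    print(length, protein_length_aa(length))
```

Under these assumptions the coordinates reproduce the listed 1668 bp and 555 aa.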


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.259667
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00324745
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATTAAAA AGAGCTTAAG TAGCAGAATT GAGAATATTT ATAGTCCTAT ATGCTCATAT 
ATGAAAAAAC TTTATGCGGA GAAATATTTT TCAATATTGG TCATTATATG TATTTGTGGA
TTTGTTGGAT TTGTTTGCCT TTTCAATGAA AGCACACAAA GTTTTTTTGC TCACGATGAA
GGTCTTTATG CAAGAAGAGC AAAGTTGATA TTGGATACAG GAGATTGGTT TGCACCTTTT
TCTAAAGCTC ACCATAAGAC AATTGGTAGT TATTGGTTAA TAGCATTAAG CATGAAGCTT
TTTGGCTTAG GTGAATACTC AGCAAGAATA CCAAGTGCTT TGTTTTCGGT TCTTAGTTCA
ATAGTTGTTT ATAAGATAGG CTTAGAAATA TCAAATAAAA GATCAGCATT TATATCGGCT
TTAATACTTC CATCTATGCC GCTTTGGTTT CAATATAGTC ATTACGCCAG CCCTGACATG
GCGTTTGTTT TTTTAAATTT ATTTGCTATT TATATGATTC TAAGAGCAAG TAGTGAAAAT
AACAGACAAT CTAACCATCA ATTCTTTTAT TGGTTTTTAA CAGGCATTTC CTTCTCTCTT
GCTTTCCTGA TAAGAAGTTT TTTAGCTCTA CTACCGATTT TTGCTTTACT GCCTTTTATA
TGCCTATCTT TAAGACAGCA GGGTAGAAAA AAAGTTTATT TCTTATTGGG GGGGATGATA
ATAGGCTTTA TACCCGTCAT AGTCAGCATT TTCTATGCAT ACAATGCATA TGGATCTGAG
GCTTTTCTGG AGTTATTTGA CTTTGCTAGA AGAAAGTCTA TGGGAGGGAA TTTGTTTAAG
GGATTATTCT ATTACCCTAT TATTCTCATT ATTCTTTCTT ATCCATCAGG ATTGATCAGT
ATATTTGGCT TTATCAGGGT AAATCAATAC AAGAACTTAA AGCTCAGATA CCTCTTGTCT
ATTTTCCCGT TGGTCATATT ATTTGCCTTA ATGGTTGCAT CGACGGCTTT AAGTCATTAT
GCCTTGATGC TAATACCTTG GATTGCTATA GCATCAGGAA TTGCAATTGA TTCATTGATT
TCATCAGAGC TTCTTAAATC ATATCAATTT AAGAAGCTTT GCGCGTACAT ATTTTTATTA
ATAGGATTAT CTCTGATGGT AATTCTTTTA TTTAAAATTA CAGGACTTGT ATCAATAGAT
GTGCTTGATA AGCCAATAAT AATAAGCTCT TTCTTTGTTG TATCATTAGT TAATATATCT
GCTGGATTAG TTGGAATTAG GGGATTAAAT AATCATAGAA ACTTTTCAAT TTCAATTGGT
TTGATGGTAA TTACACAAAC TATTCTTTTA ACTATACTTT ATGGAATTGG AATCTTGGGT
AATCCTAATC AAGAGATAAA AACGTTTGTT CGAGAGCCAT TCGTAAATGA AATACTACGT
TCAAATACTG TTTATCTTAT AGGTGTGAAT AGAAATACTA AAGTTAGAAC TTTAATGGAA
TTTTATCTGC CTAATTATCA GGATTATCAA AAGTCACTTG ATCAAGTCAA GGGAAACTCC
TATTTTATGG TTAGCAAGGA TGCTCTTTTG GAACTCTTAA AATTTAAAAA ATATAGTATT
AATAAAATAG CGAAGTACAA GGAATTCTTT TTTATAAGAG TTAATTAA
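
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the listed values. A minimal sketch, assuming plain A/C/G/T characters with no ambiguity codes; the helper names are hypothetical:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; assumes only A/C/G/T are present."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = ("ATGATTAAAA AGAGCTTAAG TAGCAGAATT "
                  "GAGAATATTT ATAGTCCTAT ATGCTCATAT")
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 3))
```

Running the same two functions on the full sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields.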
 
Protein sequence
MIKKSLSSRI ENIYSPICSY MKKLYAEKYF SILVIICICG FVGFVCLFNE STQSFFAHDE 
GLYARRAKLI LDTGDWFAPF SKAHHKTIGS YWLIALSMKL FGLGEYSARI PSALFSVLSS
IVVYKIGLEI SNKRSAFISA LILPSMPLWF QYSHYASPDM AFVFLNLFAI YMILRASSEN
NRQSNHQFFY WFLTGISFSL AFLIRSFLAL LPIFALLPFI CLSLRQQGRK KVYFLLGGMI
IGFIPVIVSI FYAYNAYGSE AFLELFDFAR RKSMGGNLFK GLFYYPIILI ILSYPSGLIS
IFGFIRVNQY KNLKLRYLLS IFPLVILFAL MVASTALSHY ALMLIPWIAI ASGIAIDSLI
SSELLKSYQF KKLCAYIFLL IGLSLMVILL FKITGLVSID VLDKPIIISS FFVVSLVNIS
AGLVGIRGLN NHRNFSISIG LMVITQTILL TILYGIGILG NPNQEIKTFV REPFVNEILR
SNTVYLIGVN RNTKVRTLME FYLPNYQDYQ KSLDQVKGNS YFMVSKDALL ELLKFKKYSI
NKIAKYKEFF FIRVN
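
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a hand-rolled translator, not the database's own method. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons. Alternative start codons are not handled here, which is adequate for this gene because it begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate("ATGATTAAAA AGAGCTTAAG TAGCAGAATT"))
```

The first ten codons translate to MIKKSLSSRI, matching the first block of the protein sequence above.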