Gene A9601_13921 details


Gene Information

Locus tag: A9601_13921
Symbol:
ID: 4718113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1160134
End bp: 1161261
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 23%
IMG OID: 640079113
Product: putative glycosyl transferase, group 1
Protein accession: YP_001009783
Protein GI: 123968925
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
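
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which is consistent with the listed length) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1160134, 1161261)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both values agree with the Gene Length and Protein Length fields of this record.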


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.69221
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATATG CTGAAGAACC TCATACTGGT CTAACTAGGT TTACTGTAAA CATTTTTAAA 
AATTTAATCA AAAATTCATC TAAAAACTAT TTTTATTATT TATTACTTCC TCCTAAAGAA
TGCAGTAAAC ATTTTATTGA TGACTTTCCC TCTGATTTAA AAAACTTTAA AAAAATATTT
TGGAGACAAA AAAGAGGATT AAAATGGAAA ATACCATTTG TTTTATTTGA TTTAGATATT
TTATTATTAG TTAAAGAAAT TAAACCTAAT TTATTCATAT CTCCTTATAT TGATCCTCCA
TTTATTCCAT TTGTAAAAGT TATCGCAACT ATTCATGATT TAATATTTAT TGAAGTAAAA
GATTATTTTC AGCACTTATC TCTATTAAAA AGATTAGTTG CATATTTTAG AATATTAATA
ACGATATTAA TTTGTGATAA TTTATTAGTG GTCTCATCTG CCACGAAAAA AAAATTAATT
AATAGATTTA ATTGGATCCC AAATAGCTTT AAAAGTAAGA TTAAAAATGC AAGTATTATT
TCTAATGGAA TAGACTTGTT AAGTTTAGAT AAAAAAAAAT ATGTTGAAAT TAAAGAGTTA
ATTAACAAAG ATTTCTTTCT CTATGTAGGA GATAGAAGAC CCCATAAAAA TATTATTTAC
TTAATTAAAC TAGTCAAAGC TATTAATAAA AAATTTTCTA AAAATACTAT TTTAATTTTA
GCTGGATCAA ATAAGTATAA GAATTTAAAG CTTAATAAAT TAATTACTAA AAATAATTCC
TTAGTTCATG AGATTGTAAA TCCTTCAGAT TTAACATTAG ATTTCCTTTA CAGGAACTGT
AAATCATTTT TCTTGATTTC AAAAGAAGAA GGATTTGGTA TACCAGTCAT TGAAGCTGCA
AGTAGAGGCG CTAAGATTGT AATAAGTAAT ATTCCTGCTT TAAGAGAAAT ATCGCCCAAG
CATTCATGTA TTATTAATTT ACGAGAAATT ACTGAAGATG TTAATAAGAT TTCATGTTAT
TTGAAAAATG ATCTAAGACC AAATTCAAAA GAAGTTATCA AAAAATGGAG CTGGCAAAAT
TCCTCTAAAA ATTTGTTTGA ATTAATAAAA ATTGTTTTAG AATCTTAA
 
Protein sequence
MRYAEEPHTG LTRFTVNIFK NLIKNSSKNY FYYLLLPPKE CSKHFIDDFP SDLKNFKKIF 
WRQKRGLKWK IPFVLFDLDI LLLVKEIKPN LFISPYIDPP FIPFVKVIAT IHDLIFIEVK
DYFQHLSLLK RLVAYFRILI TILICDNLLV VSSATKKKLI NRFNWIPNSF KSKIKNASII
SNGIDLLSLD KKKYVEIKEL INKDFFLYVG DRRPHKNIIY LIKLVKAINK KFSKNTILIL
AGSNKYKNLK LNKLITKNNS LVHEIVNPSD LTLDFLYRNC KSFFLISKEE GFGIPVIEAA
SRGAKIVISN IPALREISPK HSCIINLREI TEDVNKISCY LKNDLRPNSK EVIKKWSWQN
SSKNLFELIK IVLES
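
The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the listing. It assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code for elongation codons) and does not handle alternative start codons; the helper names are hypothetical.

```python
from itertools import product

# NCBI lists codons in T, C, A, G order; tables 1 and 11 share these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Unknown codons (for example ones containing N) become 'X';
    a trailing incomplete codon is ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as listed above.
first_row = "ATGAGATATG CTGAAGAACC TCATACTGGT CTAACTAGGT TTACTGTAAA CATTTTTAAA"
print(translate(first_row))  # MRYAEEPHTGLTRFTVNIFK
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.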