Gene NATL1_16521 details


Gene Information

Locus tag: NATL1_16521
Symbol:
ID: 4780947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1345823
End bp: 1346932
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 33%
IMG OID: 640084935
Product: glycosyl transferases group 1
Protein accession: YP_001015474
Protein GI: 124026358
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
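
The length fields above follow from the coordinates. The sketch below shows the arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon (both are assumptions, though they are consistent with the 1110 bp and 369 aa reported in this record). The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1345823, 1346932)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```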


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.524889
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATAAAAA AGCAATTAAA ACTTATTCTT GTAAGTACAC CTATAGGTTA TCTAGGCAGT 
GGCAAAGGTG GAGGAGTTGA ACTTACTATT GTTTCTCTTA TAAAAGGATT AATTTCATTG
GGTCATAAAA TTATTTTAAT TGCACCAAAA GGATCAAAAT TACCTTTCGA AAGTGAGTTC
CTTGAAATAA GATTAATAGA TGGAGTTGAT CAACCTAGTT GGCAGCATCA GAATAGAAAA
GATCCAGTTT TAATTCCTTC TAAAAGTGTC TTACCAAAAT TATGGGAAGA GGTGATTGAT
ATCGCAAATG AATCCGACGC AGTTATTAAT TTTGCATATG ATTGGCTTCC ATTATGGTTA
ACAAAAACAC AATCAATTAA AATATTTCAC TTAATTAGTA TGGGCGCTGA ATCAATAGTA
ATGAAAGAAA TTATTAGTGA AATAAGTGAA TTATCTCCTT TTCGGCTGGC TTTTCATACT
AAAAGACAAT CTAAAGATTA TTTTTTAAAA ACTGATCCAA TTATCGTTGG AAATGGTTTT
GATACTGATG ACTATTTATT CAATAAAAAT GAGAATGGAC CATTAGGTTG GGCTGGAAGA
ATCGCGCCAG AGAAAGGCTT AGAAGATGCA GTAAAAGTTG CGAATAATTT GGGTGAAAAA
TTATTAGTTT GGGGACTCAT AGAAGATAAA GAATATGCAT TAAAAATTGA AAATACCTTC
ACAAAAAAAA TTATTGAATG GAAAGGATTT CTTCCAACGA AGAAATTTCA GGAACAACTA
GGACGATGTA GAGCGTTGAT AAACACGCCT AAATGGAATG AAGCCTACGG CAACGTTATT
GTTGAAGCGA TGGCTTGTGG TGTTCCTGTA ATTGCATATG ATCTGGGAGG ACCAGGGGAA
TTGATCGAAG ATGGATTCAA TGGCTTTTTG GTTAAACCCA ATGATATTGA AGGATTGATG
AAAGCAACAA AATCAATCTC AGAAATCAAA AGAAAAAATT GTAGAGCTTG GTTTGAAAAA
AAAGCCACTA GCAAAGTCTT TGCAGAAAGA GTGGAGAATT GGCTTTATAA AGGCTTAAAT
AAGAAAATCT CAGCAGACTT TAAAGATTAA
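
The GC content field of this record can be checked against the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A/C/G/T bases; the function name is hypothetical, and the spaces that separate the 10-base blocks are ignored.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters so block spacing and line breaks are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence: 4 of 20 bases are G or C.
print(gc_content("GTGATAAAAA AGCAATTAAA"))  # 20.0
```

Passing the full gene sequence as one string gives a value that can be compared with the rounded figure in the GC content field.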
 
Protein sequence
MIKKQLKLIL VSTPIGYLGS GKGGGVELTI VSLIKGLISL GHKIILIAPK GSKLPFESEF 
LEIRLIDGVD QPSWQHQNRK DPVLIPSKSV LPKLWEEVID IANESDAVIN FAYDWLPLWL
TKTQSIKIFH LISMGAESIV MKEIISEISE LSPFRLAFHT KRQSKDYFLK TDPIIVGNGF
DTDDYLFNKN ENGPLGWAGR IAPEKGLEDA VKVANNLGEK LLVWGLIEDK EYALKIENTF
TKKIIEWKGF LPTKKFQEQL GRCRALINTP KWNEAYGNVI VEAMACGVPV IAYDLGGPGE
LIEDGFNGFL VKPNDIEGLM KATKSISEIK RKNCRAWFEK KATSKVFAER VENWLYKGLN
KKISADFKD
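
The protein sequence corresponds to the gene sequence read in codons under translation table 11. The sketch below is a hand-rolled translation, not the tool that produced this record. It assumes that an alternative initiation codon in the first position (here GTG) is read as methionine, and it lists only three of the initiators permitted by table 11. The names `translate`, `CODON_TABLE` and `START_CODONS` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (sense codons of table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Subset of the initiation codons allowed by table 11 (assumption: enough for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("GTGATAAAAA AGCAATTAAA ACTTATTCTT"))  # MIKKQLKLIL
```

The output matches the first block of the protein sequence above. The same function applied to the last line of the gene sequence gives `KKISADFKD`, the final residues of the protein.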