Gene NATL1_17391 details


Gene Information

Locus tag: NATL1_17391
Symbol:
ID: 4781269
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1424368
End bp: 1425591
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 37%
IMG OID: 640085026
Product: glycosyltransferase
Protein accession: YP_001015559
Protein GI: 124026444
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
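
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function name cds_lengths is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1424368, 1425591)
print(gene_len, "bp,", protein_len, "aa")
```

With the values from this record, the result agrees with the listed Gene Length (1224 bp) and Protein Length (407 aa).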


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGAGCGTAA ATTCATCTTT AGATCTTCCA AGCAATATTG CATTAGTACA TGAGTGGTTT 
ACTCCAAGAT CGACAGGAGG TGCTGAGAAT GTTGTTCAGG TGATTGATGA TTTGTTATCT
GAAATTGCAT CGCAGCCAGA ACTGTTTTCT TTGGTTAATG AAGAGAGGTT AGAAAAAAAT
AGCTGGTTAT TTGATCGAAA AGTACATACT AGCTTTATTC AAAATTTACC ATTCGGGATC
TCACATGTTC AACAATATTT ACCCCTTTTG CCTTTTGCAA TTGAGCAACT CGATTTTGAG
GGATATCCAT TGATTTTAAG CAGTAATCAT CTTGTCGCTA AGGGAATTTT GACATCGCCT
GATCAACTTC ATATTAGTTA TGTCCATACA CCTGTTAGAT ACGCTTGGGA TCAAATGAAT
ATATATTTGA AAAGATCTTT TTTAAGAAAA ATTGGTTTAG GGCCGATAAT TAGATGGCAA
TTGCATACTT TGAGGCAATG GGATCAATTA AGTAGCTCAA GAGTAGATTA TCTGTTGGCC
AATTCTAATT TTACGGCAAA AAGGATTTGG AAGTATTGGA GAAGGCGTTC AGAGGTTCTG
CATCCACCTG TTGATGTAAA TCGTTTTGAA TGGAATAGGC CTAGGGAAGA TTTCTATTTA
AGTGTCTGTA GATTGGTTCC TAATAAAAGG GTTGATTTAC TTGTTAGGGC TTTTAATAGG
CTTAAATTGC CTTTAATAGT TGTTGGCGAC GGAGTGGAAA AGGAATATTT AAAAAAACTT
GCAGGTCCAA CTGTTCAAAT TATTGGTTTT CAAAGCAAAG AGAAGATTGA AAGTCTAATG
AGCAGATGTA GAGCCTTTGT CTATGCTGGT ATTGAGGATT TTGGAATAGC TCCTGTGGAG
GCAATGGCCT CAGGTGCTCC TGTGATTGCT TTTGGTAAGG GAGGGGTTTT AGATACAGTT
AAATGTTTTC ATTCTGATTC TGATAAAGGA GCAACTGGCC TTTTGTTCCC TTCTCAGACA
GTAAAGTCCC TGGTTGAAGC AATCGAATTT TTCAAGCAAA AGCAACTTTG GAGAGATTTA
AAACCTGAGT TCATTAGAGA TTGGAGCAAT TCTTTTTCTC AAGATTCTTT TAAAGATAAA
TTTGCCAAAA CCATAAATAG AGCTTGGAGG GAGCATGTCA ATTCTTGTGA CATTGCTACT
AGTGACCTTA CTTCTTCATC ATAA
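
The GC content listed under Gene Information can presumably be recomputed from the gene sequence above. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the full gene sequence (the record does not say how it was derived); gc_percent is a hypothetical helper, and only the first sequence line is used as sample input, so its result will differ from the whole-gene figure.

```python
def gc_percent(seq):
    """Percentage of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into blocks of ten."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content reported in the record.
first_line = "TTGAGCGTAA ATTCATCTTT AGATCTTCCA AGCAATATTG CATTAGTACA TGAGTGGTTT"
print(round(gc_percent(first_line), 1))
```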
 
Protein sequence
MSVNSSLDLP SNIALVHEWF TPRSTGGAEN VVQVIDDLLS EIASQPELFS LVNEERLEKN 
SWLFDRKVHT SFIQNLPFGI SHVQQYLPLL PFAIEQLDFE GYPLILSSNH LVAKGILTSP
DQLHISYVHT PVRYAWDQMN IYLKRSFLRK IGLGPIIRWQ LHTLRQWDQL SSSRVDYLLA
NSNFTAKRIW KYWRRRSEVL HPPVDVNRFE WNRPREDFYL SVCRLVPNKR VDLLVRAFNR
LKLPLIVVGD GVEKEYLKKL AGPTVQIIGF QSKEKIESLM SRCRAFVYAG IEDFGIAPVE
AMASGAPVIA FGKGGVLDTV KCFHSDSDKG ATGLLFPSQT VKSLVEAIEF FKQKQLWRDL
KPEFIRDWSN SFSQDSFKDK FAKTINRAWR EHVNSCDIAT SDLTSSS
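
The protein sequence begins with M although the gene sequence begins with TTG, which normally encodes leucine. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this with a hand-built codon table rather than a bioinformatics library; translate_cds is a hypothetical helper, and it assumes the input starts at the first base of the CDS.

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11, in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Start codons permitted by table 11; each is read as Met when initiating.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS (spaces allowed) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiating codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = translate_cds("TTGAGCGTAA ATTCATCTTT AGATCTTCCA")
print(prefix)
```

Applied to the first 30 bases, this gives the same ten residues that open the protein sequence; passing the complete gene sequence should reproduce the full 407-residue protein.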