Gene NATL1_21361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21361 
Symbol 
ID4780864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1792732 
End bp1793880 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content40% 
IMG OID640085433 
ProductSqdX 
Protein accessionYP_001015956 
Protein GI124026841 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATAG CTTTCTTTAC AGAAACTTTC CTACCCAAAG TTGATGGAAT AGTCACTCGT 
TTAACTAAAA CAATTCAAAA TCTAGTTGCA TCTGGTGATG AAGTAACCGT TTTTTGTCCA
GAAGGCTGTC CATCAAGCTA TATGGGTGCA AAAGTTGTAG GTGTCCCTGC GATGCCATTA
CCTTTATATC CAGAACTCAA GCTAGGGCTC CCTGGCCCAG GTGTTTCAGA TGAATTAGAA
AACTTTAAAC CTGACTTAAT ACATGTCGTC AACCCTGCGG TGCTTGGATT GGGTGGTATT
TGGCTAGCCA AAACAAACAA TATCCCCCTT GTAGCCAGTT ACCACACTCA TTTGCCAAAG
TATTTAGAAC ACTATGGAAT GGGAATGTTA GAGCCGCTTT TATGGGAATT GTTAAAAGCG
GCTCATAATC AAGCAACTCT AAATCTGTGT ACTTCCACTG CAATGGTGCA AGAACTCTCA
GAAAAAGGAA TTCAAAATAC TGCTTTATGG CAAAGAGGAG TGGATACAGA TATTTTCAAA
CCAGAACTTA GAGACGAAGA AATGAGAAAG CGTCTTTTAG GAAGTTTTAG CGATGAAGGA
TCTCTATTGA TATATGTGGG AAGACTCTCA GCAGAAAAAC AAATTGAAAG AATTAAACCT
GTACTTGAAG CACTTCCAAG CACTCGACTA GCTCTTGTTG GAGATGGGCC ATATAGACAA
CAATTAGAAA AAATTTTCCA AGGAACCTCA ACTACCTTTG TGGGATATTT AAGTGGGAAT
GAACTAGCAA GTGCTTATGC ATCCGGTGAT GCCTTTTTAT TCCCCTCAAG TACAGAGACC
CTAGGATTAG TTCTGTTGGA GGCAATGGCT GCTGGATGCC CAGTCGTAGG AGCGAATAAA
GGTGGAATAC CAGATATAAT TTCAGATGGC GAAAATGGAT GTTTATACAA TCCTGATGGA
GAAAATGATG GGGCTTTAAG TTTAATTGAA GCTACAAAAA AATTATTGGG CAACGAAACA
GAACGCACAT CTATGAGAAA AGCAGCTCGT TCAGAAGCTG AGAGATGGGG ATGGGCAGGC
GCTACAAAAC AATTGAAAAG TTATTACGAA GACGTACTAG ACAAAAAACG ATCAAATATT
GCTGCTTAG
 
Protein sequence
MKIAFFTETF LPKVDGIVTR LTKTIQNLVA SGDEVTVFCP EGCPSSYMGA KVVGVPAMPL 
PLYPELKLGL PGPGVSDELE NFKPDLIHVV NPAVLGLGGI WLAKTNNIPL VASYHTHLPK
YLEHYGMGML EPLLWELLKA AHNQATLNLC TSTAMVQELS EKGIQNTALW QRGVDTDIFK
PELRDEEMRK RLLGSFSDEG SLLIYVGRLS AEKQIERIKP VLEALPSTRL ALVGDGPYRQ
QLEKIFQGTS TTFVGYLSGN ELASAYASGD AFLFPSSTET LGLVLLEAMA AGCPVVGANK
GGIPDIISDG ENGCLYNPDG ENDGALSLIE ATKKLLGNET ERTSMRKAAR SEAERWGWAG
ATKQLKSYYE DVLDKKRSNI AA