Gene NATL1_16511 details


Gene Information

Locus tag: NATL1_16511
Symbol:
ID: 4780142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1343984
End bp: 1345822
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 33%
IMG OID: 640084934
Product: glycosyltransferase
Protein accession: YP_001015473
Protein GI: 124026357
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
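
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1343984, 1345822)
print(length_bp, protein_length(length_bp))  # prints: 1839 612
```

Both values match the Gene Length and Protein Length fields of this record.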


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.760538
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.297105
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAAGCT TCACTAAACC TTTTTATCCA ACTTCAATAC AAAGAAGACG TGGTCTTCTT 
TGTATTTTAT TTATTGGATT AATAATTTTT GTTTGGCAAT TAGGGACAAC AGGTTTAGTT
GATGAAACAC CACCTTTGTT TGCTGCAGCC AGTCGAGCAA TGGCTGAAAC GGGTAATTGG
TTTACTCCTC AAGTAAATGG ATTACCTCGA TTTGACAAGC CTCCTTTGGT TTATTGGTTG
ATGGGACTGG GATTTTCAAT CCCAGGGAAT ATTTCTTGGG ACCCTTTAGG GACATGGGCT
GCAAGGCTTC CTTCTGCATT ATCAACAATT TTTTTAATGC TCGTTTTAGG GGACACAATG
ATGAAATATC CCCAAGAAGA AAATAATTTC AATAGAAGAA CCGCAGTTAT TACTGCGCTG
GTATTTGCGT TGTCGCCTTT GGTAATAATT TGGGGAAGAA TTGCTGTAAG TGATGCACTC
CTTTGCAGCA CTTTAGGAAT TTGCTTACTT CTTAAATGGA GAAGATTCGC TAATCCAGAG
GGAGAAGCTT GGTGGTTGTC TTGGATTTTT TTGTCATTTG CAGTTTTAAC TAAAGGTCCT
GTCGCTTTGG TTTTATCAGC ATTAACAATT TTATTTTTTT CCTTATTTCA AAATAATTTG
CTCGGAATTT TAAAAATATT GAAAGTAATT CCAGGACTTT TTCTTGTTTT TATTATTAGT
TTTCCTTGGT ATTTAATTGA ATTATTAATA GAGGGAAAAC CCTTCTTAGA TAGTTTTTTT
GGGTATCATA ATCTTCAAAG ATTCACCTCA GTAGTAAATT CTCATGGGGA GCCTTGGTGG
TTTTTTTTAA TAGTATTATG TATTGCTTCT TTACCACTTA CTCCATTTTT ATTAATAAGT
ATTTGGAAGA ATTTTTACAA TGTATCTAAA TGGTCTAGAA GAGTTATTAA AAAACCAGAA
AAGTCATTAT TTGAGTTTTC ATTTTTTTGG TTAATAGCTG TATTTGTTTT CTTTACTATT
TCAGCTACCA AGCTACCGAG CTATTGGCTT CCTGCTACAC CTGCAGCATC AGTTTTAATA
TCACTTTCCT TAACCAATAA TGTTAAAGAG GAATTCTTAA AATCTTTGGC TTGGAAATTT
ACAATATTTT TATCAATACT CTTTTCCATA TTTCTCTTCT CTTCAAAGCT ATGGATTCAG
CTTCTTAACG ATCCTGAGAT TCCAAACTTC TCAGAAGAAT TAACCAATAG CTTTTTAATT
GAAAAGTCTG GTTTTATTTT TCTTTTACTT GCTGTTCTAG GAATAATATT TTCCTCTAGA
AAAATATATG GAAAACTATT GATTTTGCAA ATCCCTATAG CTTTATCTCA TTTCCTGATT
GTGTTACCAA CTTTTGATTT GGCAGATAGA TTAAGACAAC TCCCCTTAAG AGAAGCCTCA
GAATTACTTT TAAATTCCCA AAATAGAAAC GAGCCTTTAG TAATGGTAGG AGCTATGAAA
CCCTCAATTC ATTTTTATAC TAATCAAGTC ATTGTTTTTG AAGGTAGATC TAAAAATGCT
TTTGTAAATG TTTCAGATCG ACTTAAAAAC GAGAAGAGAA GAGGCTGGAA AGGAAGGCCA
ATATATGGAT CCAATGGCTC AGAAACTACA CTTTTACTTA TTGATAAAAG ATCAGTGGAA
AAATCTTATT GGCAAGGACT TAATCCAGAA GTCTTAGGTA ATTTTGGTGT TTATAGCGTT
TGGAGACTTG ATAGAGAAAT TATTGAAAGA CGAGCTAAAT TGTTGAAAAT AGATGGTGTC
ACTTCTACTT GGAAAAATCC TAGACCAGAG AGATTTTAA
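
The GC content field can be recomputed from a gene sequence laid out in space-separated blocks like the one above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper, not the method the database itself uses.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence: 3 of its 10 bases are G or C.
print(gc_content("TTGAAAAGCT"))  # prints: 0.3
```

Passing the full gene sequence, line breaks and spaces included, gives the value to compare against the reported GC content.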
 
Protein sequence
MKSFTKPFYP TSIQRRRGLL CILFIGLIIF VWQLGTTGLV DETPPLFAAA SRAMAETGNW 
FTPQVNGLPR FDKPPLVYWL MGLGFSIPGN ISWDPLGTWA ARLPSALSTI FLMLVLGDTM
MKYPQEENNF NRRTAVITAL VFALSPLVII WGRIAVSDAL LCSTLGICLL LKWRRFANPE
GEAWWLSWIF LSFAVLTKGP VALVLSALTI LFFSLFQNNL LGILKILKVI PGLFLVFIIS
FPWYLIELLI EGKPFLDSFF GYHNLQRFTS VVNSHGEPWW FFLIVLCIAS LPLTPFLLIS
IWKNFYNVSK WSRRVIKKPE KSLFEFSFFW LIAVFVFFTI SATKLPSYWL PATPAASVLI
SLSLTNNVKE EFLKSLAWKF TIFLSILFSI FLFSSKLWIQ LLNDPEIPNF SEELTNSFLI
EKSGFIFLLL AVLGIIFSSR KIYGKLLILQ IPIALSHFLI VLPTFDLADR LRQLPLREAS
ELLLNSQNRN EPLVMVGAMK PSIHFYTNQV IVFEGRSKNA FVNVSDRLKN EKRRGWKGRP
IYGSNGSETT LLLIDKRSVE KSYWQGLNPE VLGNFGVYSV WRLDREIIER RAKLLKIDGV
TSTWKNPRPE RF
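
The protein sequence begins with M although the gene sequence begins with TTG, which encodes leucine inside a reading frame. Under translation table 11, TTG is an accepted initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this with the standard codon layout in TCAG order. The `START_CODONS` set is an assumption based on the NCBI definition of table 11, and `translate` is a hypothetical helper rather than a library function.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; tables 1 and 11 share this mapping.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiation codons for table 11; any of them is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first in-frame stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
            continue
        residue = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("TTGAAAAGCT TCACTAAACC TTTTTATCCA"))  # prints: MKSFTKPFYP
```

The output matches the first ten residues of the protein sequence above.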