Gene NATL1_21281 details


Gene Information

Locus tag: NATL1_21281
Symbol:
ID: 4780909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1786081
End bp: 1787397
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 36%
IMG OID: 640085425
Product: glycosyl transferase family protein
Protein accession: YP_001015948
Protein GI: 124026833
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
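
The coordinates and lengths in this record agree with one another, which a short Python sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon that is
    counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Start and end positions taken from this record.
print(cds_lengths(1786081, 1787397))  # (1317, 438)
```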


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.133481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTAG CCAAAATCAG TGGAGAAAAC CGACGCATTA AATCAATATT GTTTTTGATT 
TGCTGCGCTT TGGCAGGAAT TTTTCCTCAT CTGATGGCGC CTAGCCAGAA CTTATTCCCT
TCAATTACTT TGGCTGTCTT GCTGGGTGGA TATGGATTAA GAGTTGTTTT ACGAGATCGC
AGGGACAATT ATCAATTGCA AAACGAGTAT CTAATTAAGG GAGAAAAATT TGTTGAGACT
CTTCCCAAAG TAGATGTGTT GGTGGCAGCA CGTGATGAAG AAAATGTAAT AGAGCGATTA
GTTTCACGAC TTTTATCAAT TGAATATCCA GAGGAGAAGA TTTCTTTATG GATAATTGAT
GATGGAAGTC AAGATCGAAC ACCTGATTTA TTAAAAAAAT TAGGTACAAG CTTTTCTAGG
ATTAATGTTT TAAGTCGCCC TTTGATGAGT GGTGGTGGAA AATCAGGAGC TCTGAATTCT
GTATTGAATA AAACTGATGG AGAGTGGTTA TTTATTCTTG ATGCTGATGC ACAACTACAG
AGCAATGTAT TACTTAGAGC AATAGCTCTT GCTTTGCATG GTGGATGGTC TGCGGTGCAG
TTAAGAAAAT CAGTTGTAAA TTGTGAGTTG AATCAAATTG CTTCATATCA AGCAATGGAG
ATGGCAATGG ATGCCGTTAT TCAAAGGGGC AGACTTGCTA GTGGGGGTGT TTCGGAGTTA
AGAGGTAATG GACAATTAAT TAATAGAAAA GTTCTTGAAT GTTGTGGAGG CTTTAATGAA
CAAACTATCA CTGACGATTT AGATTTAAGT TTCCGGTTTT TATTAACACG ATCGCCAATA
GTAATAATGT GGGATCCACC TATTCAAGAG GAAGCAGTGG AATCATTGTC GGCTTTATTC
CGACAAAGAA AAAGATGGGC CGAGGGAGGT TTGCAAAGAT TTTTTGATTA CTGGCCCTTA
TTGATCTCAA ATCGACTTTC AAAATTTAAA AAAATAGATT TATCTTGCTT TTTCTTATTG
CAATACGTTT TACCGGTTGT ATCATTTATA GATTTTATTG TATCAATTAT TTTATTTGAA
ACGCCACTTT ATTGGCCACT ATCAATAGTT GCATTTGGTA TTTCTAGTTT AGCTTTTTGG
AAAGGATGTT CTCAAAATAG CGAAGGTCCA AGATTACCAT CACCTAATTT TATTAATATA
CTTGGAGCAA CAATTTACCT TGCACATTGG TTTATAGTTA TACCCTTTAT AGCAGTTAAG
ATGTCATTAT TTCGTAAGAC GTTGATTTGG GAAAAAACCG ATCATATTGG TGCTTGA
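
A GC content figure like the one in this record can be computed from a sequence printed in the 10-base block layout used above. The following is a minimal Python sketch, not the database's own method. The function name `gc_content` is hypothetical, and the example covers only the first row of the gene sequence, so its result differs from the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row (60 bases) of the gene sequence in this record: 20 of 60 are G or C.
first_row = "ATGGCTTTAG CCAAAATCAG TGGAGAAAAC CGACGCATTA AATCAATATT GTTTTTGATT"
print(round(gc_content(first_row), 1))  # 33.3
```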
 
Protein sequence
MALAKISGEN RRIKSILFLI CCALAGIFPH LMAPSQNLFP SITLAVLLGG YGLRVVLRDR 
RDNYQLQNEY LIKGEKFVET LPKVDVLVAA RDEENVIERL VSRLLSIEYP EEKISLWIID
DGSQDRTPDL LKKLGTSFSR INVLSRPLMS GGGKSGALNS VLNKTDGEWL FILDADAQLQ
SNVLLRAIAL ALHGGWSAVQ LRKSVVNCEL NQIASYQAME MAMDAVIQRG RLASGGVSEL
RGNGQLINRK VLECCGGFNE QTITDDLDLS FRFLLTRSPI VIMWDPPIQE EAVESLSALF
RQRKRWAEGG LQRFFDYWPL LISNRLSKFK KIDLSCFFLL QYVLPVVSFI DFIVSIILFE
TPLYWPLSIV AFGISSLAFW KGCSQNSEGP RLPSPNFINI LGATIYLAHW FIVIPFIAVK
MSLFRKTLIW EKTDHIGA
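
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal Python sketch under stated assumptions: translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the set of permitted start codons, which does not matter here because this gene starts with ATG), and translation stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout). Table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Whitespace between 10-base blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the first row of the gene sequence in this record.
print(translate("ATGGCTTTAG CCAAAATCAG TGGAGAAAAC"))  # MALAKISGEN

# Last row of the gene sequence; it begins on a codon boundary because the
# 1260 bases before it are a multiple of 3.
print(translate("ATGTCATTAT TTCGTAAGAC GTTGATTTGG GAAAAAACCG ATCATATTGG TGCTTGA"))
# MSLFRKTLIWEKTDHIGA
```

Both outputs match the corresponding stretches of the protein sequence above once the spaces between 10-residue blocks are removed.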