Gene NATL1_04491 details


Gene Information

Locus tag: NATL1_04491
Symbol:
ID: 4779175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 410307
End bp: 411581
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 39%
IMG OID: 640083726
Product: putative glycosyl transferase, group 1
Protein accession: YP_001014278
Protein GI: 124025162
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
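
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is excluded."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(410307, 411581)
    print(length, protein_length_aa(length))
```

Under these assumptions, the coordinates 410307 and 411581 give 1275 bp, and 1275 bp gives 424 aa, matching the Gene Length and Protein Length fields.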


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCCATA TCGCCTGGCT AGGCAAAAAA ACACCTTTTT GCGGGAATGT CTGTTACGGA 
CTTACCACGA CTGAAGAGCT TAGAGAAAGA GGATATCAAA CAAGTTTTAT TCACTTTGAC
AACCCGATGA GAGATGGGAA TAATAAAACC TCCTTGCTAG CCAATGATCC TGATGTAAGT
CTTCCTTATT TAATTAAATC TCAGGTTTAC ACAATTCCTT CTCTAAATGC GCAAAGAGAG
CTTAGAGAAT CGCTCTCAAG ACTGAAACCT GATTTAGTTC ATGCAAGTCT TACTCTTTCT
CCATTGGATT TTCGACTACC TGAGCTTTGT CATCAACTTA ATTTACCTTT GATAGCAACT
TTTCACCCAG CTTTCGATTC AAAGCTCAGA AATCTTACTG CCAATACACA GCAACTTACC
TACCAACTTT ATGCTCCATC ATTAGCTAAG TATGACAAAG TAATTGTTTT TTCAGATTTG
CAAGCAGAGG TTCTCGCAAA ACTAGGAGTA AAAGAGAGCA GACTAGAAGT AATACCTAAT
GGAATTGACA TAAAGAAATG GAACACGCTT AAACCAAATA ATTCACAAAA TGAACTTCAT
TTCGAAATAA AGAAAAAGCT TGGTTCTGAG AGAATATTTA TCTATATGGG CAGAATAGCT
TCTGAAAAAA ATGTTGAAGC TCTACTACGT GCATGGAGAT TTGTCCAGCC AAAAGGATGT
CGATTAGTAA TAGTGGGAGA TGGTCCACTA AGGCCAACAC TCGAAAATCA TTCAATTTTC
AACAAAGAAG ATAATGTTTT TTGGTGGGGA TACGAAGCTG ATCAGAATAA AAGAGTCGCT
CTTCTTCAAA TCGCTGAGGT CTTTTTACTT CCAAGTCTTG TAGAAGGATT ATCTATTGCC
TTGTTGGAAG CTATGGCAAC AGGAACAGCA TGTGTCGCGA CAGATGCTGG AGCAGATGGA
GAAGTGCTTG AAAATGGAGC AGGAATCATA CTTAACACTG AGGGTGTAAC GTCTCAATTA
AGGACTCTTT TACCTGTTTT GCACGATCAG CCAGTGCTTA CCCACGAACT AGGTAGACGT
GCACGCTTAA GAGTAGAAGA AAAATATACA CTTCAACAAA ATATTGACTC ACTTGAAAAT
CTATATGCAA ACGTACTTAG ATCATCTAGG TCGAAACTTC CTCAGCCAAT TGACGACTCT
TCTGCGCTGC CAAAACAACT GCTTCAATTA AGGCGGATCT CAAGCCAGAA AGTTCAAGGT
GTCGAAGAGC ACTAA
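
The gene sequence is printed in space-separated groups of ten bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch with hypothetical helper names; it counts only unambiguous G and C bases and does not reproduce how the database itself computed the reported GC content.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence block here; only the first line is shown.
    first_line = "TTGACCCATA TCGCCTGGCT AGGCAAAAAA ACACCTTTTT GCGGGAATGT CTGTTACGGA"
    print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```

Passing the whole gene sequence block to `gc_percent` allows a comparison with the GC content field of the record.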
 
Protein sequence
MTHIAWLGKK TPFCGNVCYG LTTTEELRER GYQTSFIHFD NPMRDGNNKT SLLANDPDVS 
LPYLIKSQVY TIPSLNAQRE LRESLSRLKP DLVHASLTLS PLDFRLPELC HQLNLPLIAT
FHPAFDSKLR NLTANTQQLT YQLYAPSLAK YDKVIVFSDL QAEVLAKLGV KESRLEVIPN
GIDIKKWNTL KPNNSQNELH FEIKKKLGSE RIFIYMGRIA SEKNVEALLR AWRFVQPKGC
RLVIVGDGPL RPTLENHSIF NKEDNVFWWG YEADQNKRVA LLQIAEVFLL PSLVEGLSIA
LLEAMATGTA CVATDAGADG EVLENGAGII LNTEGVTSQL RTLLPVLHDQ PVLTHELGRR
ARLRVEEKYT LQQNIDSLEN LYANVLRSSR SKLPQPIDDS SALPKQLLQL RRISSQKVQG
VEEH
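
The gene sequence starts with TTG while the protein sequence starts with M. This is consistent with translation table 11, in which TTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The sketch below is a simplified illustration, not the database's own procedure: the names are hypothetical, the codon table is built from the standard NCBI ordering of bases (T, C, A, G), and ambiguous bases are not handled.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (table 11 shares these with table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(block: str) -> str:
    """Translate a CDS, reading an initiation codon in first position as M."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still initiate with Met
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate_cds("TTGACCCATA TCGCCTGGCT AGGCAAAAAA"))
```

Applied to the first 30 bases of the gene sequence, this yields MTHIAWLGKK, the first ten residues of the protein sequence; the final 15 bases (GTCGAAGAGC ACTAA) yield VEEH followed by the stop codon.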