Gene NATL1_08711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08711 
Symbol 
ID4779610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp807555 
End bp808775 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content33% 
IMG OID640084146 
Producthypothetical protein 
Protein accessionYP_001014694 
Protein GI124025578 
COG category 
COG ID 
TIGRFAM ID[TIGR03573] N-acetyl sugar amidotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.484525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000933108 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGAATCAC TTTATGGAAC ACCTGAAATA ATTTGTTTTT GCAAAAATTG CGTAATATCA 
AATCAAAGAC CTCGATCAAC AATTGAGTTT AAACATACAA GTTCTGACAA GAAACAAACT
ATGGGCTTTG ATGAAAATGG TATTTGTGAT GCGTGTAAAT ACAGTCAAGT AAAGGCAAAC
TTAATAGACT GGAAACTTAG AGAAAGCAAA TTAATTGAAT TATTAGATAA ATACAGAAAA
AACAATGGTC AATATGATGT TGTAGTACCA GGCAGTGGAG GTAAGGACAG TGCTTTTACA
TCTCACATAC TAAAATACAA ATATGGAATG AATCCGCTAA CTGTAACTTG GGCGCCACAT
TTATACACAG ATATAGGATG GAAAAACTTC ACAAACTGGT CCCATATTGG CGGACACGAT
AATTTATTAT TCACCCCTAA TGGAAAATTA CATAGACACC TTACTCGATT AGCATTTTTA
AACTTATTAC ATCCTTTTCA GCCTTTTATT GTTGGTCAAA GAATCATTGG TCCATTACTA
GCAGCCAAAC ATGGTATCAG TTTAGTAATG TATGGAGAAA ATCAAGCTGA ATATGGAAAT
GATCCAAATG AAAACTTTGA TCCAACAATG GACCAAAAGT TCTTTACAGA GGGTAATCCT
CTAGAGATGA AACTTGGTGG TATTTCAATC AAGGATATAA TTAAAAGTAG TGAATTCACG
CTTCAAGATT TTTCACCTTA TATTCCTCCA TCTGCTGAAT TCTTAGAGAA ACAAAAGGTA
GTGGTTCATT ACCTAGGATA TTATCTCGAA TGGGATCCAC AAGAATGTTA TTACTACGCT
GTTGAAAATA CAGGCTTTGA GGCGAATACG GAAAGAACAC CTGGCACATA TTCTAAATAT
TCCAGTATTG ATGACAAAAT TGATATGTTT CACTATTACA CAACATACAT AAAATTTGGT
ATTGGTAGGG CTACCTATGA TGCATCACAA GAAATACGAA ATGGGAAAAT AACTAGAGAA
GAGGGTGTGC ATTTAGTAAA TAAGTATGAT GCAGAATTTC CAGAAAAATA CTTCAAAGAT
TTCTTAGAAT ATATTGATAT AGATCGAGAA AAATTTATAG AGACAGTAGA TAATGCCAGG
TCACCTCACC TTTGGGAGAA AGTAAATAAT GAATGGAAAT TAAGGTATAA AGTTAGTAAT
AAAAACCATT TTGATTCGTA G
 
Protein sequence
MESLYGTPEI ICFCKNCVIS NQRPRSTIEF KHTSSDKKQT MGFDENGICD ACKYSQVKAN 
LIDWKLRESK LIELLDKYRK NNGQYDVVVP GSGGKDSAFT SHILKYKYGM NPLTVTWAPH
LYTDIGWKNF TNWSHIGGHD NLLFTPNGKL HRHLTRLAFL NLLHPFQPFI VGQRIIGPLL
AAKHGISLVM YGENQAEYGN DPNENFDPTM DQKFFTEGNP LEMKLGGISI KDIIKSSEFT
LQDFSPYIPP SAEFLEKQKV VVHYLGYYLE WDPQECYYYA VENTGFEANT ERTPGTYSKY
SSIDDKIDMF HYYTTYIKFG IGRATYDASQ EIRNGKITRE EGVHLVNKYD AEFPEKYFKD
FLEYIDIDRE KFIETVDNAR SPHLWEKVNN EWKLRYKVSN KNHFDS