Gene NATL1_21351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21351 
SymbolsqdB 
ID4780871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1791502 
End bp1792695 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content39% 
IMG OID640085432 
Productsulfolipid (UDP-sulfoquinovose) biosynthesis protein 
Protein accessionYP_001015955 
Protein GI124026840 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGTTT TCGTTCTTGG TGGTGACGGC TTCTGCGGAT GGCCTTGTGC AGTAAATCTT 
GCTGACAAAG GTCATGATGT ATTCATCGTG GATAATCTGA GTCGTCGAAA AATCGATATA
GACCTTGAAG TTGAATCCTT AACTCCAATT ACAAGTATTG GAGAAAGGAT TAAAGCTTGG
TCTGAGATAG GCGGAAAACC TATTCAATTT ATTCATTTAG ACCTTGCCTC TGAATATCAA
AAGCTTTTAG ATCTGTTAAT CGAGGAAAAG CCGGATTCAA TAATTCATTT TGCTGAACAA
CGTGCTGCTC CATATTCCAT GAAAAGCAGC GCAACCAAGA GATATACAGT TGATAACAAT
GTCAATGGTA CTCATAATCT CCTTGCCGCA ATTGTTGAAT CTCAATTAGA TATTCACATT
GTTCATCTTG GGACAATGGG AGTTTATGGC TATGGATCTC ATAGAGGCGC GACCATTCCT
GAAGGTTACT TAAAAGTGGA AGTACCCCAA CCAGATGGCA GTCGTTTTGA GGAAGAAATC
CTTCATCCAG CAAGTCCTGG CAGCGTTTAT CACATGACAA AAACGCTTGA TCAATTGCTC
TTTCTTTACT ACAACAAAAA TGACCAGATC AGAATCACTG ATCTTCATCA AGGAATTGTA
TGGGGAACGA ATACTGATGT CACAAGTCGT GACCCAAGAT TGACTAATCG ATTTGACTAT
GACGGTGATT ATGGAACAGT GTTAAATCGA TTTTTAATGC AGGCGGCCAT TGGATACCCA
TTAACTGTTC ATGGTACTGG TGGACAAACA AGAGCTTTTA TACATATTCA AGATTCAGTA
AAATGTGTTC AATTGGCACT CGAGAACCCT CCTGAGAAAG GTGAAAGGGT CAAAATTTTC
AACCAAATGA CGGAAAGTCA CCAAGTTGGG GAATTAGCTA AAAAAGTAGC CTCTCTCACT
GGAGCAAAAA TTAATTATCT GCCAAACCCA AGAAACGAAG CGGTTGAAAA TGATTTGATA
GTTGATAATA GATGTTTTAT TGAGCTTGGA TTAGATCCGA CAACTCTCGA TAATGGACTT
TTAGAAGAAG TGGTTAATGT CGCGAAAAAA TTCTCCAATC GCTGTGATTT AAAACGAATA
CCTTGTGTAT CTGCATGGAC ATCAACCCAA GCAAAAGCAA TCCATAAATC CTAA
 
Protein sequence
MKVFVLGGDG FCGWPCAVNL ADKGHDVFIV DNLSRRKIDI DLEVESLTPI TSIGERIKAW 
SEIGGKPIQF IHLDLASEYQ KLLDLLIEEK PDSIIHFAEQ RAAPYSMKSS ATKRYTVDNN
VNGTHNLLAA IVESQLDIHI VHLGTMGVYG YGSHRGATIP EGYLKVEVPQ PDGSRFEEEI
LHPASPGSVY HMTKTLDQLL FLYYNKNDQI RITDLHQGIV WGTNTDVTSR DPRLTNRFDY
DGDYGTVLNR FLMQAAIGYP LTVHGTGGQT RAFIHIQDSV KCVQLALENP PEKGERVKIF
NQMTESHQVG ELAKKVASLT GAKINYLPNP RNEAVENDLI VDNRCFIELG LDPTTLDNGL
LEEVVNVAKK FSNRCDLKRI PCVSAWTSTQ AKAIHKS