Gene NATL1_01671 details

Gene Information

Locus tag: NATL1_01671
Symbol:
ID: 4780822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 160198
End bp: 161778
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 33%
IMG OID: 640083431
Product: RND family outer membrane efflux protein
Protein accession: YP_001013996
Protein GI: 124024880
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID:
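
The start, end, gene length, and protein length fields above are consistent with each other. A minimal sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1   # both end positions are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1    # the final codon is the stop codon
    return gene_len, protein_len


# Values copied from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(160198, 161778)
print(gene_len, protein_len)  # 1581 526
```

The result matches the Gene Length (1581 bp) and Protein Length (526 aa) fields.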


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGAGAG TGAAGAGAAA GTTTCTAATT GTTGCAGGTT TATTTATATC TGGGCTAAAT 
CCTTTGTGGG CTACAAGTTC GCAAAAAATA ATTTCCGATA TAAAGATAAA AGGAAACTCA
AGTAAAAGCC TAAGTCAAAA TCAAAAACAA CAGCGGATCT TATATGAATT AAATGCCCCT
GAAGATCTTT TTTTACCCTC TAGATCACGC GAAGTATTAG TAAAAACTTA TCAAAAAGTT
AACCTTGATC AGTTAGAAAA TTTACTTATA AATAACAACC GAACAATTAA AATCTACTTA
GAAAGAGTTG AGCAAGCCAA ATCAATATTA AAAAGTTCTT TATCCTCATG GTACCCAACA
TTAAACCTAA CAGCTAATGG CATTCCCCAA TATTTTGAAT CTAATAACTA TAATGAATCA
AGCGTAATAC AAGATACTTC GAGTAAACAA TGGAGTTCCT CTATCTCCGC TCAATTAAAA
TGGGATTTAA TTAATCCTGC AAGAGTCCCA GAGATAGCAT CAGCTAGAGA TAGTTTTGAA
AAGTCAAAAT ATTCTTACGC AATAATTTTA AGAGATTTAA AATTAGAGGC AAAAAAACGT
TACTTCAATT TGCAAAAAGC CAATGAGGAA ATAGAAGTAG CAAAGAAATC AATTGAATCC
TCGACTATTG GGTTAAGAGA CGCAGAAATT AGATTTGAAT CAGGTATTGG TACGAAATTA
GAAGTTCTAG AAGCTAAAAC TCAATTAGCT AGAGATCAGC AATTGTTTAA TATTAAATCT
GGTGATCAGA AAATTGGTCA AAGATCTCTT GCTGAAATAC TTAATTTCCC AGAGGATGTT
ACACCATTAA TTGGTTCAAA AACTCAAGTT ACAGGTATAT GGGATTTATC ATTAGAGGAT
AGTATTATAG CTGCTTATAA TTCAAGAGAA GAACTCGAAA GTATCCTACT AGACATATCA
ATTAATAATA GTAATGCAAA TGCTGCACTT GCTGCTAGCC AACCAAAATT AAGCATCGTA
AATACATCGA CCTCTTCATT TGCGAAAGGT GAGTTAAATC AAATATCTCC AAATACCAGC
AACACATCCT CCAATTTTTC TAACACCATT GGGCTCAATG CAACATGGTT TATTTTTGAT
GGAGGTAATT CAAGATCTTT GTATAATTAC AATAAAAGTA AAGCAAAAGA AGCAAAACTA
AATTTTGCCG CAAGAAGAGC CCAAATCAGA CAGGAAGTTG AACAAGTATT CTTCAAACTA
GACTCGGCTA AACTAAATAT TTCTGCTTCG TATACAGAAG TTTTGTCTGC AAGAGAGTCT
TTAAGACTTG CAAAACTTAG ATACAAATCA GGTATTACTT CACAACGAGA AGTGGTAAAC
AACCAAAGAG ATTTAACTGA TTCCGAGGTT CGTTATATTA TTTCCGTCAC TAGCTATAAC
ACTCTATTAG CTGACTTAAG TAGACAAACG GGTTTAGATA ACATCAAACC ATGTGATATC
AAAGTCAATC AACAAAATCA AAGTGACATA GATAGCAAAT CACTCTATGA AACAAATTTA
ATTCCTCTAT GTCAGCTATA G
 
Protein sequence
MRRVKRKFLI VAGLFISGLN PLWATSSQKI ISDIKIKGNS SKSLSQNQKQ QRILYELNAP 
EDLFLPSRSR EVLVKTYQKV NLDQLENLLI NNNRTIKIYL ERVEQAKSIL KSSLSSWYPT
LNLTANGIPQ YFESNNYNES SVIQDTSSKQ WSSSISAQLK WDLINPARVP EIASARDSFE
KSKYSYAIIL RDLKLEAKKR YFNLQKANEE IEVAKKSIES STIGLRDAEI RFESGIGTKL
EVLEAKTQLA RDQQLFNIKS GDQKIGQRSL AEILNFPEDV TPLIGSKTQV TGIWDLSLED
SIIAAYNSRE ELESILLDIS INNSNANAAL AASQPKLSIV NTSTSSFAKG ELNQISPNTS
NTSSNFSNTI GLNATWFIFD GGNSRSLYNY NKSKAKEAKL NFAARRAQIR QEVEQVFFKL
DSAKLNISAS YTEVLSARES LRLAKLRYKS GITSQREVVN NQRDLTDSEV RYIISVTSYN
TLLADLSRQT GLDNIKPCDI KVNQQNQSDI DSKSLYETNL IPLCQL
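
The sequences above are printed in space-separated blocks of 10 residues, so they need the whitespace removed before any downstream use. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAGGAGAG TGAAGAGAAA GTTTCTAATT GTTGCAGGTT TATTTATATC TGGGCTAAAT"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(seq[:3])                       # ATG (start codon)
print(round(gc_fraction(seq), 2))    # GC fraction of this line only
```

Running the same two helpers over the full gene sequence should give a length equal to the Gene Length field and a GC fraction that can be checked against the reported GC content.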