Gene Information |
Locus tag | NATL1_21761 |
Symbol | |
ID | 4780327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1832634 |
End bp | 1833878 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640085474 |
Product | major facilitator superfamily multidrug-efflux transporter |
Protein accession | YP_001015996 |
Protein GI | 124026881 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
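The coordinate and length fields above are related by simple arithmetic, which can be sketched as a consistency check. This is a minimal sketch that assumes Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_len = gene_length_bp(1832634, 1833878)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # prints: 1245 414
```

Both results agree with the Gene Length and Protein Length fields listed in the record.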
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.407538 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAGAAGGC TAAAAATTCC CACCCTTTTA GGAGCTTTCA TCACACTTCT AGATGATCGA TTAGGCGAAA CCATTGTTTT ACCTTTATTA CCTTTTTTAT TAGAACAATT CACGACAAGC GCGACGACTC TTGGTTTTTT AACTGGAACT TATGCGATAT CTCAGTTTGC TGCAGCCCCA CTAATTGGAG CTATGAGTGA TCGTTTCGGT CGTAAGCCAA TCATGATCAC ATGTGTATCT GGTTCAGTAA TAGGAATATG TCTATTTGCA TTAACTGTAA GCCTAAATTG GGATAATTAT TTACCTTTAT GGGCCTCAAC TTTACCTTTA TCTTTACTAT TTTTAGCCAG AATAATTGAT GGTATAAGTG GTGGTACCGC AGCTACTGCT ACTACAATAC TTGCAGATAT ATCAACTCCG GAAAATCGCG CAAAAACCTT TGGATTAATT GGAGTAGCGT TTGGTTTAGG TTTTATTCTT GGGCCAGGAT TAGGAACAGC TCTTGCTAAA TTTAGTGTTA CTTTACCGGT ATGGGTGGCC AGCGGATTTG CAATATTTAA TCTTATTTTT GTAATTTGGT TTCTACCGGA AACACTGCCC AAAAACAAAA GAAATTTACT ACCAAGAAAA AGAGATTTGA ATCCAATTAG TCAGCTACTA GTTGTATTTA AAAACCCCTT AGCTAGAAGA CTTTGCTTAT CGTTCTTTGT TTTCTTTATG GCATTTAATG GCTTTACAGC TGTTTTAGTC CTTTATTTAA AAGAAAAATT TGGATGGAGT CCTGAATTAT GTAGTGCTGC TTTTATTGTC GTTGGAGTTA TTGCGATGAT TGTTCAAGGA GGCCTAATTG GTCCTCTTGT AAAAAGATTT GGGGAGTCGA GATTAACTTT TGCTGGTATT GGCTTTGTAA TGACAGGATG CATTCTTTTA ACGCTCGCCA ATATAGACAC TTCAATTCCT CTTGTATTTT CTGGCGTCGC AATACTTGCA ATGGGAACTG GACTAGTAAC TCCTAGTTTA AGAGCACTAA TTTCAAGAAG ACTAAGTTCT ATTGGTCAAG GAGCAGTATT GGGAAATCTG CAAGGTTTAC AAAGTCTGGG AACTTTTCTT GGAGCAATAG CAGCAGGACG GTCATATGAT CTTTTGGGTC CAAGAAGTCC ATTCTTTGGC ACAATATTGC TTCTACTATT TGTTATGTTT TTAATTTCAG GGAAAAGTCT TACCAAGAAA AAAGTAATCT CCTAG
Protein sequence | MRRLKIPTLL GAFITLLDDR LGETIVLPLL PFLLEQFTTS ATTLGFLTGT YAISQFAAAP LIGAMSDRFG RKPIMITCVS GSVIGICLFA LTVSLNWDNY LPLWASTLPL SLLFLARIID GISGGTAATA TTILADISTP ENRAKTFGLI GVAFGLGFIL GPGLGTALAK FSVTLPVWVA SGFAIFNLIF VIWFLPETLP KNKRNLLPRK RDLNPISQLL VVFKNPLARR LCLSFFVFFM AFNGFTAVLV LYLKEKFGWS PELCSAAFIV VGVIAMIVQG GLIGPLVKRF GESRLTFAGI GFVMTGCILL TLANIDTSIP LVFSGVAILA MGTGLVTPSL RALISRRLSS IGQGAVLGNL QGLQSLGTFL GAIAAGRSYD LLGPRSPFFG TILLLLFVMF LISGKSLTKK KVIS
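The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes translation table 11 (as listed in the record), in which codons map to amino acids as in the standard code and an alternative initiation codon such as TTG is read as methionine when it opens the CDS. The helper names `translate_cds` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used here.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in TCAG order; table 11 uses the same
# mapping and differs only in its set of permitted initiation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment in frame 0, stopping at the first stop codon."""
    clean = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(clean) - len(clean) % 3, 3):
        codon = clean[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace."""
    clean = "".join(dna.split()).upper()
    return sum(base in "GC" for base in clean) / len(clean)


# First 30 bases of the gene sequence in this record.
prefix = "TTGAGAAGGC TAAAAATTCC CACCCTTTTA"
print(translate_cds(prefix))  # prints: MRRLKIPTLL
```

The printed residues match the first ten residues of the protein sequence, including the leading M that comes from the TTG start codon. Applying `gc_fraction` to the full gene sequence would give the value summarized in the GC content field.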