Gene NATL1_02911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02911 
Symbol 
ID4780047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp269243 
End bp270910 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content42% 
IMG OID640083556 
Productputative sulfate transporter 
Protein accessionYP_001014120 
Protein GI124025004 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGTTAA TTAATGGCTT TCACCTGAAA AATATTAGAG GTGATGTTTT AGGCGGCTTA 
ACAGCTGCAG TTGTGGCATT ACCACTAGCT TTAGCGTTTG GTAATGCTGC TCTTGGTCCT
GGGGGCGCTA TTTATGGCCT TTACGGGGCA GTCGTAGTTG GCTTCCTCGC CGCTTTATTT
GGAGGAACGC CTGCTCAGGT AAGTGGTCCG ACTGGGCCGA TGAGTGTAAC CGTTGCTGGA
GTTGTGGCGA GCTTGGCGGC TGTAGGAGTA CCTAGGGATT TATCTGCTGG CGAAATTCTT
CCTCTAGTAA TGGCTGCGGT AGTCATTGGA GGATTATTTC AGATTTTATT TGGCCTACTA
AGGCTTGGTA AATACATAAC ACTTGTTCCA TACTCAGTTG TTTCAGGCTT TATGTCTGGA
ATTGGAGTAA TCATAATTAC TCTTCAAATA GGTCCTTTAC TTGGAATATC AACTCGAGGA
GGAGTATTAG AGTCTCTTTC AACTTTATTT AACAATTTCG AGCCAAATGG GGCTGCAGTA
GCAGTTGCAG TAATGACACT TGCCATTGTG TTTCTGACCC CCCGTAAAAT AAGTCAATGG
GTCCCGTCTC CTCTTTTAGC ATTATTAATA GTTACTCCTT TATCAATACT TCTTTTTGGC
GATAGTTCCA TAGACAGGAT AGGCGCAATA CCAGAGGGTG GTCTTTCTTT AAGTCTCCCA
GATCCAAGCC TTGGCAATTT CTTCCCAATA ATTCTTAAAG CTGGACTAGT CCTTGCTGTT
CTTGGGGCTA TTGACTCTCT TCTTACATCT CTTGTCGCAG ATAATATTTC TCAAACTAGG
CACAATTCTG ATAGAGAACT CATTGGACAA GGTATTGGTA ATGCAGTAGC TGGCATTTTC
ACAGGTTTGC CTGGTGCTGG GGCAACCATG AGAACAGTAA TAAATGTCAA ATCTGGAGGT
TCCACACCAA TCTCAGGGAT GGTTCATTCA GTCGTATTGC TATTTGTTCT TGTTGGTGCC
GGTCCTCTAG CCGCTCAGAT CCCTACCGCT CTTCTTGCAG GAATTTTAAT AAAAGTTGGC
TTAGACATTA TTGATTGGGG ATTTCTTTTA AGAGCTCATA AACTTTCGCT GAAAACAGCA
AGTGTGATGT ATGGAGTCCT ATTTATGACA GTTTTCTGGG ACTTGATATG GGCTGTACTA
GTTGGAGTAT TCATAGCCAA TATGCTCACA ATCGATTCAA TTACAGAGAC GCAATTGGAG
GGAATGGAAG CTGACAACCC ATTCGATTCA ACATCAAATA ACAATGCAGC AAATGCACAA
CTCCCCTCAG ATGAGAAAGC ACTTTTAGAT CGTTGTGCGG GAGAAGTAAT GCTATTCAGA
TTAAAAGGGC CACTTAGTTT TGGAGCAGCA AAAGGAATTA CTGAAAGAAT GATGCTTGTA
AGAAACTATA AAGTTCTAAT TCTTGATATC ACTGATGTTC CTAGACTTGG AGTCACGGCA
ACTCTCGCAA TAGAAGATAT GGTTCAAGAA GCTCTAAGCA ACTCAAGGAA AGCATATGTT
GCTGGCGCAA CAGGAAGAGT AAAGGACAGA CTTGCTAAAT TTGGAGTTGA TGTAATTGGC
ACAAGAAAAG AAGCACTTGT AGCTGCAATT GACAGCTTGA ATGGTTAA
 
Protein sequence
MALINGFHLK NIRGDVLGGL TAAVVALPLA LAFGNAALGP GGAIYGLYGA VVVGFLAALF 
GGTPAQVSGP TGPMSVTVAG VVASLAAVGV PRDLSAGEIL PLVMAAVVIG GLFQILFGLL
RLGKYITLVP YSVVSGFMSG IGVIIITLQI GPLLGISTRG GVLESLSTLF NNFEPNGAAV
AVAVMTLAIV FLTPRKISQW VPSPLLALLI VTPLSILLFG DSSIDRIGAI PEGGLSLSLP
DPSLGNFFPI ILKAGLVLAV LGAIDSLLTS LVADNISQTR HNSDRELIGQ GIGNAVAGIF
TGLPGAGATM RTVINVKSGG STPISGMVHS VVLLFVLVGA GPLAAQIPTA LLAGILIKVG
LDIIDWGFLL RAHKLSLKTA SVMYGVLFMT VFWDLIWAVL VGVFIANMLT IDSITETQLE
GMEADNPFDS TSNNNAANAQ LPSDEKALLD RCAGEVMLFR LKGPLSFGAA KGITERMMLV
RNYKVLILDI TDVPRLGVTA TLAIEDMVQE ALSNSRKAYV AGATGRVKDR LAKFGVDVIG
TRKEALVAAI DSLNG