Gene NATL1_14611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_14611 
Symbol 
ID4779191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1170103 
End bp1171647 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content39% 
IMG OID640084742 
Productputative sulfate transporter 
Protein accessionYP_001015284 
Protein GI124026168 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.324968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAATTGA AAAATTTATT GGTAAAGGAG AGAATTATCT CTCCTTCCCG CGATGTTATT 
GCTGGTCTTG TGGTTGCATT TGCAATGATC CCAGAGGCGA TAGCTTTTTC TGGTATTGCA
GGCGTTGACC CAAGAGTTGG GTTGTTTGGT GCTTTTTTGC TTTCTGTCAC CCTTGCAATT
TTTGGGGGCA GAATGGCCAT GATTACCTCA GTAACTGGTT CAACTGCTCT TTTGATGACT
GGGATTGTTC AACAGGGAGA AAATATTAGC CCTGGCCTTG GTCTGCAATA TCTTTTGGCT
GCTGGATTGC TTACGGGAGT TCTTCAGATC GCTTGGGGGT ATTTAAGACT TGCTCATCAG
ATGAGATTTG TACCTCAACC AGTCATGGAT GGCTTTGTAA ATGGTTTAGC AATATTGATC
TTCCTTGCTC AATTGCCTCA TTTGGGGATT GACATAGCTC ATTCTGAGAA AGTTGTTACT
GCGGTCCAGC TACCTGCTGT TTGGGGCTTG ACCATACTTA CGTTGTTAAT TATCTATTTG
CTACCTAAAT TTACCAAGCT TTTGCCATCA GCATTGGTCG CGATTTTTAT TTGTACAGCG
ATTTCAATTG TATTTAAATT AAATGTTCCA ACTGTATCGA ATTTAGGAAT TCTCCCAAAT
GGATTACCTA GCTTTGGTAT TCCAAAAGTT CCATTTAATT TCGAAACACT TGGTTTGATA
CTTCCAACAG CACTAGCAAT TTCTTTGGTT GGTCTTATGG AGACATTTCT CACTCAAGAT
ATTCTTGATG ATATGACTGA TAAAAGTACA AATAAAAATG TTGAGGCTCG AGGGCAAGGT
ATGGGAAATA TTGTTAGTTC GCTTTTTGGA GGTATGGCTG GATGCGCTTT GGTTGGACAA
TCGGTTATGA ATGTGGGTTA TGGGGGAAGA ACTCGTCTTT CAACATTAAG CTCTGGTGTT
TGTTTAATAG CAATGATTCT TGCGGCTAAG GATTGGGTAA ATCAAATACC AATGGCAACA
TTGGTTGGAG TTATGATAAT GATTGCTATA AATACTGCTA ATTGGGGCTC AATTAAAGAT
ATTCGCCGAA TTCCTCGAAG CGATAGCTCA GTTATGATTT TGACTGTATT CGTAACTGTT
ATTACACATA ATTTAGCTCT TGGTCTTCTT TCTGGTGTTG GACTTGCAGC AATATTATTT
AGTAGAAAGG TCGCAAAAGT TATTAAGGTT GAGTCTTCTT TGAATGGGAA AGACCACAGG
ATTTATAAAG TTTCAGGCCA ATTATTTTTT GTTAGTAGCA TTTACTTTAG ACAAGGGTTT
GAACTACATG AACATCCTAA AAAAATTACA ATAGATATGG CCGAAGCTCA TATTTGGGAT
CAAAGCGGCG TAACGGTTCT AGACCAAGTA ATCAGAAGAA TAAAAATAGG GGGGTCTGAG
GTTGAAGTGA TTAATTTAAA TGATGAAAGC TTGAATTTAT TTTCTCGAAT AGGACAAGCA
TCAGAAGCTG GGGGAAGAGG TGGTGAGTTT AAATCGGCAC ATTAA
 
Protein sequence
MKLKNLLVKE RIISPSRDVI AGLVVAFAMI PEAIAFSGIA GVDPRVGLFG AFLLSVTLAI 
FGGRMAMITS VTGSTALLMT GIVQQGENIS PGLGLQYLLA AGLLTGVLQI AWGYLRLAHQ
MRFVPQPVMD GFVNGLAILI FLAQLPHLGI DIAHSEKVVT AVQLPAVWGL TILTLLIIYL
LPKFTKLLPS ALVAIFICTA ISIVFKLNVP TVSNLGILPN GLPSFGIPKV PFNFETLGLI
LPTALAISLV GLMETFLTQD ILDDMTDKST NKNVEARGQG MGNIVSSLFG GMAGCALVGQ
SVMNVGYGGR TRLSTLSSGV CLIAMILAAK DWVNQIPMAT LVGVMIMIAI NTANWGSIKD
IRRIPRSDSS VMILTVFVTV ITHNLALGLL SGVGLAAILF SRKVAKVIKV ESSLNGKDHR
IYKVSGQLFF VSSIYFRQGF ELHEHPKKIT IDMAEAHIWD QSGVTVLDQV IRRIKIGGSE
VEVINLNDES LNLFSRIGQA SEAGGRGGEF KSAH