Gene NATL1_15901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15901 
Symbol 
ID4781206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1299760 
End bp1301085 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content39% 
IMG OID640084872 
Productsodium-dependent transporter 
Protein accessionYP_001015412 
Protein GI124026296 
COG category[R] General function prediction only 
COG ID[COG0733] Na+-dependent transporters of the SNF family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGAAA GGGAACAATG GCGATCAGGA CTGGGATTCG CGCTCGCTGC GGCAGGTAGT 
GCTGTAGGCC TTGGAAATCT TTGGGGCTTT GCTTATAGGT CATCTCAAGG CGGTGGACTT
GCTTTTCTTA TCCTTTATGT ATTGGTTGTT TTAGTTGTTT GCCTACCTGT CTTAGTTGCA
GAGATGGTCT TGGGCAGGAG TACTGCTAGT AGTCCTTTCC TTGCGCCAAT TAAAGCTGCT
GGTGAAAATT GGAAGCCTTT AGGGTGGCTT TTTGCAATAG CTTCTTGTGG AATTCTTTCT
TATTACGCAG TGATAATGGG ATGGACAATT GATACTTTCT TCCATTCTTT ATTTATTGGC
CTACCCTCAG ATATGACTGA GGCGGGAGAA TTTTTTGGTA AAATTAGCAG TGGTAATAGT
GTATTTGTAG GTCAAATAAT CAGCTTACTG TTAACTGCTT TTGTTGTGGT TGCAGGCGTT
CGTGGAGGTA TAGAAAAACT AACAAAATGG GCAATGCCAT TTTTATTTGG ACTCCTTTTG
TTGTTAGCTA TATGGGCTGC AACTTTATCT GGTGCATGGG AAGGATATAC ATCATTTTTA
CTTAAATGGG ACTCCTCTCA ACTTTTTGAT AAAAACACAA TAAGTAATGC GTTTAAACAA
GCTTTCTTTT CTTTAAGTTT GGGTATTGGA ATTATGGTTG CCTATTCCTC ATATCTAAAC
CGTAAAAATC ATCTTCCTAA AGAAGCTTTG AGAGTTGCAA CTTTGGATAC TGCTGTGGGT
TTACTTGCAG GGCTGATTAC ATTCCCTGTC GTTATGAGTT TTGGTTTGAA AGATGTTATA
AGTGAATCAA CTGTTGGGAC TTTATTTATT GCTCTTCCAA CTGGATTCGC AAATCTTGGA
TTGTTTGGGA GATTGATTGC TGCCGTTTTC TTCGGATTGG CCTTTATAGC TGCTATTACC
TCTTCTATTT CATTAATGGA AGTACCAGTA TCTTCTTTGA TGGACAGGCT GAATTGGAGT
AGGAAGAAGG CCGTTTGGAC TTCAACATTA GTGATCTTTT TGATTGGAAT ACCGTCTGCT
ATTTCTACAG ACTTTTTAGG CAAGTCTGAC GCTATCTGTA ATACACTCTT GATATTAGGT
GGCCTTCTAA TTTCAATTCT TTTGGGTTGG ATTGTTCCAA ATCGTTATGA TGAGGATCTT
GCAAATTCTA ATGCTAATTT AAGAGTTAGA AGGTATCTAA AGTTTATGCT GCGCTGGGTG
TCTCCACCAG TTATTGCTAT TGGACTTTAT TTAACGGTCT TATCTACAAT AGAAACATTT
GCTTAA
 
Protein sequence
MQEREQWRSG LGFALAAAGS AVGLGNLWGF AYRSSQGGGL AFLILYVLVV LVVCLPVLVA 
EMVLGRSTAS SPFLAPIKAA GENWKPLGWL FAIASCGILS YYAVIMGWTI DTFFHSLFIG
LPSDMTEAGE FFGKISSGNS VFVGQIISLL LTAFVVVAGV RGGIEKLTKW AMPFLFGLLL
LLAIWAATLS GAWEGYTSFL LKWDSSQLFD KNTISNAFKQ AFFSLSLGIG IMVAYSSYLN
RKNHLPKEAL RVATLDTAVG LLAGLITFPV VMSFGLKDVI SESTVGTLFI ALPTGFANLG
LFGRLIAAVF FGLAFIAAIT SSISLMEVPV SSLMDRLNWS RKKAVWTSTL VIFLIGIPSA
ISTDFLGKSD AICNTLLILG GLLISILLGW IVPNRYDEDL ANSNANLRVR RYLKFMLRWV
SPPVIAIGLY LTVLSTIETF A