Gene NATL1_01501 details


Gene Information

Locus tag: NATL1_01501
Symbol: trkG
ID: 4780314
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 147720
End bp: 149105
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 35%
IMG OID: 640083414
Product: Trk family sodium transporter
Protein accession: YP_001013979
Protein GI: 124024863
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0168] Trk-type K+ transport systems, membrane components
TIGRFAM ID:
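
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(147720, 149105)
print(length_bp)                  # 1386
print(protein_length(length_bp))  # 461
```

Both values agree with the Gene Length and Protein Length fields of this record.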


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTATTA GGCAAGAAAC TTATAGAAGG CTTACGGTTC CGCAGTTTAC AGTGGTAACA 
GGTTTACTTG TGATTGCTTT TGGAACATTA TTATTGGCTA CACCTTTTTG TTCTAATGCA
AATGTAGGTC TATGGGAGGC ATTATTTACG GCAACTTCTG CTGTCACAGT TACGGGGTTA
TCTATTATTG ATATAGGAAT AGATTTGACA TTTTTTGGAC AAGTAATTTT AGCGATTATG
TTGTTAACTG GGGGCCTTGG TTTAATGGCT ATTACTACAT TTTTGCAGGG CTTTATTGTT
AGTGGAACAG AATTAAAAAC ACGTCTTGAT AGAGGGAAAA CTCTTGATGA ATTTGGAGTC
GGGGGTGTGG GTACAACGTT TAAAGGTATT GCGATTACAG CATCTATACT TATTTTTCTT
GGTTCTATTA CTTTGTATTT TTTCGGCTTT AAAAATATAA CTAGCTCAAG TGAAAGGATT
TGGGCATCAA TTTTTCATAG TATATCTGCT TACAATAATG CTGGGTTCAG TTTATGGTCA
AGCAGCTTAC AAAATTATAG AGGTAATTGG GTAGTGAATT TTGTTTTAAT TACTTTAATT
ATATTAGGTG GTTTTGGATG GAGAGTAACT AATGATATTT GGATAAATCG TAGATCTTTA
AAATTAAGAA ATTTAAGTCT TCATACACGT TTAGTAATTA GATCATCTTT CATATTGATT
GCTCTGGGAT TCTTTGGATT GATTTTTACT GAATCGTTAG CTAGGGGTAG CTTCTTTTCG
TTAATTAATT TCGATGATCG TATTTTAACC GCTTTATTTA CTTCTGTTAG TTCACGAACT
GCAGGCTTTA CGAATTTGCC CATATCAATT GAAAGTGTCT CTGACTCAGG TCTCTTGTTG
ATAATGTTTC TTATGTTTAT TGGGGCAAGT CCAGGAGGCA CTGGAGGCGG AATTAAGACG
ACAACTATTG CTGCATTAAT GGCAGCCACA AGAGCAACTC TACGTGGTCA AAATGAAATT
ATTATTCGGA ATCGTCAGAT ATCTGACAAA GTAATTCTTA AAGCTGTTGG TATAACTGTT
GGTTCATTTT TATTTGTGTT GATTATGGCT TTATTATTAA GTTTGAGTAA TGGATTCAAT
AGTGGAGAGA ATTTTTCATT TTTAGAAATG CTTTTCACTT GTATTTCTGC TTTTGCAACT
GTAGGTTTTG ATCTGGGCGT AACCTCTAAG TTAGGACATG TCGGTCAATT AATTCTGATT
ATTGGAATGT TTGTTGGCAG ACTAGGAATC CTTTTATTCT TGAGCGCTGT ATGGCAAGCT
CTTAATAAAA GTAAGATTCA ACATCGCAAT CGAATTGGCT ATCCGAAGGA GGATCTCTAT
GTTTAA
 
Protein sequence
MSIRQETYRR LTVPQFTVVT GLLVIAFGTL LLATPFCSNA NVGLWEALFT ATSAVTVTGL 
SIIDIGIDLT FFGQVILAIM LLTGGLGLMA ITTFLQGFIV SGTELKTRLD RGKTLDEFGV
GGVGTTFKGI AITASILIFL GSITLYFFGF KNITSSSERI WASIFHSISA YNNAGFSLWS
SSLQNYRGNW VVNFVLITLI ILGGFGWRVT NDIWINRRSL KLRNLSLHTR LVIRSSFILI
ALGFFGLIFT ESLARGSFFS LINFDDRILT ALFTSVSSRT AGFTNLPISI ESVSDSGLLL
IMFLMFIGAS PGGTGGGIKT TTIAALMAAT RATLRGQNEI IIRNRQISDK VILKAVGITV
GSFLFVLIMA LLLSLSNGFN SGENFSFLEM LFTCISAFAT VGFDLGVTSK LGHVGQLILI
IGMFVGRLGI LLFLSAVWQA LNKSKIQHRN RIGYPKEDLY V
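
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 shares its amino acid assignments with the standard genetic code, and it simplifies start handling by always reading the first codon as M (this gene begins with GTG, an alternative start codon). The `translate` helper is hypothetical and written only for this illustration.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G
# (third position varying fastest). Table 11 is assumed to share them.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base groups is ignored."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Simplification: the initiator codon is always read as methionine.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGAGTATTA GGCAAGAAAC TTATAGAAGG CTTACGGTTC CGCAGTTTAC AGTGGTAACA"
print(translate(first_line))  # MSIRQETYRRLTVPQFTVVT
```

The output matches the first 20 residues of the protein sequence listed above.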