Gene NATL1_20501 details


Gene Information

Locus tag: NATL1_20501
Symbol: kefB
ID: 4779929
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1693553
End bp: 1694926
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 41%
IMG OID: 640085344
Product: CPA2 family Na+/H+ antiporter
Protein accession: YP_001015870
Protein GI: 124026755
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0475] Kef-type K+ transport systems, membrane components
TIGRFAM ID:
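
The gene and protein lengths above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp):
    """Number of residues for an unspliced CDS whose length includes the stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


START_BP = 1693553  # "Start bp" from the record
END_BP = 1694926    # "End bp" from the record

length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp)                              # 1374
print(expected_protein_length_aa(length_bp))  # 457
```

Both values agree with the Gene Length and Protein Length fields of the record.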


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0786073
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTATTAA CTCCTTTAGT CTCAGCACTT AATACTCATG ATGTTGAGGT AGCGGAAACG 
CTGATAGGGG TCATTAATTT CTTGATGATC TTTGTTGCGG CAAGGACTTT GGCTGAAATC
TTGGTTCGAC TAAGTTTGCC TACAATAGTC GGTGAACTTC TTGCGGGAGT TTTAATTGGT
GCCTCAGGAT TACATCTTTT ATTGCCGCCT AGTGCTCATG CTGAGTTGAA TCAAGGTTTT
GTTTCTGTTA TTAGCACACT TGCTTCAGTA CCCAAAGAAG CAGTCCCAGA TATTTATTTT
GAAACTTTTC CATCATTACA AGCCGTAGCA ACTTTAGGGC TATACGCTTT GTTATTTCTG
ACCGGACTGG AGAGTGAATT AGAGGAACTT GTAGCAGTTG GAGCACAAGC TTTCACTGTT
GCAATGGCAG GAGTGATTTT GCCATTTGCT TTTGGAACTT TTGGATTAAT GTTTATTTTC
CAAGTAGATA TCATTCCTGC GATATTTGCT GGTGCATCTA TGACCGCGAC AAGTATTGGT
ATTACTGCAA GTGTTTTTGG TGAGCTTGGT TATTTAAAAA CTCGTGAGGG TCAAATTGTT
ATTGGCGCAG CAGTATTGGA TGACATACTT GGAATTGTTA TTCTTGCCGT TGTTGTAGCT
CTTGCGACGG GAGGATCATT ACAAATTGCT CCCATTGTTA AATTAGTACT TGCTGCAACT
GTTTTTGTGT TTGCTGCAAT TGCTTTAAGT CGAACAGCTG CTCCAGCCTT TGATTGGTTG
CTTGAGAGAC TTAAAGCTCC CGGAGCGGTT GTTGTAGCTT CTTTTGTGAT ACTAGTCTTG
AGCTGTTTTG TTGCTACTGC TATAGGTTTA GAAGCTGCTT TGGGTGCTTT TGCTGCTGGA
TTGATACTAA GTAGTTCAAA AAATAATCAC GCAATTCAAC AATCTGTTTT GCCATTAGTA
TCGCTCTTCG CAACTATCTT CTTTGTACTT GTTGGAGCAG GTATGGATCT TTCTGTGATC
AATCCTCTTG ATCCACAGAG TCGCTCTGCT CTAATCGTGG CTGGTTTTTT ATTTATTGTG
GCAATTGTCG GGAAAATCGC TGCTGGCTGG TGCTTTGTCA TTGATAAACC AACAAATAGA
TTAGTAGTTG GCCTGGGAAT GATGCCTCGA GGTGAGGTCG GATTGATTTT TCTAGGTTTG
GGAACAAGTG CTGGTTTGCT TACCCCTTCT CTTGAGGCTG CAATATTGTT AATGGTAATT
GGAACAACAT TTTTAGCACC TGTTTTGCTT AGAGTTGTTT TGAAAGATAA ACCTCCTTCA
GGAGGCAACT CTATTCCTGA TGAGATTGCA GCTGATCCAG TTGGATTGGT TTAG
 
Protein sequence
MVLTPLVSAL NTHDVEVAET LIGVINFLMI FVAARTLAEI LVRLSLPTIV GELLAGVLIG 
ASGLHLLLPP SAHAELNQGF VSVISTLASV PKEAVPDIYF ETFPSLQAVA TLGLYALLFL
TGLESELEEL VAVGAQAFTV AMAGVILPFA FGTFGLMFIF QVDIIPAIFA GASMTATSIG
ITASVFGELG YLKTREGQIV IGAAVLDDIL GIVILAVVVA LATGGSLQIA PIVKLVLAAT
VFVFAAIALS RTAAPAFDWL LERLKAPGAV VVASFVILVL SCFVATAIGL EAALGAFAAG
LILSSSKNNH AIQQSVLPLV SLFATIFFVL VGAGMDLSVI NPLDPQSRSA LIVAGFLFIV
AIVGKIAAGW CFVIDKPTNR LVVGLGMMPR GEVGLIFLGL GTSAGLLTPS LEAAILLMVI
GTTFLAPVLL RVVLKDKPPS GGNSIPDEIA ADPVGLV
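
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own procedure. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the example uses only the first 60-base row of the gene sequence.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G ordering of each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1 and stop at the first stop codon.

    Assumes the sequence starts with ATG; an alternative start codon
    (permitted by table 11) would need to be reported as M separately.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGGTATTAA CTCCTTTAGT CTCAGCACTT AATACTCATG ATGTTGAGGT AGCGGAAACG"
print(translate(first_row))  # MVLTPLVSALNTHDVEVAET
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.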