Gene NATL1_11521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_11521 
Symbol 
ID4780978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1029659 
End bp1030630 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content43% 
IMG OID640084431 
ProductABC transporter, substrate binding protein, phosphate 
Protein accessionYP_001014975 
Protein GI124025859 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0226] ABC-type phosphate transport system, periplasmic component 
TIGRFAM ID[TIGR00975] phosphate ABC transporter, phosphate-binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.234302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0131021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCG CCAAGAAGGC CCTCATCTTT ACTTCTTTGC TTGCAGTGGG CGCAGGCATG 
TCCGCAACTG CAGCTAGTCG TCTTAGTGGA GCAGGTGCAT CCTTCCCCGC TAAAATCTAC
ACTCGTTGGT TTTCCGATTT AGCAAAAGAG GGTGGTCCTC GTGTTAACTA CCAAGCTGTT
GGTTCAGGTT CTGGCCGTAA AGCATTCATT GATCAAACCG TAAACTTCGG TGCTTCTGAT
GATCCAATGA AAGCAAAGGA TATTGCAAAA GTTACTCGTG GATTAGTTCA AATCCCAATG
GTTGGAGGCA CAATTGCCTT TGGTTACAAC TACGATTGCG ACCTTAAACT TACTCAAGAG
CAAGCTGTTC GCGTTGCTAT GGGTAAAATC TCAAATTGGA AAGAAGTTGG TTGTCCAGCA
GGAAAAATGA CATGGGCACA TCGCTCTGAT GGCTCCGGTA CAACCAAGGC TTTTTCAAAC
TCTATGCAAG CTTTCTCTAA GACATGGAAT TTAGGAACAG GTAAATCTAT TGCTTGGCCT
GCTGGTGTTG GTGGAAAAGG TAACGCTGGT GTTGCAGGCG TAATTCGTAA TACTCCTGGT
GCAATTGGTT ATGTAAACCA GTCATATATT AAAGGAGAAA TCAAGGCTGC CGCTCTTCAA
AACTTATCTG GAGAGTATTT AAAGCCATCT ACTGAGTCAG GAGCTAAAGC TCTTAATGGA
ATTAAGTTAG ATGAAAATTT AGCAGGTAAA AACCCCAACC CAAAAGCTAA GGGTGCATAC
CCAATCGCTA CGTTGACATG GATTCTTGCT TATGAAGAAG GCAATGGTAG AAATACTAAA
GCCATTCAAA AATCACTTAA CTACTTGCTA AGTGATAAAG CTCAGGCTAA GGCTCCTTCT
CTTGGATTCG TACCTCTTAA AGGTGAAATT CTTAAAAAAT CACGTGCTGC CGTAAAGCGT
ATTGGTAAAT AA
 
Protein sequence
MTFAKKALIF TSLLAVGAGM SATAASRLSG AGASFPAKIY TRWFSDLAKE GGPRVNYQAV 
GSGSGRKAFI DQTVNFGASD DPMKAKDIAK VTRGLVQIPM VGGTIAFGYN YDCDLKLTQE
QAVRVAMGKI SNWKEVGCPA GKMTWAHRSD GSGTTKAFSN SMQAFSKTWN LGTGKSIAWP
AGVGGKGNAG VAGVIRNTPG AIGYVNQSYI KGEIKAAALQ NLSGEYLKPS TESGAKALNG
IKLDENLAGK NPNPKAKGAY PIATLTWILA YEEGNGRNTK AIQKSLNYLL SDKAQAKAPS
LGFVPLKGEI LKKSRAAVKR IGK