Gene NATL1_14271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_14271 
Symbol 
ID4780656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1154682 
End bp1155743 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content37% 
IMG OID640084707 
Productextracellular solute-binding protein 
Protein accessionYP_001015250 
Protein GI124026134 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAAAA AAATTATGCG TCGATTGGTA TCTGTAATAG TTGGTCTAGC AGCTCTATCA 
GCTGGCTGTG CGACAACAAA TCAAGACAAT AGTTCTAGAC TTAACCTAAT AAAAAATCGC
AATGAGTTGA TTTGTGGAGT AAGTGGAAAG ATTCCTGGAT TTAGTTTTCT GAAAAGTGAT
GGCACTTATC AAGGACTAGA TGTCGATATA TGTAAAGCAT TTGCTGCTGC GATCATAGGA
GATTCAGAAA AAATTCAATA TAGACCTCTA ACTGCAGCAG AAAGATTCAC AGCTATTAAA
ACTGGGGAAA TTGACCTTTT GTCTAGAAAT ACCACTTTCA CTCTCAGTAG AGATTCCTCA
GGAGGAAATG GATTAACTTT TGCACCAGTT GTCTTCCATG ATGGCCAGGG ATTGATGGTC
AAGAAGGAAA GTAAAATTAG TGGTCTCAAA GATCTTGCAA ATAAATCTAT ATGTGTAGGC
TCAGGCACCA CTACTGAGCA AAATATAAAT GATGCATTTG AGAGTGCCTC ACTGCCTTAT
ACACCAATCA AATATCAAGA TCTTAATCAA GTGGTTGCTG GTTATTTACA GGGTCGTTGT
TCAGCTATGA CTTCTGATCG TTCACAGTTG GCTGCAGCTA GATCTGGCTT TAAGAATCCA
AAAGAACATA TTATTCTTGA AGATGTCCTG AGCAAGGAGC CACTTGCTCC TGCTTCCGAT
GGCCAAGATC AGAAACTAGC TGATGCAATG AGATGGGTTG TCTTTTCCCT TATATCGGCA
GAAGAGCAAG GGATAACAAA ATCAAATATT GATAAAAAAG TTCAAATTGC AAAGAATAAT
CCTCAGTTAA AACCTTTAAG AAGATTTTTA GGTATTGATG GGGGACTAGG AGAAAAAATT
GGACTTAGCA ATGACTTCGT AGTTAAAGTA ATTAGCTCAA CAGGCAATTA TGGAGAGATT
TACGAAAGAC ATTTAGGACA AAATAGCGAA GTACCTATTC CAAGAGGACA AAATGAGTTG
TACAAGAAAG GAGGTGTACA TATTTCACCA CCATTTAACT AA
 
Protein sequence
MIKKIMRRLV SVIVGLAALS AGCATTNQDN SSRLNLIKNR NELICGVSGK IPGFSFLKSD 
GTYQGLDVDI CKAFAAAIIG DSEKIQYRPL TAAERFTAIK TGEIDLLSRN TTFTLSRDSS
GGNGLTFAPV VFHDGQGLMV KKESKISGLK DLANKSICVG SGTTTEQNIN DAFESASLPY
TPIKYQDLNQ VVAGYLQGRC SAMTSDRSQL AAARSGFKNP KEHIILEDVL SKEPLAPASD
GQDQKLADAM RWVVFSLISA EEQGITKSNI DKKVQIAKNN PQLKPLRRFL GIDGGLGEKI
GLSNDFVVKV ISSTGNYGEI YERHLGQNSE VPIPRGQNEL YKKGGVHISP PFN