Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_06131 |
Symbol | alsT |
ID | 5731382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 559388 |
End bp | 560731 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641284975 |
Product | Na+/alanine symporter |
Protein accession | YP_001550498 |
Protein GI | 159903154 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1115] Na+/alanine symporter |
TIGRFAM ID | [TIGR00835] amino acid carrier protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.407748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.332279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGGAA CAATTTCAAA TGCTATAGAG CTAATAAACA GCCCTATAAA TGGCTTTGCT TGGGGTTGGC CAACAGTGAG CCTTATTGCG ATTACTGGGA TCGTACTTAT GTTGGGGCTC GGATTTATGC CTCTACTGCG GCTTCCTTAT GGATTTAAGA TTTTGCTTAA TAGCTCTACT AAAGACACAC AAGAAGGAGA AATAAGTCCA TTCCAAGCCT TAATGACATC ACTTTCCGCG ACAATCGGAA CTGGAAATAT TGCTGGTGTA GCTGCAGCAA TTGCAATAGG CGGACCAGGC GCAATTTTTT GGATGTGGTT AATAGCAATT TTTGGAATTG CCACCAAATA TGCTGAAGGG GTTTTAGCTG TTCACTATCG CGAAGTTGAC TCTCTTGGGA ACCACGTAGG TGGTCCGATG TATTACATAA AAAATGGCCT AGGTAGTAGA TGGACTTGGT TAGGAGGATT ATTTGCTCTT TTTGGCATGT TGGCAGGGTT TGGTATTGGG AATGGGGTTC AGTGCTTTGA AGTCTCAAGT GCTCTTGCAT TAGCCGGTAT TCCAAAGCTA CTCACCGGAG TTGTCCTGGG AATTCTTGTT TTCTCTGTCA TCGTTGGAGG TGTTAAACGT ATAGCAAAAG CTGCTTCTGC CATAGTTCCA TCTATGGCAC TTTTATATGT ATTGGCTTGT TTAATAATTA TACTTAGCAA TTTTTCAGAA GTCCCATCAG CCTTTTCAAC AATATTTTCA AATGCCTTTA CAGGCAAAGC TGCTGCTAGC GGAGCATTTA CTCAAGTAAT TCTAATGGGA TTTAAAAGAG GTATTTTTTC AAATGAAGCC GGTCTAGGAA GCGCCCCAAT TGCGCATGCC TCTGCTCAGA CCAATGATCC TGTCAGACAA GGAACCATAG CAATGCTTGG AACTTTTATT GACACCATAA TTATTTGTAC AATGACTGCG CTAGTAATCA TCACCACTGG TGCATATCAA ACAGGAGAAT CTGGAGCTGA TCTTTCAATT ACTGCATTTA ATAGTGGGAT TGCTGGTAGT GGATGGATTG TTCTCGTGGG TCTAGTTTTA TTTGCATTCA CAACTATTCT TGGGTGGAGC CTATATGGAG AACGTTGCAC TGAATATCTT TTTGGGACTA AAGCAATACT TCCTTTTAGG TTAGTTTGGG TCTCTGTTGT AGTTATTGGC GCTGTTGCAG GAGATAGAGG GATCGTTTGG GCTGTTGCAG ATACTTTAAA TGGATTAATG GCTATCCCTA ATTTAATAGC TCTTTTACTT CTTTCAAAAA CCGTATTCAA ACTTTCCCGT AACTACCATT TTAAAAGTCA ATAA
|
Protein sequence | MQGTISNAIE LINSPINGFA WGWPTVSLIA ITGIVLMLGL GFMPLLRLPY GFKILLNSST KDTQEGEISP FQALMTSLSA TIGTGNIAGV AAAIAIGGPG AIFWMWLIAI FGIATKYAEG VLAVHYREVD SLGNHVGGPM YYIKNGLGSR WTWLGGLFAL FGMLAGFGIG NGVQCFEVSS ALALAGIPKL LTGVVLGILV FSVIVGGVKR IAKAASAIVP SMALLYVLAC LIIILSNFSE VPSAFSTIFS NAFTGKAAAS GAFTQVILMG FKRGIFSNEA GLGSAPIAHA SAQTNDPVRQ GTIAMLGTFI DTIIICTMTA LVIITTGAYQ TGESGADLSI TAFNSGIAGS GWIVLVGLVL FAFTTILGWS LYGERCTEYL FGTKAILPFR LVWVSVVVIG AVAGDRGIVW AVADTLNGLM AIPNLIALLL LSKTVFKLSR NYHFKSQ
|
| |