Gene Information |
Locus tag | P9303_21241 |
Symbol | |
ID | 4777103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1885545 |
End bp | 1887392 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087632 |
Product | amino acid transporter |
Protein accession | YP_001018124 |
Protein GI | 124023817 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.97289 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCTTTT TCCAGAAACT TCTGGGGCAT CCACTCCTCC GCAACAAGGC TGACGATGAG CGGCTGCCCA ATGTTCAGGC TCTGCCGATC CTCTCCTCGG ATGCCTTGTC GTCAGTGGCT TATGCCACTG AAGCTGCACT TGGTGTTCTG ATTCTTGGCG GCAGCGGTGC CCTTGGCCTT TCTGTACCGA TCACCCTGGC GATTATTGCT CTGGTTGCCA TCGTTGTGCT CTCCTACCGC CAGGCGATTG AGGCTTATCC GAAAGGGGGC GGCTCTTACG TCGTGGCACG CGACAACCTT GGCCGCAACG TTGGTTTGAT TGCAGCGGCG GCATTGCTGA TCGATTACAC CCTTACCGCG GGGGTGAGCT TGATGGCTGG CACTCAGGCT CTCTCCTCCC TAGTGCCATC GATGCTCGAT CACGAGGTTT CGCTCGCGTT GCTTTTGTTG GCGCTGATCG GCTGGGCCAA CCTGCGCGGC CTCAAGGAGA CTGGACGGCT GTTTTCTTTG CCCACCTATG CCTTTGTGGC GATGGTGGCT TTGTTGATCC TTGCCGGTTT GAAGGATCTG ATTTTTGAGC ATGGCTTCGT GCCGGATATG CCTCCAGCTG TGCAAGCCGT TCAGCCGTTG GGGTGGTTTT TGATCCTGCG GGCCTTCAGT TCAGGTTGTT CAGCGATGAC TGGTATTGAG TCCATTGCCA ATGGCGTGAA GGTTTTTCAG GAACCTGCGG TGGTCAATGC CAGACGCACG TTGTTGGTGA TGGGTGTGTT GCTAGCCGCC ATGTTTTTGG CGGTTAGTGG GCTGGGTTAC ATGTATGGCA TCGCTCCGAA TGATCGGGTC ACTGTGTTGG CTCAAATCGG TTCGCGGGCT TTTGGTTCGG GCAGTGTTCT GCTTTGGGCT CTGCAGCTTT CCACTCTCTT GATCCTTGTT TTGGCGGCCA ACACCGCCTT TGCAGGTTTC CCTCGTTTAG CGGCGATGTT GGCAGAAGAT CATTGCCTGC CAAGGCAGTT GAGCTGGATT GGTGATCGCT TGGTTTATCA GAACGGTATT GGCGTTCTTT TGTTGGTCAC GGCGCTGATC ATCGTGATCT GCAAAGGCGA TACCACTGTT GCTGTGAACC TTTATGCCCT GGGGGTCTTC ACTGCCTTCA CCCTGTCTCA GCTGGGATTG GTTCGTCGTT GGTGGCGGTT GCGGGGGAAT GGTTGGCAGG GTCGACTGTT GATGAATGCT CTTGGTGCAG TCACCACTTT TGTGGTGCTT GTGGTGATCG TGGTGAGCAA ATTCCAGGAG GGAGCCTGGA CTGTGGTGAT TACCATCCCT GCATTGGTAT GGGGTCTGGC GCAGATTCGA CGCCGCTATC GAAAGGCTTA TGCAGCGCTC GCTTTGGAGC CTGACTTTGG CCCATTGCAG GTGGCGCCTC GCCAACCTCC TTTGGGTAAT CACTGCATCG TTTGGATTCC GGGTTTGTGG CGTGCCTCCA TGGAAGCGTT GCGTTATGGA TGTTCCATTG CTGATTCCGT GACGGCTGTC TTTGTGCTCG GCGATGATGA TGATCCAGAC GCAATTCGTA CCGCCTGGGA CAGACTTGTT GGCGATCATC CCGGTGAACT CGAGCTCCGC CTTTTGGAGA GTCGCTTTAG TTCGGTGATT GATCCTTTTT GTGATTATGT CGTGGAGCAG GAAGAGCTTC ATCCGGAGCG CACGACCACT GTGGTGATGG CGCTAGTGAT CACTCGCGAT TGGCTGGATC AGACGCTTCT CAATCAGCGG GCTGTCTATT TGTTTAAGGC CCTGTCTGGC 
GACTACAGCC GAGTCTTTTG CGTGGTGCGT TATTACCTAG CGGGATAG
|
Protein sequence | MSFFQKLLGH PLLRNKADDE RLPNVQALPI LSSDALSSVA YATEAALGVL ILGGSGALGL SVPITLAIIA LVAIVVLSYR QAIEAYPKGG GSYVVARDNL GRNVGLIAAA ALLIDYTLTA GVSLMAGTQA LSSLVPSMLD HEVSLALLLL ALIGWANLRG LKETGRLFSL PTYAFVAMVA LLILAGLKDL IFEHGFVPDM PPAVQAVQPL GWFLILRAFS SGCSAMTGIE SIANGVKVFQ EPAVVNARRT LLVMGVLLAA MFLAVSGLGY MYGIAPNDRV TVLAQIGSRA FGSGSVLLWA LQLSTLLILV LAANTAFAGF PRLAAMLAED HCLPRQLSWI GDRLVYQNGI GVLLLVTALI IVICKGDTTV AVNLYALGVF TAFTLSQLGL VRRWWRLRGN GWQGRLLMNA LGAVTTFVVL VVIVVSKFQE GAWTVVITIP ALVWGLAQIR RRYRKAYAAL ALEPDFGPLQ VAPRQPPLGN HCIVWIPGLW RASMEALRYG CSIADSVTAV FVLGDDDDPD AIRTAWDRLV GDHPGELELR LLESRFSSVI DPFCDYVVEQ EELHPERTTT VVMALVITRD WLDQTLLNQR AVYLFKALSG DYSRVFCVVR YYLAG
|
| |
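The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes that Start bp and End bp are 1-based, inclusive positions and that Protein Length excludes the stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1885545
end_bp = 1887392
gene_length_bp = 1848
protein_length_aa = 615


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, ASSUMING the final codon
    is a stop codon that is not counted in the protein length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for this record: the span is 1848 bp, which is 616 codons, or 615 amino acids plus one stop codon.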