Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5996 |
Symbol | |
ID | 5320298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 953229 |
End bp | 954647 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640777672 |
Product | amino acid carrier protein |
Protein accession | YP_001314604 |
Protein GI | 150378009 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1115] Na+/alanine symporter |
TIGRFAM ID | [TIGR00835] amino acid carrier protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.188928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACGA TCATAGGCTT TCTGAACACC ATCCTCTGGG GCTATGTGCT CATCTATGGT TTGCTAGCCG TCGGCCTTTA TTTTACAATT CGCCTCGGCC TTCCTCAGAT CATCCACTTC GGAGAAATGT TCCGAGTGCT AAGCAGCGGC CAATCCAAGG ACCCCTCCGG CATCAGCCCC TTTCAGGCCT TGATGGTCAG CCTCGCCTCG CGCGTCGGCA CAGGCAATCT GGCCGGCGTC GCTGTCGCGC TCTACCTGGG CGGACCGGGC GCGACCTTCT GGATGTGGAT GGTAGCACTC GTCGGCATGG CGACGGCCTA TGCGGAAAGT GCGCTGGCGC AGCTCTACAA GGTGCGCAAT GAAGATGGCC AATATCGTGG CGGCCCGGCC TTCTACATCG CTCACGGGCT CAATGCCCCT TGGGCGGCGG CAATCTTCTC CGTCTGCCTC ATCATCTCCT TTGGTCTCGT TTTCAATGCC GTTCAGGCAA ACTCCATCGC TGATGCAGTC CAAGGAGCCT TCGGCGTTCC GAAGCTAATA GTCGGTTTCG GGCTCGCGGT GCTGTCCGGC GTCGTTATCT TTGGGGGCAT CCGACAGATT GCCCGCGTTG CTGAAATCAT CGTGCCCTTC ATGGCGGTCG CGTATCTGCT CATGGCGGTT TATGTGCTGA TCGCCAACGC GGCGCTGGTG CCTCACGTTC TCTGGACGAT AGTTTCGAGC GCCTTCGGCC TGCAGGAAGC AGGCGGTGGC GTCACAGGTG GAATTGCAGC GGCGATGCTA AACGGCGTCA AGCGCGGTCT TTTCTCCAAC GAGGCCGGCA TGGGTTCGGC TCCAAACATT GCGGCCGTTG CCACTCCTGT GCCGCATCAT CCTTCCTCGC AGGGCTTCGT TCAGTCTCTC GGTGTCTTCA TCGACACGAT TCTGATCTGC ACGGCGACAT CGGTGATGAT CCTGCTTTCG GGAACGCTTG ACCCCGGTTC CAACATCACC GGAACGCAGC TCACGCAAGC CGCAATGAAT GTCCACATTG GCGCCGCCGG AACCTATTTC ATCGCAATCG CGATATTCTT CTTCGCGTTC ACTTCAATAA TCGGCAACTA CTCCTACGCT GAAAACGCGT TGACTTTTCT TGGTGCCGGC AATCGCCTGG GTCTGACGAT CATGCGCTGC GCCACGCTAG TAATGGTGGT CTGGGGCGCC TACGAGAGCA TTGCTACCGT TTTCGACGCA GCCGATGCCT CGATGGGCCT GATGGCGACG ATCAATCTCA TCGCAATCCT GCTCCTCTCA GGCACCATCG CGAAATTGAC GAAGGACTAT TTTAAGCAGC GGAAGCAGGG TCTCGCACCG GTTTTCCACG CTGCGGATTA CCCGGAGCTG CAAGGCAAGA TCGACCACGG GATCTGGTCA CGCGTCTGA
|
Protein sequence | MDTIIGFLNT ILWGYVLIYG LLAVGLYFTI RLGLPQIIHF GEMFRVLSSG QSKDPSGISP FQALMVSLAS RVGTGNLAGV AVALYLGGPG ATFWMWMVAL VGMATAYAES ALAQLYKVRN EDGQYRGGPA FYIAHGLNAP WAAAIFSVCL IISFGLVFNA VQANSIADAV QGAFGVPKLI VGFGLAVLSG VVIFGGIRQI ARVAEIIVPF MAVAYLLMAV YVLIANAALV PHVLWTIVSS AFGLQEAGGG VTGGIAAAML NGVKRGLFSN EAGMGSAPNI AAVATPVPHH PSSQGFVQSL GVFIDTILIC TATSVMILLS GTLDPGSNIT GTQLTQAAMN VHIGAAGTYF IAIAIFFFAF TSIIGNYSYA ENALTFLGAG NRLGLTIMRC ATLVMVVWGA YESIATVFDA ADASMGLMAT INLIAILLLS GTIAKLTKDY FKQRKQGLAP VFHAADYPEL QGKIDHGIWS RV
|
| |