Gene Information |
Locus tag | Mlg_2003 |
Symbol | |
ID | 4270477 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 2270113 |
End bp | 2272287 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638126759 |
Product | Na+/solute symporter |
Protein accession | YP_742835 |
Protein GI | 114321152 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
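The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The record does not state either convention explicitly, but both are consistent with its numbers. All variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2270113
end_bp = 2272287
gene_length_bp = 2175
protein_length_aa = 724

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. Assumption: the final
# codon is the stop codon, which contributes no residue to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 2175 0 724
```

Under these assumptions the span matches the listed gene length, and 725 codons minus the stop codon gives the listed 724 residues.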
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.357117 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCCA AGGCCATATG GCTTTTGGTT TTCGTCGGCC TCTATTGGGG GTACTGTATC TTCTGGGGCA TCAAGGGTGC CCTGGCGACG AAAACCGCCA GTGATTATTT CATTTCCGGG CGGTCGGTGC CCATGTGGGT GTTCATACTC GCAGCCACCG CCACATCCTG GTCGGGGTGG ACCTTTGTCG GTCACCCCGG CCTGCTTTAC ATGACCGGGC TGCAGTACGG TTTCATTGGG CTGTACGCCA TCGGCATCCC CATCTCCGGT ATGTTGTTCC TCAAGCGGCA GTGGATGATC GGGCGCCGCT GGGGGTTCGT CACGCCGGGA GAGATGTACG GGACCTACTT CCGCAGCAAT GCCATCATTT GGCTGGTGGT CATCGTCGCC ACCATCTTCG CGATTCCCTA TCTGGGTATC CAACTGCGGG CCTCCGGGTT CCTGTTCAAC ATCCTCACTG ACGGTGCCCT GGGCACCAAC GTGGGGATGT GGGCGCTCTC TGCCATCGTG TTGTTCTACG TCGCCTCCGG CGGTCTTCGG GCGGTGGCCT ACGTGGACGC CATGCAGTGT GTCCTGCTGC TGTTCGGCAT GACCGCGATC AGCTTCGTGG CCATCAACTA CATGGGCTCC ATCGGCGAGC TGTCACGGGC GATCGCCGCG GCGAGCCAGT GGGACCTGAT CACCGGTGGG CAGGAGGCTG GTCGACCGGG GCTTACGCCG GCTGGGCACA GCGGCTACGT GGCGACACCC GGGGTGATCC AGTGGGTCAG CAACGTCGGG GATGCGACCG GTGGTGCCTG GACCTCGGTG ATGGTGCTCA GCTACATGAT GAGCATGGCC GGCATCATGG CCTCGCCCTC CTTCACCATG TGGGCCTTCT CCAACAAGGA CCCGCGGCCG TTCGCCCCGC AGCAGACCTG GGCCTCGGCC CTGATCTCCG GTGCGGTGAT CGTGGTGCTG CTGGCGGTGC AGGCCATGGC CGGTCACGGC CTGGGGGCCA ACACGGACCT GGCCCGCGAC GTCCACAACC CGGCCTATGA GGAGACCCTG GGGCAGTACC GGGATCTGTA CGACAGCCGG GAGGATGTGG TGCTGCAGCG TGGCCTGATC GCGACCTTCA ACCCGGAGTT GAGCCGCGGC GAGGTCAACC AGATGATCGA TGACGGTCTG GTGGCCCTGC GTGCCGGGGA GGACCCGCGT GAAGTGGCCG GTTGGGTGGA CCTGCGCCGC GAGGGTGGTG GTGACACCGG CCTGGTGCCG CAGCTGATGG GCCTGTTGGA GGGGGTGGCC CCCTGGTTTG TGGGTCTGCT GGCGGTTTGT GCGCTGGGTG CGTTCCAGTC CACGGGTGCG GCCTACATGT CCACGACATC CGGGATCTAT ACCCGCGATG TGCTCCGGCG GTTCATCAAC CCCAATGTCA GTCACAACGT TCAGAAGCAA GTGGGTCGCA TCGTCGTGGT CATCCTGGTG TTTGCGGCGT TGATGGTGGC GACCTTCACC ACGGACGCCT TGGTGTTGCT GGGCGGTACC GCGGTGGCAC TGGGCCTGCA GATGTGGGTG CCGCTGATCG CGGTCTGCTA CTGGTCGTGG CTGACCCGGC AGGGCGTGGT GGCTGGCCTG ATCGTCGGTA TCCTGGCCGT GCTGTTCACC GATAACATGG GCCTGGCACT GGCCAGCACG CTGGGGCTGG ATCTGCCCTG GGGCCGTTGG CCGCTCACCA TCCACTCCGG TGGTTGGGGT ATTGTGCTGA ACATGGGTGT GGCCATTGCG GTGTCGGCGT TCACCCAGGA TGCCCGCGAG 
ATGGAACACA AGGAGACGTT CCACAAGTGG CTGCGGGAGC ACGCCGGGGT GCCGCAGGAT AAGCGCCGGC TCATCCCGGT GGCCTGGGGC ATCGTTGCGG TGTACTACAT CTTCGCCATC GGCCCGGGCA ACATCATCGG GACCTACCTG TTCGGTAACC CGAGTGATCC GTCGACCTGG TGGGTGTTCG GCTTCCCGTC CATCTACGTC TACCAGATCC TGTGCTGGCT GTTCGGCGTG TTCATGATGT GGTTCCTTTG CTACAAGATG GAGATGAGCA CCGTGCCGAA GAAGGAGATC GAGATCCTCT ACGACGAGGA TGCGGTCAGC AGCCCGGATG TGAGGCAACC AGAGCCTGCG CCGGCCAAGA GCTGA
|
Protein sequence | MEPKAIWLLV FVGLYWGYCI FWGIKGALAT KTASDYFISG RSVPMWVFIL AATATSWSGW TFVGHPGLLY MTGLQYGFIG LYAIGIPISG MLFLKRQWMI GRRWGFVTPG EMYGTYFRSN AIIWLVVIVA TIFAIPYLGI QLRASGFLFN ILTDGALGTN VGMWALSAIV LFYVASGGLR AVAYVDAMQC VLLLFGMTAI SFVAINYMGS IGELSRAIAA ASQWDLITGG QEAGRPGLTP AGHSGYVATP GVIQWVSNVG DATGGAWTSV MVLSYMMSMA GIMASPSFTM WAFSNKDPRP FAPQQTWASA LISGAVIVVL LAVQAMAGHG LGANTDLARD VHNPAYEETL GQYRDLYDSR EDVVLQRGLI ATFNPELSRG EVNQMIDDGL VALRAGEDPR EVAGWVDLRR EGGGDTGLVP QLMGLLEGVA PWFVGLLAVC ALGAFQSTGA AYMSTTSGIY TRDVLRRFIN PNVSHNVQKQ VGRIVVVILV FAALMVATFT TDALVLLGGT AVALGLQMWV PLIAVCYWSW LTRQGVVAGL IVGILAVLFT DNMGLALAST LGLDLPWGRW PLTIHSGGWG IVLNMGVAIA VSAFTQDARE MEHKETFHKW LREHAGVPQD KRRLIPVAWG IVAVYYIFAI GPGNIIGTYL FGNPSDPSTW WVFGFPSIYV YQILCWLFGV FMMWFLCYKM EMSTVPKKEI EILYDEDAVS SPDVRQPEPA PAKS
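As a quick consistency check between the two sequence fields, the following sketch translates the first 30 and last 15 bases of the gene sequence and compares them with the ends of the protein sequence. It is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns sense codons the same way as the standard genetic code (the two differ in which start codons are permitted), and the helper names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-residue mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases, copied from the gene sequence in this record.
gene_start = "ATGGAACCCA AGGCCATATG GCTTTTGGTT"
gene_end = "CCGGCCAAGA GCTGA"

print(translate(gene_start))  # MEPKAIWLLV
print(translate(gene_end))    # PAKS*
```

The first ten residues and the final four residues agree with the protein sequence above, and the gene ends in a TGA stop codon (shown as `*`).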
|
| |