Gene Mlg_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2003 
Symbol 
ID4270477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2270113 
End bp2272287 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content64% 
IMG OID638126759 
ProductNa+/solute symporter 
Protein accessionYP_742835 
Protein GI114321152 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.357117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCCA AGGCCATATG GCTTTTGGTT TTCGTCGGCC TCTATTGGGG GTACTGTATC 
TTCTGGGGCA TCAAGGGTGC CCTGGCGACG AAAACCGCCA GTGATTATTT CATTTCCGGG
CGGTCGGTGC CCATGTGGGT GTTCATACTC GCAGCCACCG CCACATCCTG GTCGGGGTGG
ACCTTTGTCG GTCACCCCGG CCTGCTTTAC ATGACCGGGC TGCAGTACGG TTTCATTGGG
CTGTACGCCA TCGGCATCCC CATCTCCGGT ATGTTGTTCC TCAAGCGGCA GTGGATGATC
GGGCGCCGCT GGGGGTTCGT CACGCCGGGA GAGATGTACG GGACCTACTT CCGCAGCAAT
GCCATCATTT GGCTGGTGGT CATCGTCGCC ACCATCTTCG CGATTCCCTA TCTGGGTATC
CAACTGCGGG CCTCCGGGTT CCTGTTCAAC ATCCTCACTG ACGGTGCCCT GGGCACCAAC
GTGGGGATGT GGGCGCTCTC TGCCATCGTG TTGTTCTACG TCGCCTCCGG CGGTCTTCGG
GCGGTGGCCT ACGTGGACGC CATGCAGTGT GTCCTGCTGC TGTTCGGCAT GACCGCGATC
AGCTTCGTGG CCATCAACTA CATGGGCTCC ATCGGCGAGC TGTCACGGGC GATCGCCGCG
GCGAGCCAGT GGGACCTGAT CACCGGTGGG CAGGAGGCTG GTCGACCGGG GCTTACGCCG
GCTGGGCACA GCGGCTACGT GGCGACACCC GGGGTGATCC AGTGGGTCAG CAACGTCGGG
GATGCGACCG GTGGTGCCTG GACCTCGGTG ATGGTGCTCA GCTACATGAT GAGCATGGCC
GGCATCATGG CCTCGCCCTC CTTCACCATG TGGGCCTTCT CCAACAAGGA CCCGCGGCCG
TTCGCCCCGC AGCAGACCTG GGCCTCGGCC CTGATCTCCG GTGCGGTGAT CGTGGTGCTG
CTGGCGGTGC AGGCCATGGC CGGTCACGGC CTGGGGGCCA ACACGGACCT GGCCCGCGAC
GTCCACAACC CGGCCTATGA GGAGACCCTG GGGCAGTACC GGGATCTGTA CGACAGCCGG
GAGGATGTGG TGCTGCAGCG TGGCCTGATC GCGACCTTCA ACCCGGAGTT GAGCCGCGGC
GAGGTCAACC AGATGATCGA TGACGGTCTG GTGGCCCTGC GTGCCGGGGA GGACCCGCGT
GAAGTGGCCG GTTGGGTGGA CCTGCGCCGC GAGGGTGGTG GTGACACCGG CCTGGTGCCG
CAGCTGATGG GCCTGTTGGA GGGGGTGGCC CCCTGGTTTG TGGGTCTGCT GGCGGTTTGT
GCGCTGGGTG CGTTCCAGTC CACGGGTGCG GCCTACATGT CCACGACATC CGGGATCTAT
ACCCGCGATG TGCTCCGGCG GTTCATCAAC CCCAATGTCA GTCACAACGT TCAGAAGCAA
GTGGGTCGCA TCGTCGTGGT CATCCTGGTG TTTGCGGCGT TGATGGTGGC GACCTTCACC
ACGGACGCCT TGGTGTTGCT GGGCGGTACC GCGGTGGCAC TGGGCCTGCA GATGTGGGTG
CCGCTGATCG CGGTCTGCTA CTGGTCGTGG CTGACCCGGC AGGGCGTGGT GGCTGGCCTG
ATCGTCGGTA TCCTGGCCGT GCTGTTCACC GATAACATGG GCCTGGCACT GGCCAGCACG
CTGGGGCTGG ATCTGCCCTG GGGCCGTTGG CCGCTCACCA TCCACTCCGG TGGTTGGGGT
ATTGTGCTGA ACATGGGTGT GGCCATTGCG GTGTCGGCGT TCACCCAGGA TGCCCGCGAG
ATGGAACACA AGGAGACGTT CCACAAGTGG CTGCGGGAGC ACGCCGGGGT GCCGCAGGAT
AAGCGCCGGC TCATCCCGGT GGCCTGGGGC ATCGTTGCGG TGTACTACAT CTTCGCCATC
GGCCCGGGCA ACATCATCGG GACCTACCTG TTCGGTAACC CGAGTGATCC GTCGACCTGG
TGGGTGTTCG GCTTCCCGTC CATCTACGTC TACCAGATCC TGTGCTGGCT GTTCGGCGTG
TTCATGATGT GGTTCCTTTG CTACAAGATG GAGATGAGCA CCGTGCCGAA GAAGGAGATC
GAGATCCTCT ACGACGAGGA TGCGGTCAGC AGCCCGGATG TGAGGCAACC AGAGCCTGCG
CCGGCCAAGA GCTGA
 
Protein sequence
MEPKAIWLLV FVGLYWGYCI FWGIKGALAT KTASDYFISG RSVPMWVFIL AATATSWSGW 
TFVGHPGLLY MTGLQYGFIG LYAIGIPISG MLFLKRQWMI GRRWGFVTPG EMYGTYFRSN
AIIWLVVIVA TIFAIPYLGI QLRASGFLFN ILTDGALGTN VGMWALSAIV LFYVASGGLR
AVAYVDAMQC VLLLFGMTAI SFVAINYMGS IGELSRAIAA ASQWDLITGG QEAGRPGLTP
AGHSGYVATP GVIQWVSNVG DATGGAWTSV MVLSYMMSMA GIMASPSFTM WAFSNKDPRP
FAPQQTWASA LISGAVIVVL LAVQAMAGHG LGANTDLARD VHNPAYEETL GQYRDLYDSR
EDVVLQRGLI ATFNPELSRG EVNQMIDDGL VALRAGEDPR EVAGWVDLRR EGGGDTGLVP
QLMGLLEGVA PWFVGLLAVC ALGAFQSTGA AYMSTTSGIY TRDVLRRFIN PNVSHNVQKQ
VGRIVVVILV FAALMVATFT TDALVLLGGT AVALGLQMWV PLIAVCYWSW LTRQGVVAGL
IVGILAVLFT DNMGLALAST LGLDLPWGRW PLTIHSGGWG IVLNMGVAIA VSAFTQDARE
MEHKETFHKW LREHAGVPQD KRRLIPVAWG IVAVYYIFAI GPGNIIGTYL FGNPSDPSTW
WVFGFPSIYV YQILCWLFGV FMMWFLCYKM EMSTVPKKEI EILYDEDAVS SPDVRQPEPA
PAKS