Gene Mlg_2347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2347 
Symbol 
ID4268445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2660335 
End bp2662101 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content69% 
IMG OID638127105 
ProductTrkA domain-containing protein 
Protein accessionYP_743177 
Protein GI114321494 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTTG ACGCCTGGCT TACCGCCGGT GTGGTCAGCG CCATGCTGGC CGCGCTCACC 
TTTACCCGCG CCGCCCCGGA CCTGGTGTTC GTGGGCGGCG TTGTCGTGTT ACTGCTCGCG
GGGGTGGTGA CACCGGCGGA TGCCTTTTCC GGTTTTTCCA ACCAGGGCGT GATCACGGTG
GCCGCCCTCT ACGTGGTGGT GGCCGGGCTG CGCGAGACCG GCGGCATTCA GTGGCTGGTG
CAGGCGGTGC TGGGCCAGCC GCGCTCACTC CCCCGGGCCC AGTGGCGGCT GACCGGGCCG
GCGGCTTTCT TCAGCGCCTT TCTCAATAAC ACGCCGGTGG TGGGCATGCT GATTCCGGCG
GTCAGCGACT GGGCGCGCAA GTTCGACCTG CCGGTCTCGC GGCTGATGCT GCCGCTGTCC
TATGCCGCCA TCCTGGGGGG CACCTGCACC CTGATCGGCA CCAGCACCAA CCTGGTGGTC
AACGGGCTGT TGCTGGACCG CACTGACGCG GGCCTGGGCC TGTTCGACAT CGCCTGGGTG
GGCCTGCCGG TGGCCCTGGT GGGGCTGGTC TTCATGCTGG TGTTCAACCG CTGGTTGTTC
CCGGACCGGC GGCCGGCGAT CAGCGAGATC GACGACCCTC GGGAGTACAC CCTGGAGATG
CTGGTGGACC CGAAGGGGCC GCTGGTGGGC AAGTCCATCG AGGAGGCGGG GCTGCGCCAC
CTGCCGGGCG GCTTTCTGGC CGAGCTGGAC CGGGACGGCA CCCTGCTGCC GGCGGTCTCG
CCACAGGAGG TGCTACGCGG TGGCGACCGG TTGATCTTCG TGGGGGTGGT GGAGTCGATG
GTGGACCTGC AGAAGATGCG CGGGCTCACC CCGGCCACCG ACCAGGTCTT CAAGCTGGAC
GGCCACCGGG CGGACCGGGC GCTGCTGGAG GTGGTGGTCT CGGACACCTG CCCGGTGGCC
GGCGAGACCG TCCGCGACGG TGAGTTCCGC AACCGCTACA ACGCGGTGGT GTTGGCGGTG
GCGCGTAACG GTGAGCGGGT GAAGGGCAAG GTCGGTGATA TCCGCCTGCG GCCGGGGGAT
ACGCTGCTGG TGGAGGCCGG CCCGGGCTTC GCGGCGCAGA ACCGCAACCG GCGCGATTTC
TTTCTGATGA GCCAGGTGCA GGACTCGGCC ACCCCGCGCC ATGAACGCGC CCTGCTGGCC
GGGCTGATCA TGCTGGCGAT GGTGGTGGTG GCCACCACCG GCGTGGTGTC GATGCTGGAG
GCGGCCCTGG CCGCCGCCGG GCTGATGGTG GTGACCCGGT GCGTGACCCT GGAGGGCGCC
CGGAGCAGCC TGGACTGGCC GGTCCTGATC ACCATCGCGG CGGCCTTCGG CGTGGGCGCG
GCGATGGACA ACACGGGCCT GGCCCACATC GTCGGCATGG GCCTGACCGG CCTGGCCGGC
GACACCCCGT GGCTGAACCT GGCGGCCATC TACCTGGTCA CGGCGGTGTT TACGGCGGTG
ATCACCAACA ACGCCGCCGC GGTGTTGATG TTCCCGGTGG CGTTTGCGGT GGCGGGGGAC
TTGGACGTGA GCGTGTTGCC CTTCGCCATC GGCATCATGC TCGCCGCCTC CGCCAGCTTC
GCCACGCCGA TGGGGTATCA GACCAACCTG ATGGTCTACG GGCCTGGCGG GTACCACTTC
GCCGACTACC TGCGTGCGGG TATTCCGCTT AACCTGGTGA CCGGGCTGGT GGCCTTGGCC
GTTATACCGC AAGTCTGGAC GTTCTAA
 
Protein sequence
MPFDAWLTAG VVSAMLAALT FTRAAPDLVF VGGVVVLLLA GVVTPADAFS GFSNQGVITV 
AALYVVVAGL RETGGIQWLV QAVLGQPRSL PRAQWRLTGP AAFFSAFLNN TPVVGMLIPA
VSDWARKFDL PVSRLMLPLS YAAILGGTCT LIGTSTNLVV NGLLLDRTDA GLGLFDIAWV
GLPVALVGLV FMLVFNRWLF PDRRPAISEI DDPREYTLEM LVDPKGPLVG KSIEEAGLRH
LPGGFLAELD RDGTLLPAVS PQEVLRGGDR LIFVGVVESM VDLQKMRGLT PATDQVFKLD
GHRADRALLE VVVSDTCPVA GETVRDGEFR NRYNAVVLAV ARNGERVKGK VGDIRLRPGD
TLLVEAGPGF AAQNRNRRDF FLMSQVQDSA TPRHERALLA GLIMLAMVVV ATTGVVSMLE
AALAAAGLMV VTRCVTLEGA RSSLDWPVLI TIAAAFGVGA AMDNTGLAHI VGMGLTGLAG
DTPWLNLAAI YLVTAVFTAV ITNNAAAVLM FPVAFAVAGD LDVSVLPFAI GIMLAASASF
ATPMGYQTNL MVYGPGGYHF ADYLRAGIPL NLVTGLVALA VIPQVWTF