Gene Mlg_0188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0188 
Symbol 
ID4268630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp217984 
End bp219318 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content67% 
IMG OID638124912 
Productbranched-chain amino acid ABC transporter, periplasmic amino acid-binding protein, putative 
Protein accessionYP_741033 
Protein GI114319350 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAA CGCAAACGCG TACCAACCCC GTCGCCGGCC TGCGCCGCAC CGCCATCGCG 
GCGGGCGTCA TGGCTGCCGT CGGTCTGGGG GGGCCGGCGG TGGCCGACGA GGGGCCGATC
AAAGTGGGCA TCCTCCACTC CCTGTCCGGC ACCATGGCCA TCAGCGAGAC CTCGCTGCGG
GACGTGGCGT TGATGACCAT CCAGCAGATC AACGAGCAGG GCGGCCTGCT GGGCCGGGAG
CTGGAGCCGG TGGTCATGGA CCCCGCCTCC GACTGGCCCC GCTACGCCGA GCAGGGCCGC
GAGCTGCTGG AGCGCCACGA GGTGGACGTC ATCTTCGGCT CCTGGACCTC CGTCTCCCGC
GAGGCGGTGC TGCCGGTGCT GGAAGAGCTG AACGGCCTGA TGTTCTATCC GGTGCAGTAC
GAGGGCGAGG AGTCCTCCCG CAACATCTTC TACACCGGCG CGGCCCCGAA CCAGCAGACC
ATCCCCGCCG TGGAATACCT GATGAGCCCA GAGGGCGGCG GCGCCGAGCG CTTCTACCTG
GTGGGCACCG ACTACGTCTT CCCGCGCACC ACCAACCGCA TCGTGCGCGC CTTCCTCAAT
CACCACGGGG TCAGCGACGA CGATATCGAA GAGGTTTACT TCCCCTTCGA GCACAGCGAC
TTCCAGTCCC TGGTCGGTGA TATCCGTAGC TTCGCCGACG GCGGCCCCAC CGCGGTGATC
AACACCGTCA ACGGCGACTC CAACGTGGCC TTCTACCAGG AGCTGGCCAA CCAGGGCATC
GACGCCATCG ACATCCCGGT GATGGCCACC TCCGTCGGCG AGGAAGAACT GCGCGGCATG
GACACCGGCC CTCTGGTGGG CCACCTGGCC GCCTGGAACT ACTTCATGTC CATCGATACC
CCGGAGAACG AGACGTTCGT TTCCACCTGG ATGGACTACG TGGAGGCCGA GGGCCTGAGC
GGTGGCAGTG ACCGGGTCAC CAACGACCCC ATGGAGGCCA CCCACATCGG CATCCGCATG
TGGGCCCAGG CGGTGCTGCA GGCCGGTACC ACCGACGTGG ACGCGGTGCG CCAGGCGGTC
TACGGCCAGT GCGTGGACGC CCCCTCCGGT TTCGAGATCT GCATGGACGA GGAGAACCAC
CACCTGCACA AGCCGGTGAT CATCGGCGAG ATCCAGCCCG ACGGCCAGTT CGCCCCGGTC
TGGGAGACCG ACGGTCCGGT GCGCGCGGAG CCCTGGAGCG AGTACCTGGA GGACAGCCGG
GACAAGGTCG CCAACTGGCG TTATCCCTGG GTCTGCGGTG ACTGCACCGA GCCCACCTAC
GAGCTGGACT TCTGA
 
Protein sequence
MSKTQTRTNP VAGLRRTAIA AGVMAAVGLG GPAVADEGPI KVGILHSLSG TMAISETSLR 
DVALMTIQQI NEQGGLLGRE LEPVVMDPAS DWPRYAEQGR ELLERHEVDV IFGSWTSVSR
EAVLPVLEEL NGLMFYPVQY EGEESSRNIF YTGAAPNQQT IPAVEYLMSP EGGGAERFYL
VGTDYVFPRT TNRIVRAFLN HHGVSDDDIE EVYFPFEHSD FQSLVGDIRS FADGGPTAVI
NTVNGDSNVA FYQELANQGI DAIDIPVMAT SVGEEELRGM DTGPLVGHLA AWNYFMSIDT
PENETFVSTW MDYVEAEGLS GGSDRVTNDP MEATHIGIRM WAQAVLQAGT TDVDAVRQAV
YGQCVDAPSG FEICMDEENH HLHKPVIIGE IQPDGQFAPV WETDGPVRAE PWSEYLEDSR
DKVANWRYPW VCGDCTEPTY ELDF