Gene Mlg_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2072 
Symbol 
ID4270458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2349159 
End bp2350112 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content68% 
IMG OID638126828 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_742904 
Protein GI114321221 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.169091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGAT CCAGCCTGCT TGCCCTGCTC GCGCTGCTGG CCGCCGGCCT GACCGCGCTG 
CCCGCCCACG CCCGCGACCT GACCTTCGGC GGTGCCTCCA TCACCGGCGT CTACTATCAG
GTGGCCCAGC ACGGCTGCCG CCTGCTGGAG CAACACAAAC CGGAGTACAA CTGCGTGGGC
CGCCCCACCC AAGGCTCGGT GTTCAATATC AACGCCCTCT CTCAAGGCTC CATCGACTTT
GGCGTCGCCC AGTCCGACCG CGCCTGGCAG GCCATCAACG GCCAGGCCGA GTGGGAGCGC
CGGGGCGCCT TCGAAGGTCT GCGCAGCCTG TTCGCCATGC ACCCGGAGAC GGTCATGCTG
GTGGTACGGG CCGACAGCGA TATCCACGCC GTAGAGGACA TCACAGGCCA CACCATCAAC
GTCGGCAACC CCGGCTCCGG CCAGCGCCGT AACGCCATGG ACGTCCTGGA GATCTACGGC
ATCGACCCGC GCAGCGACAT CCGCGCCCGC AACCTGCAAC AGCACGAGGC CTCCCGCGCC
CTGGTCGATG GCCAGGTGGA CGGTTTCTTC TACACCGTGG GCAACCCCAG CGCCGCCATT
GAGGAGCCGG CCAACACGGT AGACATCCGC ATGATCCCGC TCGACTCAGA CGCCATCCGC
GCGTTCGTGG ACGAACGGCC CTACTACGTG ATGACCCGGA TACCCGCCGG CACCTACCCC
GGGGTGGACG AGGACATCGG GACTTATGCG GTCACCGCCA CCGTGGTCAC CCACGCCGAC
ATGGACGAGG CCGTGGCCTA CGACCTGACC GCCGCTGTCT TTGAACAGAT GGACGACCTG
CGCAACGCCC ACGCCGCCTT CCGCCATCTG GAGCCCGAGG CCATGATGGA GGGCGTCTCG
GTGGACCTCC ACCCCGGCGC CCTGCGCTAC TACGAAGAGC AGGGCTGGCG CTGA
 
Protein sequence
MRRSSLLALL ALLAAGLTAL PAHARDLTFG GASITGVYYQ VAQHGCRLLE QHKPEYNCVG 
RPTQGSVFNI NALSQGSIDF GVAQSDRAWQ AINGQAEWER RGAFEGLRSL FAMHPETVML
VVRADSDIHA VEDITGHTIN VGNPGSGQRR NAMDVLEIYG IDPRSDIRAR NLQQHEASRA
LVDGQVDGFF YTVGNPSAAI EEPANTVDIR MIPLDSDAIR AFVDERPYYV MTRIPAGTYP
GVDEDIGTYA VTATVVTHAD MDEAVAYDLT AAVFEQMDDL RNAHAAFRHL EPEAMMEGVS
VDLHPGALRY YEEQGWR