Gene Mlg_2113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2113 
Symbol 
ID4269363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2399620 
End bp2400714 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content66% 
IMG OID638126869 
Productaromatic hydrocarbon degradation membrane protein 
Protein accessionYP_742945 
Protein GI114321262 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.628404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.519362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCAGCT ATATCCTGCC CGAGTTCGAG TACGAATCGG ATGACCCCAC CGCCGGCAAC 
CCCTACCCCG GGGACCACTC CAGCAGGAGC CGGGAGGCCG CCTTCGTGCC GGTGGGTTAC
GCCGCCTGGG AACTGCGGGA CGATATCCGC ATGGGCGTCG GGGTGACGGT GCCCTACGGG
CTGGAGACCG ACTATGACCG CGACTGGATC GGCCGCTACG ACGCCATCAA CACGGAGTTG
CTGACCATCG ACATTAACCC CACCGTCGCC TGGCGGGTCA ATGAACAGTT CGCCGTGGCC
GCGGGGCTGT CCGCACAGTA CGCCGACGCC AGCCTCAGCA GCGCCATACC CGGTCAGGGG
ATGGACCCCT CGACCGACGG CAAGCTCGAC GTGGAGGGGG ACAATTGGGC CTATGGCTTC
AACCTGGGGG CCCTGTTCGA ACCGGTCGAG GGGACCCGCC TCGGGGTGGC CTACCGCTCG
CGCATCACCC ATGACCTTTC GGGGGATGCC GAGTACGACC CGGCGAACTT CGGCCCGGGC
ACCGCCGAGC CCCAGGAGGT CGGGGGGAGC GCCAAACTGC GCCTGCCCGA GACCCTCAGT
CTGGGCATCC ACCAGGCGAT CAACGACCGC TGGGCGGTGA TGGCCGACGC CACCTGGACC
CGCTGGAGCC GCTTCGATGA GCTACGGGTG GATTTCGACG AGGACATCAC CATCGGCACC
ACCCTGATGG GGCCGATCAC CTCCTCCGGC AGCGTCGACG ACTACAGCTG GGACGACACC
TGGTTCGTCG CCCTGGGCGC CACCTTCCGT CCCAACAACG AATGGGCACT GCGGGTCGGG
GTGGCCCACG ACGAAAGCCC GGTCAGCAAC TGCTGCCGCA CCCCGCGCAT CCCGGACGAG
GACCGCACCT GGCTGGCCTT CGGCGCCAGC TACCAGCCCA ATGACAACGT GAAACTGGAC
TTTGGCTACA CCTACATCTG GCTGGACGAC GCCGATATCG TGCTGAACGA CGACAACCCC
AATGTCCCGG ACGTGGAGGG CGAGTACGAA AGCTCTGTGC AGATCCTCAC CGCCTCATTC
AACTACCGGT TCTGA
 
Protein sequence
MGSYILPEFE YESDDPTAGN PYPGDHSSRS REAAFVPVGY AAWELRDDIR MGVGVTVPYG 
LETDYDRDWI GRYDAINTEL LTIDINPTVA WRVNEQFAVA AGLSAQYADA SLSSAIPGQG
MDPSTDGKLD VEGDNWAYGF NLGALFEPVE GTRLGVAYRS RITHDLSGDA EYDPANFGPG
TAEPQEVGGS AKLRLPETLS LGIHQAINDR WAVMADATWT RWSRFDELRV DFDEDITIGT
TLMGPITSSG SVDDYSWDDT WFVALGATFR PNNEWALRVG VAHDESPVSN CCRTPRIPDE
DRTWLAFGAS YQPNDNVKLD FGYTYIWLDD ADIVLNDDNP NVPDVEGEYE SSVQILTASF
NYRF