Gene Mlg_2355 details

Gene Information

Locus tag: Mlg_2355
Symbol:
ID: 4268453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2670639
End bp: 2672450
Gene Length: 1812 bp
Protein Length: 603 aa
Translation table: 11
GC content: 69%
IMG OID: 638127113
Product: ABC transporter related
Protein accession: YP_743185
Protein GI: 114321502
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
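
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of this database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(2670639, 2672450))  # 1812
print(protein_length(1812))           # 603
```

Under these assumptions the computed values agree with the Gene Length (1812 bp) and Protein Length (603 aa) fields of this record.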


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.916251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGA TACTGCGCAA AACCCTGGAC CTGCTCACCG CCCGCGAGAA GCGCCGTGGC 
GCGCTGGTGC TGGCGATGGT GGTGTGCATG GCGTTGCTGG AGACGGCGGG GGTGCTGTCG
GTGGTGCCCT TTCTGGCGGT GCTGGGCAAC CCGGAGGTGG TGCACAACCA GCCGTTGCTG
GCGGCCGCCT TCCGCTGGTC CGGGCTGGAG CGGGTGCAGG CGTTTCTGAT CCTGCTGGCG
CTGCTGGTCT TCCTGCTGCA GGTGGTGGCG GCGGGTTTCC GTATGCTCAC CCATTTCGTG
CTCAACCGCT ATATCGAGGG CCGCCGCCAC AGCCTCAGCC AGCGGCTGCT GGAGACCTAC
CTGCGCCAGC CCTACACCTT CTTTCTCAAC CGCAACAGCG CGGACATGAC CAAGAGCATC
CTCTCGGAGG TGGACATGTT CGTGCTGACG GTGATGCGCC CGCTGCTCTT CGCCACCGCC
TACGCCATTG TCGCGCTGGC GATGATCGCC CTGCTGCTGT TCATCAACCC GTTGCTGGCC
CTGGCGGTGG CGACCATCGT CGGCGGGCTG TACGCGCTGA TGTTCCTGTC GGTGCGCGGC
TGGCTGGGGC GGATCGGCCG CGAGCGCAGC CAGGCCAACC GCGAGCGCTT CGCCACCACC
AGCGAGGTGC TGGGCGGGAT CAAGGACATC AAGCTGCTGG GCCACGAGCA GGCCTATCTT
TCGCGCTTCC GGCCGGCTTC GGCGCGCTTC ACCCGCCACC TGGCCACCAG CGAGACGCTG
GCCCAGATCC CGCGCTTCGC CATCGAGACG GTGGCGCTGG GCGGGGTGTT GATTCTCACC
GTGGTGCTGA TGGCCACCCA CGGCGACGTG GGCGCCATGC TGCCCACCCT GGGGCTGTAC
GTATTCGCCG GTTACAAGCT GCTGCCGGCC ATGCAGCATA TCTATGCCGG GGTCAGTCGC
ATGCGCTTTT CCGGTCAACT GGTGGCGGAC ATCCACGACG ACCTGCGCGA GCGCCCGCGC
CTGGCGCCCA TCGACGGCGC ACCGCCGGCA CCGTTGCGGC CGCGGCGGGA GATCGCCCTG
GAGGGCATCG ATTTCACCTA CCCCCAGGCC GATACCCCGG CCCTGCAGGG GATCGACCTG
CACATCCCGG TGGGCCGGAC GGTGGGGGTG GTGGGCAGCT CGGGCGCGGG CAAGACCACG
CTGATCGATG TGCTGCTGGG GCTGCTGCTG CCCCAGGCGG GGCACATCCG GGTGGATGGC
ACCGCCATCG ATGACCGGCA GCGCCCGGCC TGGCGGCGGG CGTTGGGCTA CGTGCCCCAG
CATATCTTCC TGAGCGATGC CAGCGTGGCG GAGAACATCG CCCTGGGGGT GCCGCTGGAG
CGCATCGACC ACGCCGCGGT GGTGCGCTGC GCCCGGCTGG CGCACATCCA CGAGTTCGTC
TCGGGGTCGC TGCCCCGGGG TTACGACACC CCGGTGGGTG AGCGGGGGGT CCGGCTCTCC
GGTGGCCAGC GCCAGCGGCT GGGCATCGCC CGGGCGCTCT ACCGCGACCC GGCGATCCTG
GTCTTCGACG AGGCCACCAA CGCCCTGGAC AACGAGACCG AGCGGGAGGT GATGGCCGCG
CTCTACGGCC TGGCGCGCAG CAAGACCATC ATCATCATCG CCCACCGGCT CTCCACGGTG
GAACGCTGCG ACCACATCGT GATGCTGGAG CAGGGCCGGA TCATCGACAG TGGCACCTTC
GCCGAGCTGC TGCACAACCC GCGCTTCCGC CGTCTGGCCC AGGCCCGGCC GGGCGAGCCC
GCCACCGGAT GA
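
The GC content field of this record can be checked against the gene sequence above. The sketch below shows one way to do that, assuming GC content is defined as the fraction of G and C bases over the full sequence; the helpers `clean_sequence` and `gc_fraction` are hypothetical names, and only the first 10-base block of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First block of the gene sequence: 6 of its 10 bases are G or C.
print(gc_fraction("ATGCCCGAGA"))
```

Passing the complete gene sequence (all rows, with their spaces and line breaks) to `gc_fraction` gives the value to compare with the reported 69%.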
 
Protein sequence
MPEILRKTLD LLTAREKRRG ALVLAMVVCM ALLETAGVLS VVPFLAVLGN PEVVHNQPLL 
AAAFRWSGLE RVQAFLILLA LLVFLLQVVA AGFRMLTHFV LNRYIEGRRH SLSQRLLETY
LRQPYTFFLN RNSADMTKSI LSEVDMFVLT VMRPLLFATA YAIVALAMIA LLLFINPLLA
LAVATIVGGL YALMFLSVRG WLGRIGRERS QANRERFATT SEVLGGIKDI KLLGHEQAYL
SRFRPASARF TRHLATSETL AQIPRFAIET VALGGVLILT VVLMATHGDV GAMLPTLGLY
VFAGYKLLPA MQHIYAGVSR MRFSGQLVAD IHDDLRERPR LAPIDGAPPA PLRPRREIAL
EGIDFTYPQA DTPALQGIDL HIPVGRTVGV VGSSGAGKTT LIDVLLGLLL PQAGHIRVDG
TAIDDRQRPA WRRALGYVPQ HIFLSDASVA ENIALGVPLE RIDHAAVVRC ARLAHIHEFV
SGSLPRGYDT PVGERGVRLS GGQRQRLGIA RALYRDPAIL VFDEATNALD NETEREVMAA
LYGLARSKTI IIIAHRLSTV ERCDHIVMLE QGRIIDSGTF AELLHNPRFR RLAQARPGEP
ATG
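
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the gene sequence is given in the coding orientation and in frame. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop blocking spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGCCCGAGA TACTGCGCAA AACCCTGGAC"))  # MPEILRKTLD
# The final 12 bases give the last three residues, followed by the stop codon.
print(translate("GCCACCGGAT GA"))                     # ATG
```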