Gene Mlg_2296 details


Gene Information

Locus tag: Mlg_2296
Symbol:
ID: 4268394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2606998
End bp: 2608434
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 69%
IMG OID: 638127056
Product: hypothetical protein
Protein accession: YP_743128
Protein GI: 114321445
COG category:
COG ID:
TIGRFAM ID:
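
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2606998, 2608434)
print(length)                           # 1437
print(expected_protein_length(length))  # 478
```

Both values agree with the Gene Length and Protein Length fields of the record.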


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.357562
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.109764
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCACG ATCACCTATA CAGCCTGCGG CGCACCCATC CGGGGTGGCG CCTGTTGCAG 
GCCGGTAACG CGCCGATGGT CGTGGCCTTT CTGCACCGCT GTTTCGTGGT GCCCAACGTG
CGGGCGCTGC CGGCCTCGGA GCTGGAGAAT GCGCTGGAGG ACTACCTCTA CCACCTGCGC
GCCCGGTTGG GTGACGAGGC CTACCAGCAG GGCGCTCACG AGTACCTGGC ATATTGGGCT
GCCGATGAGC GCGGTTGGTT GCGCAAGTAC TACCCGCAGC ACAGCGACGA ACCCCACTAC
GACCTCACCC CGGCCACCGA GCAGGCGATC CAGTGGCTGG CGGGCCTGGA GCAGGCCCAC
TTTATCGGTG CGGAGTCGCG GCTCACCCTG GTGTTTGATC TGCTGAAACA GATCGTGGAG
GGGGCGGAGA CCGACCCGGA GGCCCGCCTG CGCGATCTGG AGGCGCGGCG CGACGCCATC
GAGCGCGAGA TCGACGAGGT GCGGGCCGGG CACCTGAACC TGATGGACCC CACCCGCCTG
CGCGAGCGTT TTCTGCAGAT GGCCGACACG GCCCGCGGGC TGCTGGGGGA CTTCCGCCAG
GTGGAGGCCA ACTTCCGCGC GCTGGACCGC CAGGTGCGCG AGCAGGTGGC CACCTGGGAG
GGCGGCAAGG GCGATATCCT CGACCAGGTC TTCGGCGAGC ACGACCGCAT CGCCGATTCC
GACCAGGGCC GCAGCTTCCG CGCCTTCTGG GATCTGCTCA TGTCCCCGGC TCGCCAGGAG
GAGTTGACCG AACTGCTCGA GCGCACCCTG GCGCTCGAGC CCGTGACCGA GGTGGCGCCC
GACCCGACCC TGGCCCGCAT CCATTACGAC TGGCTGGCGG CGGGCGAGCA CACCCAGCGG
GTGGTGGCGC GGCTCTCCGA GCAGTTGCGC CGCTATCTGG ACGATCAGGC CTGGCTGGAG
AACCGCCGCA TCATGGGGCT GATCCGCGAG TTGGAGCAGC AGGCCCTGCA CCTGCGGGAA
GCCCCGCCCC GGGATTTCAC CATGGCCCTG GACGAACCCG CGCCCCGGGT GGAGCTGCCC
ATGGAGCGCC CGCTCTTCAG TCCGCCGGTT ACGCCGCGCA TCGAGCAGCA GGTGCTGGAG
GCGGGCGAGG CGGAGGGCGA CGTGGCGGCG CTGTTCGACC AGGCCTACGT GGACCGCACC
CGGCTGCAAG GGCAGGTGCG CCGGGCGCTG CAGACCCGCG AGCAGATCAG CCTGGCGGCC
CTGTTGGAGC AACACCCGCT GGAGCAGGGC CTGGCCGAAC TGGTGGTCTA CCTGGCCCTG
GCCACAGAGG ATCACCGCAG CGTGATCGAT GAAACCCGCC AGCAGACCGT ATACTGGCGC
GACCGTGACG GCGTTGCCCG CAGCGCCACG CTACCGCAGG TTATCTTCTG CCGTTGA
 
Protein sequence
MDHDHLYSLR RTHPGWRLLQ AGNAPMVVAF LHRCFVVPNV RALPASELEN ALEDYLYHLR 
ARLGDEAYQQ GAHEYLAYWA ADERGWLRKY YPQHSDEPHY DLTPATEQAI QWLAGLEQAH
FIGAESRLTL VFDLLKQIVE GAETDPEARL RDLEARRDAI EREIDEVRAG HLNLMDPTRL
RERFLQMADT ARGLLGDFRQ VEANFRALDR QVREQVATWE GGKGDILDQV FGEHDRIADS
DQGRSFRAFW DLLMSPARQE ELTELLERTL ALEPVTEVAP DPTLARIHYD WLAAGEHTQR
VVARLSEQLR RYLDDQAWLE NRRIMGLIRE LEQQALHLRE APPRDFTMAL DEPAPRVELP
MERPLFSPPV TPRIEQQVLE AGEAEGDVAA LFDQAYVDRT RLQGQVRRAL QTREQISLAA
LLEQHPLEQG LAELVVYLAL ATEDHRSVID ETRQQTVYWR DRDGVARSAT LPQVIFCR
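
The gene and protein sequences can be checked against each other and against the GC content field. The following is a minimal sketch using only the Python standard library; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores the table's alternative start codons because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino acid assignments as the standard code; "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence from this record.
first_line = "ATGGATCACG ATCACCTATA CAGCCTGCGG CGCACCCATC CGGGGTGGCG CCTGTTGCAG"
print(translate(first_line))  # MDHDHLYSLRRTHPGWRLLQ
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared with the GC content and protein sequence listed in the record.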