Gene Mlg_1720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1720 
Symbol 
ID4268969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1967321 
End bp1968619 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content65% 
IMG OID638126478 
Producthypothetical protein 
Protein accessionYP_742556 
Protein GI114320873 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.26756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.377885 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGC ACATCTTCGT AATCGGGCTG GACGACTTCA ACCTCGCTGA ACTGCAGACC 
GTCCGCAACG CCGGGGAGTA CACCTTCCAC GGCCTGGTGG ACTACGACAC CATGGTGTTG
CCCGAGTCCT ACCCGATGCC GGAGATCATG GCCGAGGCCC GCCGGACCCT CGCGGATGCC
CCCGCGGTGG ACGGCATCAT CGGCCACTGG GACTTTCCCA CCACCTCCAT GCTGCCCATC
CTGCGCCGGG AGCACGGCCT ACCCACACCC ACCCTGGAGA GCGTGCTCTA CTGTGAGAGC
AAGTACTGGA ATCGACTCGC CTGCGAGCAG GCGGTGCCCG AGTGCACGCC CGACTTCCAG
GGGCTGGACC CGTACAGCGA CGACCCGCTG GCCGACCTGG ACGTGGCCTA CCCCTTCTGG
CTCAAGCCCA CCGTGGCCTT CTCCTCCTAC CTGGGCTTCC GCATTGAGAA CGAGCAGCAG
TACCTGGACG CCATGGCCAC CATTCGCGAG CACATTCACG TGTTCGCCGA ACCGTTCGAC
TACATCGTCG AGCAGTGCCA GAACCGCGCC GCCCTGCCCG ACCGCGGCAG CGGCGCCACC
TGCATCGCCG AGGGGCTGAT CGGTGGTCGG CTCTGCACCC TGGAGGGGTA CGTGCACAAC
GGCGAGGTGG TGGTGTATGC GGTGGTGGAC TCGCTGCGGG CCGCCAATAA CGTGAGCTTT
TTCAGCTACC AGTACCCCTC CCAGCTCCCC GCCGGGGTCC GCAACCGCAT GATCGGACAC
GCGGAAAAGC TCCTTCACCA CATCGGGTTG GACCACACGC CGTTCAACAT GGAGTTCTTC
TGGGATGAGG CCATCGACAA GATCTGGCTG CTGGAGATCA ACGCGCGGAT CTCCAAGTCC
CACTGCCCGA TCTTCCAGAT CGCCACCGGG GCCTCCCACC ACGAGGTGGC CATCGACATC
GCCCTGGGCC GGCGGCCGGA CTTCCCCCGT CCGGAGGGCC GTTTCCCCAT GGCGGGCAAG
TTCATGCCCC GCGTGTTCGC CGACACAGTG GTGACCCGGG TGCCCTCCGA GGAGGAGATC
CAGGCGCTCA AGCGGGTCCA TCCGGAACTG ATCGTCCACA TCGCCATAGA GGAAGGGATG
CGGTTGTCGG AACTGCGCGC CCAGGACAGC TACAGCTTTG AGATCGGCGA TGTCTTCCTG
GGGGCGGCGG ACGAAGCCGA ATTGCACCAG AAGTTCCGCC ACATCATGCA GGCCCTGGAC
TTCCAGTTCG CTGACGTGGT GCCGACCAAC TACAGCTGA
 
Protein sequence
MTQHIFVIGL DDFNLAELQT VRNAGEYTFH GLVDYDTMVL PESYPMPEIM AEARRTLADA 
PAVDGIIGHW DFPTTSMLPI LRREHGLPTP TLESVLYCES KYWNRLACEQ AVPECTPDFQ
GLDPYSDDPL ADLDVAYPFW LKPTVAFSSY LGFRIENEQQ YLDAMATIRE HIHVFAEPFD
YIVEQCQNRA ALPDRGSGAT CIAEGLIGGR LCTLEGYVHN GEVVVYAVVD SLRAANNVSF
FSYQYPSQLP AGVRNRMIGH AEKLLHHIGL DHTPFNMEFF WDEAIDKIWL LEINARISKS
HCPIFQIATG ASHHEVAIDI ALGRRPDFPR PEGRFPMAGK FMPRVFADTV VTRVPSEEEI
QALKRVHPEL IVHIAIEEGM RLSELRAQDS YSFEIGDVFL GAADEAELHQ KFRHIMQALD
FQFADVVPTN YS