Gene Mlg_0624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0624 
Symbol 
ID4270606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp673717 
End bp674874 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID638125371 
Productaminotransferase, class V 
Protein accessionYP_741468 
Protein GI114319785 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.00292698 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGCAC CAACCCCGAT CTACCTGGAC TACAACGCCA GCACGCCCAT CGCCCCCGAG 
GTAGCCAAGG CCATGGCCCC CTACCTGCAG GAGGCCTATG GCAACCCCTC CGCCGGTCAC
TGGGCCGGCG GTCCGGCCCG TGAGGCGGTG GAGCAGGCGC GCCGGCAGGT GGCCCGGCTG
ATCGGGGCGG CGCCGGACGA GATCGTCTTC ACCAGCGGCG GCAGCGAGGC GAACAACCAC
GCCATCAAGG GCACCTGGTA CGCCACAGAG GGGCCATTTC ACATCATCAC CACGGCAGTG
GAACACCCGG CCACCCTGGT CCCCTGCCGT TTCCTTGAAT CCCTCGGTGC TAGCCTGACC
GTACTCCCCG TGGACCGATA TGGGCAGGTG AACCCGGACG CGGTCCAGGC GGCCATCACC
CCCGAGACCC GCCTGATCAG CGTCATGCAC GCCAACAACG AGGTGGGCAC CCTGCAGCCG
GTGGAGGCCA TCGGCCGCAT CGCCCGCGAC CACGGTGTCC GCTTCCACGT GGACGCGGCG
CAATCGGCCG GCAAGGTGCC CATCAACGTC CAGGCCATGG GCGTGGATCT GCTCTCGCTC
GCCGGCCACA AGTTCTACGG CCCCAAGGGC ATCGGCGCGC TCTACGTCCG TCGCGGCATC
GACCTGACGC CGCTGATCCA CGGCGCCGGT CACGAGGGGG GGCGCCGCGC CGGTACCGAG
AGCGCGCTGC TCGCCACGGG GCTGGGCACC GCCGCCGAGA AGGCGCGTGA CCTCAGCCCC
ATGGCTCGGG TTCAGGCGCT GCGCGACCGG CTCTGGACGG GCCTGAAGGG CCATTTCGGT
GATACCCTTT GCCTGAACGG TCACCCTCAG GCCCGTCTGC CCAACACCCT GAACGTCGCC
TTCGCCGACT GTGTAGGCGC GGCCATCCTG GACCGGCTCG ACGGTGTCGC CGCCTCCACG
GGCTCCGCCT GCCACGCCGG CTCTGTCACG CTCTCACCGG TGCTGGCCGC CATGGGGGTC
CCGGAGCGGG TGGGCATGGG CGCACTCCGG TTCAGCCTGG GCCGCTGGAC CACGGAACAG
GAGATCGACG AGGTTATCGC AAGGCTCGCC CGGGCGGTCC CCCAGGCCCG GGCCGCCACC
ACACAGGAAC CATCATGA
 
Protein sequence
MSAPTPIYLD YNASTPIAPE VAKAMAPYLQ EAYGNPSAGH WAGGPAREAV EQARRQVARL 
IGAAPDEIVF TSGGSEANNH AIKGTWYATE GPFHIITTAV EHPATLVPCR FLESLGASLT
VLPVDRYGQV NPDAVQAAIT PETRLISVMH ANNEVGTLQP VEAIGRIARD HGVRFHVDAA
QSAGKVPINV QAMGVDLLSL AGHKFYGPKG IGALYVRRGI DLTPLIHGAG HEGGRRAGTE
SALLATGLGT AAEKARDLSP MARVQALRDR LWTGLKGHFG DTLCLNGHPQ ARLPNTLNVA
FADCVGAAIL DRLDGVAAST GSACHAGSVT LSPVLAAMGV PERVGMGALR FSLGRWTTEQ
EIDEVIARLA RAVPQARAAT TQEPS