Gene Mlg_1243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1243 
Symbol 
ID4269027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1447898 
End bp1449118 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content71% 
IMG OID638125993 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_742082 
Protein GI114320399 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC AGGGCGATTC CCACCACGAC GATCCGCGCC ACCCGAGCCA CTGGGCGCCC 
GCCACCCGGG CGGTGCGGGC GGGACAGACC CGCGGCCTGG AGCAGGAGCA GAGCGAGCCC
ATCTACGCCA GTTCCAGCTT CACCTACCGT AGTGCTGCCG AGGCGGCGGC ACGTTTCTCC
GGCGAGAGCC CGGGGAATAT CTACTCGCGC TTTACCAACC CCACGGTCCG GACCTTCGAG
CAGCGGCTCG CCGCCCTGGA GGGGGCCGAG GCCTGCGTGG CGACCGCCTC CGGGATGTCG
GCGGTGCTGG CCGCTACCCT GGGGCTCCTG CGGGCCGGCG ACCATATTGT CGCCTCCAGC
GGCCTGTTCG GGGCCACGGC CTCGCTGTTC GCCAATTACC TCCCGCGCTA TGGGATCGAG
GTCACCACGG TCCCGCTCAC CGACCTCCAG GCCTGGTCGG ACGCCATGCG TCCGCAGACC
CGCATGCTCT TCCTGGAGAC GCCGTCCAAC CCGCTGACCG AGGTGGCGGA TATCGCCGCG
CTGGCGGACC TGGCCCGGGG CCAGGGGGCG TGGCTGGCGG TGGACAACTG CTTCTGCACC
CCGGCCCTGC AGCGGCCGCT GGAGCTCGGC GCGGATCTGG TCATTCACTC GGCGACCAAG
TATCTGGACG GTCAGGGGCG GTGCATCGGC GGGGCGGTGT GCGGCGATGC CCAGGTGGTG
GGCGAACAGG TTTTCGGCTT CCTGCGCACG GCCGGGCCGA GCATGAGCCC GTTCAACGCC
TGGGTGTTCC TCAAGGGGCT GGAGACCCTG GCCCTGCGGA TGGAGGCTCA CTGCCGGAGT
GCCTCGGAGC TGGCCCACTG GCTGGAGGAG CACCCGGCCG TGGAGCGGGT GTTCTATCCC
GGGCTCGCCC GCCATCCCCA GCACGCCCTG GCGGCGCGCC AGCAGTCCGC CTTTGGCGGT
ATCGTCAGCT TCGAGGTGCG GGGCGGGCGC GACGCGGCCT GGCGGGTGAT TGACAATACC
CGGTTGCTCT CCATCACCGC GAACCTGGGC GACGCCAAGA GCACCATCAC CCACCCGGCG
ACCACCACTC ACGGCCGCAA CACACCGGAG CAACGGGCGG CGGCGGGGAT TACGGAATCG
CTGGTCCGGG TGTCGGTGGG GCTGGAGGCG GTGGCGGATA TCCGCGCTGA TCTGGACGCG
GCGTTGTCGG CCCTGGCGTA G
 
Protein sequence
MSDQGDSHHD DPRHPSHWAP ATRAVRAGQT RGLEQEQSEP IYASSSFTYR SAAEAAARFS 
GESPGNIYSR FTNPTVRTFE QRLAALEGAE ACVATASGMS AVLAATLGLL RAGDHIVASS
GLFGATASLF ANYLPRYGIE VTTVPLTDLQ AWSDAMRPQT RMLFLETPSN PLTEVADIAA
LADLARGQGA WLAVDNCFCT PALQRPLELG ADLVIHSATK YLDGQGRCIG GAVCGDAQVV
GEQVFGFLRT AGPSMSPFNA WVFLKGLETL ALRMEAHCRS ASELAHWLEE HPAVERVFYP
GLARHPQHAL AARQQSAFGG IVSFEVRGGR DAAWRVIDNT RLLSITANLG DAKSTITHPA
TTTHGRNTPE QRAAAGITES LVRVSVGLEA VADIRADLDA ALSALA