Gene Mlg_1718 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1718 
Symbol 
ID4268967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1964410 
End bp1965426 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content69% 
IMG OID638126476 
Productcysteine synthase A 
Protein accessionYP_742554 
Protein GI114320871 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.858313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.306784 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTGT GCGACGGTTT TGCCGGGGCG GTGGGCAACA CCCCACTGAT ACGCCTCAAG 
CGGCTTTCCG AGGAGACCGG CTGCGAGATC CTGGGCAAGG CCGAGTTCAT GAACCCGGGC
GGGTCGGTGA AGGACCGCGC CGCGCTCTAC ATCCTCCTGG ATGCCGAACG GCGCGGCCTG
CTGCGCCCCG GTGGCACGGT GGTGGAGGGG ACCGCGGGCA ATACCGGGAT CGGCTTGGCC
CACCTGTGCA ACGCCCGCGG CTATCGCTGT GTGATCGTCA TTCCCGAAAC CCAGACCCAG
GAGAAGATCG ACCTCCTGCG CACCCTGGGC GCGGCGGTGC ACACCGTACC GGCCCGGCCC
TACAAGGACC CGAACAACTA CCAGAAGGTG GCCGGCCGCA TGGCCGAGGA GATGGACAAC
GCGATCTGGG CCAACCAGTT CGACAACACC GCCAACCGCC AGGCCCACTA CGAGACCACC
GGGCCCGAGA TCTGGTCGCA GACCGGAGGG CGGCTGGACG GTTTCGTTGC GGCCACCGGC
ACCGGCGGCA CCCTGGCCGG CGTAGCCCGC TACCTCAAGG CGCAGTCGCC GGATGTGCGC
TGCGTACTGG CCGACCCGCA CGGCAGCGCG CTCTACGGCT ACGTCAAGAC CGGTACGCCG
GAGGTGACCG GCAGCGGCTC CATCACCGAG GGCATCGGCA GCACGCGGGT CACCGCCAAT
CTGGAGGGGA CGCCCATCGA CGACGCCTAC AGCATCCCCG ACCAGGAGGC CGTGGATCGG
GTCTATCAGG CGCTCTACCA GGAGGGGCTG TTCCTGGGCA GCTCCTCCGG CATCAACCTG
GCTGCCGCTG TGCGCCTGGC CCGCGAGCTG GGGCCGGGGC ACACCATCGC CACCATCCTC
TGCGACGGGG GCGCGCGCTA CTACTCGCGG CTGTTCAACC CGGAATGGTT GCGCGAGAAG
GGGCTGACCC CGGAGACTAA CCGGGAAACA GGCAGGGATG ACCCGGAGGC CCCTTGA
 
Protein sequence
MTVCDGFAGA VGNTPLIRLK RLSEETGCEI LGKAEFMNPG GSVKDRAALY ILLDAERRGL 
LRPGGTVVEG TAGNTGIGLA HLCNARGYRC VIVIPETQTQ EKIDLLRTLG AAVHTVPARP
YKDPNNYQKV AGRMAEEMDN AIWANQFDNT ANRQAHYETT GPEIWSQTGG RLDGFVAATG
TGGTLAGVAR YLKAQSPDVR CVLADPHGSA LYGYVKTGTP EVTGSGSITE GIGSTRVTAN
LEGTPIDDAY SIPDQEAVDR VYQALYQEGL FLGSSSGINL AAAVRLAREL GPGHTIATIL
CDGGARYYSR LFNPEWLREK GLTPETNRET GRDDPEAP