Gene Mlg_0601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0601 
Symbol 
ID4268480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp653585 
End bp654793 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content65% 
IMG OID638125348 
Productargininosuccinate synthase 
Protein accessionYP_741445 
Protein GI114319762 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000197496 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGACG TCAAGAAGGT GGTGCTCGCC TATTCCGGCG GTCTCGACAC CTCGGTCATC 
CTGCAATGGC TGCGCGAGAC CTACGACTGC GAGGTGGTGA CCTTCACCGC CGACCTGGGG
CAGGGCGAGG AGCTGGAGCC GGCACGCAAG AAGGCCGAGG CCTTTGGCAT CAAAGAGATC
TACATCGACG ATCTGCGCGA GGAGTTCGTC CGCGATTTCG TCTTCCCCAT GTTCCGGGCC
AACGCCATCT ACGAGGGTGA GTACCTGCTC GGCACCTCCA TCGCCCGGCC GCTGATCGCC
AAACGCCAGG TGGAGATCGC CCGCGAGACC GGCGCCGACG CCGTCTCCCA CGGGGCCACC
GGCAAGGGCA ACGACCAGGT GCGCTTCGAG CTCGGCTACT ACGGCCTGGA GCCGAACATT
AAGGTGATCG CCCCCTGGCG CGAATGGGAT CTCAACTCCC GCGAGAAGCT GCTGGCCTAC
GCCGAGAAGC ACGGCATCTC CATCGAGGGC AAGCAGTCCG GCGGCTCGCC CTACTCCATG
GACGCGAACC TGTTGCACAT CTCCTACGAG GGCGGGGTCC TGGAGGACAC TTGGACCGAG
TGCGAGGAGG CCATGTGGCG CTGGACGCGC TCGCCCGAGG CGGCCCCGGA CGAGGCCCAA
TATATCGACA TCGAGTTTCA GGGCGGCGAC CCGGTGAGTA TCGACGGCGA GAAGCTCAGC
CCCGCCGCGC TGCTGAGCCG GCTCAACGAC CTGGGCGCCA TGCACGGCGT TGGCCGGATC
GATATCGTCG AGAACCGCTA TGTGGGCATG AAGTCCCGCG GCTGCTACGA AACCCCGGGC
GGCACCATCC TGCTGCGCGC CCACCGGGCC ATCGAGTCCA TCACCCTGGA CCGCGAGAGC
GCCCACCTGA AGGACGAGGT GATGCCCAAG TACGCCGAGT TGATCTACAA CGGCTACTGG
TGGAGCCCGG AGCGCGAGGC CATGCAGGCG TTGATCGATG CCACCCAGCG CCGGGTCAAC
GGCGTGGTGC GGCTGAAGCT CTACAAGGGG AATGTCATTG TGGTGGGACG CGATTCCGCG
AACGATTCGC TGTTCGACCA GACCATTGCC ACCTTCGAGG ATGATCGCGG CGCCTACGAT
CAGAAGGACG CCGAGGGCTT TATCCGCCTC AACGCCCTGC GCCTGCGTAT CGCCCAGCGG
CGCGGCTGA
 
Protein sequence
MSDVKKVVLA YSGGLDTSVI LQWLRETYDC EVVTFTADLG QGEELEPARK KAEAFGIKEI 
YIDDLREEFV RDFVFPMFRA NAIYEGEYLL GTSIARPLIA KRQVEIARET GADAVSHGAT
GKGNDQVRFE LGYYGLEPNI KVIAPWREWD LNSREKLLAY AEKHGISIEG KQSGGSPYSM
DANLLHISYE GGVLEDTWTE CEEAMWRWTR SPEAAPDEAQ YIDIEFQGGD PVSIDGEKLS
PAALLSRLND LGAMHGVGRI DIVENRYVGM KSRGCYETPG GTILLRAHRA IESITLDRES
AHLKDEVMPK YAELIYNGYW WSPEREAMQA LIDATQRRVN GVVRLKLYKG NVIVVGRDSA
NDSLFDQTIA TFEDDRGAYD QKDAEGFIRL NALRLRIAQR RG