Gene Snas_5004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5004 
Symbol 
ID8886211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp5311293 
End bp5312852 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content70% 
IMG OID 
Productmalate synthase A 
Protein accessionYP_003513734 
Protein GI291302456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.522377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.192489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGAGC GGTTCGCCGA GATCCTCAGT CCGGAGGCCC TGGATTTCCT GGCCGTCCTG 
CATCGGGAGT TCGCCGGACG GCGCGCCGAG CTGCTGGCCG CGCGAGCCCG GCGGCAGGCC
GACTTCGACG CCGGGGCCGG GCTGGACTTC CTGGCCGAGA CCGCCGCGGT GCGCGACGAC
CCCGGCTGGC GGGTGGCCGA ACCGGCTCCC GGGCTGGTCG ACCGGCGGGT CGAGATCACC
GGGCCGACCG ACCGCAAGAT GACCGTCAAC GCCCTCAACT CGGGTGCCAA GGTGTGGCTG
GCCGACCTGG AGGACGCGAA CACTCCACTG TGGGAAAACG TTGTCGCGGG GCAGCTGAAC
CTGCGCGACG CCCTGGACCG GACCATCGAC TTCGACGCCG GGGGCAAGCA GTACCGGCTC
AAGCCCGCCG GGGAGCTGGC AACGATCGTG GTGCGGCCGC GCGGCTGGCA CCTGGACGAG
AAGCACCTGC TGGTGGACGG CGAGCGGATG TCGGGCTCGC TGGTCGACTT CGGGCTGTAC
TTCTTCCACT GCGCCCGCAG GCAACTGGAA CGCGGGCACG GCCCGTACTT CTACCTGGCG
AAACTGGAGA GCCACCTCGA GGCGCGGCTG TGGAACGACG TGTTCGTCCG GGCGCAGCAG
CTTGTGGACA TTCCACGCGG GACGATCCGC GCCACCGTCC TCATCGAGAC CGTCACGGCG
GCCTTCGAGA TGGAGGAGAT CCTGTACGAG CTGCGGGACC ATTCGGCCGG GCTCAACGCG
GGGCGTTGGG ACTACCTGTT CAGCATCATC AAGAAGTTCC GGGCCCGGGG CGCGGAGTTC
CTGCTGCCGG AGCGCAACGC GGTGACGATG ACGGCGCCGT TCATGCGCGC CTACACCGAA
CTGCTGGTGC GCACCTGCCA CAAACGCGGC GCCCACGCCA TCGGCGGGAT GGCCGCGTTC
ATCCCCAGCC GCCGCGATCC CCAGGTCAAC GCCACCGCGC TGGACAAGGT GCGCGCCGAC
AAGTCGCGCG AGTCCGGCGA CGGCTTCGAC GGCTCCTGGG TCGCGCACCC CGACCTGGTG
CCGATCTGCC GGGAGGAGTT CGACAAGGCG CTCGGCGAGA ACGCCAACCA GCTCAACCGG
TTGCGCGAGG AGGTCTCGGT GACCGCCGCC GACCTGCTCG CGGTGGACGC CGACCCGAAC
CAGATCACCC GCGAGGGGCT GCGCAACGAC ATCACCGTCG CGCTGCGGTA CCTGACGGCC
TGGCTGGGCG GCACCGGCGC GGTCGCGATC TTCAACCTGA TGGAGGACGC CGCCACCGCC
GAGATCTCGC GCTCGCAGGT GTGGCAGTGG GTGCACAACG GCGTCACGCT CGCCGACGGC
GCGACGGTGG ACGCCGAGCT CGTCGAGGGC ATCATCGCCG AGGAACTCAA GACCATCGCC
GCCGAACCCG GTGTCGACGC CGACCGGCTG GAACAGGCGA CGCGACTTTT CCGCGCGGTG
GCGCTCGACG ACGACTACGC CGAGTTCCTG ACGCTCCCCG CGTACGAGGA GATGCCCTGA
 
Protein sequence
MGERFAEILS PEALDFLAVL HREFAGRRAE LLAARARRQA DFDAGAGLDF LAETAAVRDD 
PGWRVAEPAP GLVDRRVEIT GPTDRKMTVN ALNSGAKVWL ADLEDANTPL WENVVAGQLN
LRDALDRTID FDAGGKQYRL KPAGELATIV VRPRGWHLDE KHLLVDGERM SGSLVDFGLY
FFHCARRQLE RGHGPYFYLA KLESHLEARL WNDVFVRAQQ LVDIPRGTIR ATVLIETVTA
AFEMEEILYE LRDHSAGLNA GRWDYLFSII KKFRARGAEF LLPERNAVTM TAPFMRAYTE
LLVRTCHKRG AHAIGGMAAF IPSRRDPQVN ATALDKVRAD KSRESGDGFD GSWVAHPDLV
PICREEFDKA LGENANQLNR LREEVSVTAA DLLAVDADPN QITREGLRND ITVALRYLTA
WLGGTGAVAI FNLMEDAATA EISRSQVWQW VHNGVTLADG ATVDAELVEG IIAEELKTIA
AEPGVDADRL EQATRLFRAV ALDDDYAEFL TLPAYEEMP