Gene Snas_4359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_4359 
Symbol 
ID8885560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4646381 
End bp4648225 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content71% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003513099 
Protein GI291301821 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0445663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0917232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCTC TGCGTTCCCG TACCTCGACC CACGGCCGGA CGATGGCCGG AGCCCGCGCC 
CTGTGGCGGG CCACCGGGAT GACCGACGAC GACTTCGGCA AGCCGATCGT GGCCATCGCC
AACAGCTACA CCCAGTTCGT GCCCGGCCAC GTCCACCTCA AGGACATGGG CGACATCGTC
GCCGAAGCGG TCAAGGCCGC CGGGGGAGTG TCGAAGGAGT TCCACACCAT CGCCGTCGAC
GACGGCATCG CCATGGGCCA CGGCGGGATG CTCTACTCGC TGCCCAGCCG CGAACTCATC
GCCGACTCGG TGGAGTACAT GGCCAACGCC CACTGCGCCG ACGCGCTGGT GTGCATCTCC
AACTGCGACA AGATCACCCC CGGCATGCTG CTGGCGGCGC TGCGGCTCAA CATCCCGACC
GTGTTCGTGT CCGGCGGCCC GATGGAGGCC GGCAAGACCG TGTCGGTGGA CGGCGTGGTG
CAGCGCAAGC TCGACCTGGT CGACGCGATG GTCGCCTCCG CCGACGAGGC CACCTCGGAC
GAGACGCTGG ACGGCATCGA GCGCTCGGCC TGCCCGACCT GCGGCTCGTG TTCGGGCATG
TTCACCGCCA ACTCGATGAA CTGCCTGACC GAGGCGATCG GACTGGCGTT GCCGGGCAAC
GGTTCCACGC TGGCCACCCA CGCCGCCCGC CGCGACCTGT TCACGCGGGC GGGCGCGTTG
ATCGTCGACC TGGCCAAGCG GTACTACGAC GGCGAGGACG AGTCGGTGCT GCCGCGCGCG
ATCGCGTCGC GCGAGGCCTT CGACAACGCG GTGGCCCTGG ACGTGGCCAT GGGCGGCTCC
ACCAACACGG TGCTGCACCT GCTGGCCGCC GCCCGGGAGG CCGAGCTGGA CTACACGGTG
ACCGACATCG ACGCGGTCTC GCGCCGGGTG CCGTGCCTGG CGAAGGTGGC ACCCAACTCG
CCCAAGTTCC ACATGGAGGA TGTACATCGC GCCGGAGGCA TTCCGGCGAT CATGGGCGAA
CTGCACCGCG GCGGCCTGCT GCACACCGGC GTCGGCTCGA TCCACAGCGA CTCGCTCGAC
GCCTGGCTGG CCACGTGGGA CATCCGCGCC GCCGACCCGG CCCCCGAGGC GGTGGAACTG
TTCCACGCCG CCCCCGGCGG GGTGCGCACC ACCCAGCCGT TCTCGACCGA GAACCGCTGG
TCCAGCCTCG ACACCGACGC GAAGGACGGC TGCATCCACT CGGTGGAGCA CGCCTACACC
GCCGACGGCG GCCTGTGCGT TCTGTTCGGC AACGTGGCCC CCGACGGCTG CGTCGTCAAG
ACCGCGGGCG TCCCGGAGAA CAGCCTCGTC TTCGCGGGCC CGGCCCGGGT CTTCGAGTCG
CAGGAGGACT GCGTCTCGGG CATCCTCAAC GGGGCGGTGC AGGCCGGAGA CGTCGTGGTC
ATCCGCTACG AGGGCCCGCG CGGCGGACCC GGGATGCAGG AGATGCTGCA CCCGACGTCC
TTCCTCAAGG GCAAGGGACT CGGTCCCGTC TGCGCGCTGA TCACCGACGG CCGGTTCTCC
GGCGGCACCT CGGGACTGTC CATCGGTCAC ATCTCGCCCG AGGCCGCCTC CGGCGGCCCG
ATCGGGCTGG TGGCCGACGG CGACGAGATC GCCATCGACA TCCCGGCCCG CTCCATCGAG
CTGCGGGTTT CCGACGAGGA GCTCACCCGG CGCCGGGTGG AGCAGGAGAA GCGCGACCAC
CCGTTCACGC CCGTGGACCG CCAGCGCCCG GTGACGGCCG CGCTGCGCGC CTACGCCTCC
ATGACCACAT CGGCCAGTGA CGGCGCGTAT CGCCAGGTTC TCTAA
 
Protein sequence
MPALRSRTST HGRTMAGARA LWRATGMTDD DFGKPIVAIA NSYTQFVPGH VHLKDMGDIV 
AEAVKAAGGV SKEFHTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMANA HCADALVCIS
NCDKITPGML LAALRLNIPT VFVSGGPMEA GKTVSVDGVV QRKLDLVDAM VASADEATSD
ETLDGIERSA CPTCGSCSGM FTANSMNCLT EAIGLALPGN GSTLATHAAR RDLFTRAGAL
IVDLAKRYYD GEDESVLPRA IASREAFDNA VALDVAMGGS TNTVLHLLAA AREAELDYTV
TDIDAVSRRV PCLAKVAPNS PKFHMEDVHR AGGIPAIMGE LHRGGLLHTG VGSIHSDSLD
AWLATWDIRA ADPAPEAVEL FHAAPGGVRT TQPFSTENRW SSLDTDAKDG CIHSVEHAYT
ADGGLCVLFG NVAPDGCVVK TAGVPENSLV FAGPARVFES QEDCVSGILN GAVQAGDVVV
IRYEGPRGGP GMQEMLHPTS FLKGKGLGPV CALITDGRFS GGTSGLSIGH ISPEAASGGP
IGLVADGDEI AIDIPARSIE LRVSDEELTR RRVEQEKRDH PFTPVDRQRP VTAALRAYAS
MTTSASDGAY RQVL