Gene Snas_5053 details

Gene Information

Locus tag: Snas_5053
Symbol:
ID: 8886260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5360708
End bp: 5364247
Gene Length: 3540 bp
Protein Length: 1179 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: DNA polymerase III subunit alpha
Protein accession: YP_003513783
Protein GI: 291302505
COG category:
COG ID:
TIGRFAM ID:
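
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; 1-based inclusive coordinates are an assumption.
start_bp = 5360708
end_bp = 5364247

# An inclusive interval spans end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be the stop.
assert gene_length % 3 == 0
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported Gene Length (3540 bp) and Protein Length (1179 aa).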


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.384157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.354735
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGAAGG ATTTCGTCCA CCTTCACGTT CACACGGAGT ATTCGATGCT GGACGGGGCC 
GCCAAGATGC GGCCCATGTT CGCCGAGGTG GAACGGCTGG GGATGTCGTC CATCGCCATG
ACCGACCACG GCAACATGTA CGGCGCCTAC GAGTTCTACC AGGAGGCCAA GAAGACCGGC
ATCAAGCCGA TCATCGGCAT CGAGGCCTAC CTGGCGCCCG GCGACCGGGC CCACAAGAAG
CCGGTGCGGT GGGGGCGCCC GGACCAGAAG AGCGACGACG TGTCCGGTGG TGGCGCCTAC
ACCCACATGA CGATGTGGGC GGCCAACGGC AGCGGCCTGC GGAACCTGCT GAAGCTGTCG
AGCCTGGCGT CCTTCGAGGG CCAGTACCAG AAGCCGCGCA TGGACGCCGA GCTGTTGTCC
AAGTACGCCG AGGGCATCAT CGCCACCACC GGCTGCCCGT CGGGTGAGGT GCAGACCCGG
CTGCGGCTGG ACCAGCCGGA CGAGGCCCTG GCAGCGGCGG CCAAGTACCA GGACATCTTC
GGCAAGGACA ACTTCTTCCT GGAACTGATG GACCACGGCC TCGACATCGA GACCCGGGTC
CGCGAGGGCC TGCTGGACAT CAGCAAGAAG CTGGGCCTGC CGCTGCTGGC CACCAACGAC
TCCCACTACG TGCACCAGCA CGACTCCGAC AACCACGCCG CGCTGCTGTG CGTCCAGTCC
GGCAAGACCC TCACCGACCC GAACCGGTTC GCGTTCTCCG GCGACGGCTA CTACATCAAG
TCGCCCGCCG AGATGCGGGA ACTGTGGAGC GAACTGCCCG AGGCCTGCGA CAACACGCTG
CTGATCGCCG AGCGGATCGA GTCCTACGAG GAGGTCTTCG CCGAGGTCGA CCGGATGCCC
CGGTTCCCGC TGAAGGAAGG CGAGACCGAG GACATCCTGC TGCGCCGCGA CGTGGAGAAG
TACACCCCCA ACCGCTTCCC CGACGGTCTG ACCCAGGAGT ACAAGGACCG CATCGACCGG
GAACTGGGCG TCATGTCCGC GATGGGTTTC TCCGGCTACT TCCTCGTCGT CGGCGACCTG
GTGCGGTGGG CGAAGTCGCA GAAGATCCCC ACCGGGCCCG GACGTGGTTC GGCCACCGGT
TCGCTGGTGG CCTACATCCT GCAGATCACC GACCTGGACC CGATCGAGCA CTCGCTGATC
TTCGAACGGT TCCTCAACCC CGAGCGGGTC TCGCCCCCCG ACATCGACCT CGACTTCGAC
GAGCGTCGGC GCGGCGAGGT CATGCAGTAC ACCGTCGAGA AGTGGGGCGA GGAGAACGTC
GCCCAGGTCA TCACCTTCGG CACCATCAAG ACCAAGGCGG CCCTGAAGGA CGCGGCCCGC
GCGCACCTCG GCCAGCCGGG TTTCGCGGTG GCCGAGCGGA TCGCCAAGGC GCTGCCGCCG
CCGGTCGCCG CCCAGGACAT TCCGCTGTCG GGCATCGTCG ACCCGAACCA CGAGCGCTAC
AACGAGGCCG CCGAGGTGCG GGCCCTGGTG GAGAACGAAC CGGAGGTCAA GCAGATCTTC
GACACCGCGC GCGGCCTGGA GGGGTTGATC CGCAACGCGG GTGTGCACGC GTGCGCGGTC
ATCATCTCCA GCGTCCCGCT GCTGGGGCAG GTGCCACTGT GGATGCGGCC CGACGGCTCG
GTCATCACCG GCTGGGACTA TCCGTCCTGT GAGGCCATGG GCCTGCTGAA GATGGACTTC
CTGGGGCTGC GCAACCTCAC GGTCATCGGT GACGCCATCG ACAACGTCAA ACGCAACCAC
GGCGTGGAGC TGAGCACCGA GACGATCACG CTGGACGACC CCAAGACCTT CGAACTGCTG
TGCCGCGGTG AGAGCGCCGG GGTGTTCCAG TTCGAGGGCG CGGGCATGCA GGACCTGCTG
AAGCGGATGC AGCCCAAGAA GTTCGGCGAC ATCGCCGCCA TCTCGGCCCT GTACCGTCCC
GGCCCGATGG CCGCCAACGC GCACCTCAAC TACGCCGAGC GCGCCAACGG GCGGCAGTCG
CCCGAGCCGA TCCACCCCGA GCTGAAGGAC GCGCTGGAGC CGATCCTCGG CGAGACCTTC
CACCTGCTGG TCTACCAAGA GCAGGTCATG GCGATCGCGC AGCAGCTGGC CGGGTACACC
CTGGGTGGCG CCGACCTGCT GCGTCGCGCC ATGGGTAAGA AGAAGAAGGA GATCATCGAG
AAGGAGTTCG AGAAGTTCTC CAACGGCATG ACGGCCAACG GGTACTCCAT GGAGGCGTGC
CAGACGTTGT GGGACGTCAT GCTCCCGTTC GCCGGTTACG CCTTCAACAA GTCGCACACC
GCCGGATACG GCCTGGTCAC CTACTGGACC GCCTACCTCA AGGCGAACTA CCCGGCCGAG
TACATGGCCG CGCTGTTGAC CTCGGTGGGC GACAACAAGG ACAAGTCCGC CCTGTATCTG
GCCGACTGCC GCAAGCTCGG CATCAAGGTG CTGCCGCCCG ACGTCAACGA GTCGCGGCGC
AACTTCTCGG CCGTCGGCGG TGACATCCGC TTCGGGCTGG GCGCGATCCG CAACGTCGGC
ACCGGCGTGG TCGACTCGAT CGTGGCCACC CGCGAGGCCA AGGGCAACTA CGAGTCGTTC
TCGGACTTCC TGCAGAAGTC GGAACTGCCG GTGTGCAACA AGCGGGTCAT CGAGTCGCTG
ATCAAGGCGG GTGCCTTCGA CTCGCTGAAG CACTCGCGCA AGGCGCTGTG CGAACGCCAC
GAGGTGCTGG TCGAGTCCAT CGTGGGCGTC AAGCGCAAGG AGGCCGAGGG CCAGTTCGAC
CTGTTCGGGG GCATGCTGAC CCCGGATCAG CCGGACGCGG CCACTCCCGG CGGCGACACC
GACTTCTCCG GCGACGACTG GCCGCGCAAG ACCACCTTGG AGTTCGAGCG GGAGATGCTG
GGGCTGTACG TCTCCGGGCA CCCGCTGGAG GGCGCGGAGA TGATCCTGCG CAAGAACTCC
GAGAACCGGA TCGCCGACCT GCTCACCTCG GACATCCCCG ACGGTACCTC GGTGACGATC
GCGGGGATCA TCTCCAGTCT GGAGCGTCGG GTCACCAAGC AGGGCAAGCC GTGGGCCAAG
GCGACGGTGG AGGACCTGGA CGCGGCCATC GAGTGCCTGT TCTTCCCCAA GACCTACGAG
TTCGCCGGGC CGCAGCTGGC CCAGGACCTG GTGGTGGCGG TGCGCGGCAA GCTGAACCGC
CGCGACGGCG AGATCTCGAT CGTGGCCATG GATCTGGCGC CGCTGGAGAT CAACGAGTCC
GATCTGGTCA ACGAGCCGAC CCTGACCCTC AACGTGCAGC TGGCCAAGGT CAACGAGCGG
GTCCTCGACG AGCTCAAGGC GGTGTTGCAG GGCAACCGGG GCGACATGGC GGTGCGGGTG
AAACTGTGCG CCCCCAACGC CGAGACGCTG CTGGCCCTCG ACGAGCGGTA CAAGGTGGCC
TCGGGTCCGG GACTGACCTC GGAGCTGAAG AGCCTGCTGG GTGCCGAGTG CCTGGACTGA
 
Protein sequence
MSKDFVHLHV HTEYSMLDGA AKMRPMFAEV ERLGMSSIAM TDHGNMYGAY EFYQEAKKTG 
IKPIIGIEAY LAPGDRAHKK PVRWGRPDQK SDDVSGGGAY THMTMWAANG SGLRNLLKLS
SLASFEGQYQ KPRMDAELLS KYAEGIIATT GCPSGEVQTR LRLDQPDEAL AAAAKYQDIF
GKDNFFLELM DHGLDIETRV REGLLDISKK LGLPLLATND SHYVHQHDSD NHAALLCVQS
GKTLTDPNRF AFSGDGYYIK SPAEMRELWS ELPEACDNTL LIAERIESYE EVFAEVDRMP
RFPLKEGETE DILLRRDVEK YTPNRFPDGL TQEYKDRIDR ELGVMSAMGF SGYFLVVGDL
VRWAKSQKIP TGPGRGSATG SLVAYILQIT DLDPIEHSLI FERFLNPERV SPPDIDLDFD
ERRRGEVMQY TVEKWGEENV AQVITFGTIK TKAALKDAAR AHLGQPGFAV AERIAKALPP
PVAAQDIPLS GIVDPNHERY NEAAEVRALV ENEPEVKQIF DTARGLEGLI RNAGVHACAV
IISSVPLLGQ VPLWMRPDGS VITGWDYPSC EAMGLLKMDF LGLRNLTVIG DAIDNVKRNH
GVELSTETIT LDDPKTFELL CRGESAGVFQ FEGAGMQDLL KRMQPKKFGD IAAISALYRP
GPMAANAHLN YAERANGRQS PEPIHPELKD ALEPILGETF HLLVYQEQVM AIAQQLAGYT
LGGADLLRRA MGKKKKEIIE KEFEKFSNGM TANGYSMEAC QTLWDVMLPF AGYAFNKSHT
AGYGLVTYWT AYLKANYPAE YMAALLTSVG DNKDKSALYL ADCRKLGIKV LPPDVNESRR
NFSAVGGDIR FGLGAIRNVG TGVVDSIVAT REAKGNYESF SDFLQKSELP VCNKRVIESL
IKAGAFDSLK HSRKALCERH EVLVESIVGV KRKEAEGQFD LFGGMLTPDQ PDAATPGGDT
DFSGDDWPRK TTLEFEREML GLYVSGHPLE GAEMILRKNS ENRIADLLTS DIPDGTSVTI
AGIISSLERR VTKQGKPWAK ATVEDLDAAI ECLFFPKTYE FAGPQLAQDL VVAVRGKLNR
RDGEISIVAM DLAPLEINES DLVNEPTLTL NVQLAKVNER VLDELKAVLQ GNRGDMAVRV
KLCAPNAETL LALDERYKVA SGPGLTSELK SLLGAECLD
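
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the usual TCAG-ordered codon table, whose amino-acid assignments translation table 11 shares with the standard code; table 11's alternative start codons are not handled, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTCGAAGG ATTTCGTCCA CCTTCACGTT"
print(translate(first_blocks))  # MSKDFVHLHV
```

The output agrees with the first block of the protein sequence. Passing the full gene sequence to `translate` should, under the same assumptions, reproduce the full protein sequence listed above.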