Gene Ndas_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3141 
Symbol 
ID9246997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3757353 
End bp3759140 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content76% 
IMG OID 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_003681056 
Protein GI297562082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTACGAC AAGCGGCGCC CGCCGGCGTC CAGACCAGCA TCAGCGACCT CGGTACGCCG 
TTGGCCGCGG CGTCCTTCGT GGTGCTGGAC CTGGAGACCA CCGGCACCAG CGCGAGCGGC
TCCCGGATCA CGGAGGTCGG CGCGGTCAGG GTGCGCGGCG GCGAGGTCGT CGGCGAGTTC
GCCACCCTGG TCAATCCCGG TACCCCCATA CCGGCCAACA TCACCCTCCT GACGGGGATC
ACCCAGTCGA TGGTGGCCTC GGCCCCGCCG ATGGAGGAGG TCCTGCCCCG GCTGCTGGCC
TTCCTGGACG CCGAGCCCGA CACCGTGCTG GTGGCCCACA ACGCGCCCTT CGACACCGGC
TTCCTCAAGG CCGCGTGCGA GCGCCACGGC ACGGACTGGC CCGGCTACCC GGTCGTGGAC
ACGCTGCGCC TGGCCCGCGC CGTGCTCGCG CGCGGCGAGA CGCGCAACCA CCGGCTGGCC
ACACTGGCGG CCTACTTCGG GGTCCCGGTC GCGCCCAACC ACCGCGCCCT GGAGGACGCC
CGCGCCACCG TGGGCGTCCT GCACGGGCTC GTGGAGCGCC TGCGCCCCAT GGGCGTGTCC
AGCGTGGAGG AGCTGCGCGC GGTCACCAAG CCGCCCACCA AGGCCCAGCG CAGCAGGCGC
CACCTGGCCG AGGACCTGCC GGAGGAGCCG GGCGTGTACG TGTTCACCGA CGCCCGCGGG
GAGAGCCTCT ACGTCGGCAA GAGCAAGAAC CTGCGCCGCC GGGTGCGCAC CTACTTCACC
GCGGCCGAGA GCCGCCAGCG CATCCGTGAG ATGGCGGGCC TGGTGGCGGG CGTGACCCCC
ATCGTGTGCT CCAGCGAGCT GGAGGCCTCC GTCCGCGAGC TGCGCATCAT CGCCGAACGC
AAACCGCCCT ACAACCGGCG TTCGCGCAAC CCCGAGCGCG CCTCGTGGGT CCGGCTCACC
GCCGACGCCT TCCCGCGCCT GTCGGTGGTG CGCGCGGTCA GCGGCGACGG GGCGGCCCAC
ATCGGGCCCT ACGCCTCGCC TCGCGAGGCC GAGCGGGCCA GGGAGGCCCT GCTCCACGTC
TTCCCGCTGC GCCAGTGCGC GCACACCTTC CGGCCCCCGA AGGCGGTCGC GGGGGGAAGC
GGAGGCGGGA CCCGCGTCGC CCCGCAGGTG GTCACCTCGG GCGCGCGGTG GACGGGCCCG
TGCGTGGTGG CACAGCTGGG CCGCTGCGGC GCGCCCTGCG ACGGCAGCGA GAGCGAGGCC
GAGTACGCCG TGCACGCCGA GGCCGCGCGC GTCGCGATGA CCGGGGACCC CGCCGCCGTG
GTGGACGCGT ACACCGCGCG CATAGGCGAA CTCGCCGCGG ACCTGCGCTA CGAGGAGGCC
GCGCACCTGC GCGACCGGCT CACCGCGTTC CTGCGCGGCG CCAGGCGGGC CCAGCGCCTG
TCGGCCATCG CCGCGGTCGC GCACCTGGTC GCCTCCCGCC GCACCGCCGC GGGCTGGGAG
ACCTGCGTGG TCCGCCACGG CCGCCTGGCC GCCAGCGCCG TGCTGCGCCC GGGCACCGAC
CCGGCCGCGT TCCTGGCCTC CCTCGTGGCC ACCGCCGAGT ACGTGCCCGC CGGGTACGGG
CCCAGCCCCG GCGCGCTCCC GGGGGAGACC GAGCTGGTCC TGGACTGGCT CGCCGACCCC
GCCACGCGGC TGGTCGAGAT CGACGGGGAG TGGACATGCC CGTTGCGCAG CGCCGAGGCG
CACACCGGGA TGACACACTG GGCCCATGGC CGCGCTTCCG CTCAATGA
 
Protein sequence
MVRQAAPAGV QTSISDLGTP LAAASFVVLD LETTGTSASG SRITEVGAVR VRGGEVVGEF 
ATLVNPGTPI PANITLLTGI TQSMVASAPP MEEVLPRLLA FLDAEPDTVL VAHNAPFDTG
FLKAACERHG TDWPGYPVVD TLRLARAVLA RGETRNHRLA TLAAYFGVPV APNHRALEDA
RATVGVLHGL VERLRPMGVS SVEELRAVTK PPTKAQRSRR HLAEDLPEEP GVYVFTDARG
ESLYVGKSKN LRRRVRTYFT AAESRQRIRE MAGLVAGVTP IVCSSELEAS VRELRIIAER
KPPYNRRSRN PERASWVRLT ADAFPRLSVV RAVSGDGAAH IGPYASPREA ERAREALLHV
FPLRQCAHTF RPPKAVAGGS GGGTRVAPQV VTSGARWTGP CVVAQLGRCG APCDGSESEA
EYAVHAEAAR VAMTGDPAAV VDAYTARIGE LAADLRYEEA AHLRDRLTAF LRGARRAQRL
SAIAAVAHLV ASRRTAAGWE TCVVRHGRLA ASAVLRPGTD PAAFLASLVA TAEYVPAGYG
PSPGALPGET ELVLDWLADP ATRLVEIDGE WTCPLRSAEA HTGMTHWAHG RASAQ