Gene Ndas_3583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3583 
Symbol 
ID9247452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4296520 
End bp4298832 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content70% 
IMG OID 
Productguanosine pentaphosphate synthetase I/polyribonucleotide nucleotidyltransferase 
Protein accessionYP_003681490 
Protein GI297562516 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGGCG TGTACTCGGC CGAAGCCGTC ATCGACAACG GCTCCTTCGG CACCCGCACC 
ATCCGCTTCG AGACCGGTCG CCTGGCCCGC CAGGCCGCCG GATCGGCCAC GGTGTACCTG
GACGACGAGA CGGTCGTCCT GTCCGCCACC ACCGCCTCGA AGCGCCCCAA GGAGAACCTC
GACTTCTTCC CCCTGACGGT GGACGTCGAG GAGCGCATGT ACGCCGCGGG CCGCATCCCC
GGCTCCTTCT TCCGGCGTGA GGGCCGTCCC TCCGAGGACG CCATCCTCAC CTGCCGCCTG
ATCGACCGGC CGCTGCGCCC GTCGTTCAGG AAGGGCCTGC GCAACGAGAT CCAGATCGTC
GAGACGGTCC TGGCCCTGCA CCCCGAGCAC CTGTACGACG TCGTCGCCAT CAACGCGGCC
TCGATGTCCA CCCAGATCGC CGGGCTGCCC TTCTCCGGCC CGATCGGCGG CGTGCGCGTC
GCCCTCATCG ACGGCCAGTG GGTGGGCTTC CCGACCCACG CCGAGCTGGA GAACGCCACC
TTCGACATGG TGGTCGCCGG TCGCGTCCTG GCCGACGGCG ACGTCGCCAT CATGATGGTG
GAGGCCGAGT CCACGGTCCG CACCCTCAAG CTGGTCGCCG AGGGGGCCGT CGGCCCCAAC
GAGCAGACCG TCGCCGAGGG CCTGGAGGCC GCCAAGCCCT TCATCAAGGT CCTGTGCAAG
GCCCAGCAGG CCGTCGCCGA CCAGACCAGC CGCGAGCAGG GCGAGTTCCC GATCTTCCTC
GACTACGAGG ACGACGTGTA CGTCGCCGTC GAGGGCGCCG TCCGCGAGGA GCTGTCCAAG
GCGCTGACCA TCGCGGACAA GCAGGACCGC GAGGCCGAGC TGGACCGCGT CAAGGCCTCG
GCCGCCGAGA AGCTCGCCGA GGACTTCGAG GGCCGCGAGA AGGAGATCGG CGCCGCCTTC
CGCTCCCTGA GCAAGCAGCT CATGCGCGAG CGCGTGCTGC GCGACGGGAT CCGCATCGAC
GGCCGCGGCC CCAAGGACAT CCGCCAGCTG AGCGCCGAGG TCGGCGTCCT GCCGCGGGTG
CACGGCTCGG CCCTCTTCGA GCGCGGCGAG ACCCAGATCA TGGGTGTCAC CACGCTCAAC
ATGCTGCGCA TGGAGCAGAC GGTTGACACG CTCAACCCGG ACAAGACCAA GCGCTACATG
CACAACTACA ACTTCCCGCC CTACTCCACC GGTGAGACCG GTCGGGTGGG CTCGCCCAAG
CGGCGCGAGA TCGGGCACGG CGCCCTCGCT GAGCGCGCCC TGCTCCCGGT CCTGCCCGCC
CGCGAGGAGT TCCCCTACGC CATCCGTCAG GTGTCGGAGG CCCTGGGCTC CAACGGCTCC
ACCTCGATGG GTTCGGTCTG CGCCTCCACC ATGTCCCTGA TGGCGGCCGG CGTGCCCCTC
AAGGAGATGG TGTCGGGCAT CGCCATGGGC CTGATCAGCG AGGGCGACGA GTTCGTCACC
CTCACCGACA TCCTGGGCGC CGAGGACGCG TTCGGCGACA TGGACTTCAA GGTCGCCGGT
ACCCGCGAGC TCATCACGGC CCTGCAGCTG GACACCAAGC TCGACGGCAT CCCCGCCGAG
CAGCTGGCGC TCGCGCTCCA GCAGGCCCGC GGCGCCCGCC TGGCCATCCT CGACGTCATG
CAGGAGGCCA TCGAGCGCCC GGCCGAGATG AGCCCGAACG CTCCGCGCAT CCTCACCGTC
AAGGTCCCGG TGGAGAAGAT CGGCGAGGTC ATCGGCCCCA AGGGCAAGAT GATCAACTCG
ATCCAGGACG ACACCGGCGC CGAGATCACC ATCGAGGACG ACGGCACGAT CTACATCGGC
GCCACCGACG GCCCCTCGGC CGAGGCCGCG CGGGACACGA TCAACCAGAT CGCCAACCCG
ACGATGCCCG AGGTCGGCGA CCGCTACCTG GGCACCGTCG TCAAGACGAC CACGTTCGGC
GCGTTCGTGT CGCTGCTGCC CGGCAAGGAC GGCCTGCTGC ACATCTCGCA GATCCGCAAG
CTGCACGGCG GCAAGCGGAT CGAGAACCTC GACGACGTGA TCAGCATCGG CGAGAAGATC
CAGGTCGAGA TCCGCGAGAT CGACGACCGC GGCAAGCTGT CGCTGGTCCC GGTCGAGGTC
GTGGAGGCCG AGTCGGCCCA GGCCGCTCCC GCGGCTCCCG CCGCTGAGGA CGAGGGCTCC
CCGGCCGAGG AGGACGGCGG CGACAAGAAC GGCGGGGACG CCCCGCGCCG TCGCCGCCGC
CGCAGCTCGG GCGGGCGTTC GGAGAACACC TGA
 
Protein sequence
MEGVYSAEAV IDNGSFGTRT IRFETGRLAR QAAGSATVYL DDETVVLSAT TASKRPKENL 
DFFPLTVDVE ERMYAAGRIP GSFFRREGRP SEDAILTCRL IDRPLRPSFR KGLRNEIQIV
ETVLALHPEH LYDVVAINAA SMSTQIAGLP FSGPIGGVRV ALIDGQWVGF PTHAELENAT
FDMVVAGRVL ADGDVAIMMV EAESTVRTLK LVAEGAVGPN EQTVAEGLEA AKPFIKVLCK
AQQAVADQTS REQGEFPIFL DYEDDVYVAV EGAVREELSK ALTIADKQDR EAELDRVKAS
AAEKLAEDFE GREKEIGAAF RSLSKQLMRE RVLRDGIRID GRGPKDIRQL SAEVGVLPRV
HGSALFERGE TQIMGVTTLN MLRMEQTVDT LNPDKTKRYM HNYNFPPYST GETGRVGSPK
RREIGHGALA ERALLPVLPA REEFPYAIRQ VSEALGSNGS TSMGSVCAST MSLMAAGVPL
KEMVSGIAMG LISEGDEFVT LTDILGAEDA FGDMDFKVAG TRELITALQL DTKLDGIPAE
QLALALQQAR GARLAILDVM QEAIERPAEM SPNAPRILTV KVPVEKIGEV IGPKGKMINS
IQDDTGAEIT IEDDGTIYIG ATDGPSAEAA RDTINQIANP TMPEVGDRYL GTVVKTTTFG
AFVSLLPGKD GLLHISQIRK LHGGKRIENL DDVISIGEKI QVEIREIDDR GKLSLVPVEV
VEAESAQAAP AAPAAEDEGS PAEEDGGDKN GGDAPRRRRR RSSGGRSENT