Gene Ndas_4006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4006 
Symbol 
ID9247878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4791095 
End bp4792675 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content71% 
IMG OID 
ProductGMP synthase, large subunit 
Protein accessionYP_003681909 
Protein GI297562935 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.147243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.371451 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGTTG GTGCTGAGAG CACGTTCGAC ACCGTTCTCG TCGTCGACTT CGGCGCGCAG 
TACGCGCAGC TGATCGCACG CCGGGTCCGC GAATGCCACG TGCACAGTGA GATCGTGCCC
TCGACCATGC CCGTCGAGGA GATGCTCGCC AAGAAACCCA AGGCCATCAT CCTCTCCGGC
GGACCGTCGT CGGTCTACGC CGAGGGGGCC CCGAACCTGG GCCCCGAGCT GTTCGAGACC
GGGGTGCCCA CCTTCGGCAT CTGCTACGGC TTCCAGGCGA TGACCCGGGC CCTGGGCGGC
ACCGTCGCCA GGACCGACCT CAGCGAGTTC GGCCGCACCG AGCTGTCGGC GGTCACCGAC
TCCCTGCTGT TCGCCGGGAT GCCCGCCGAG CAGCTGGTCT GGATGTCGCA CGGCGACTCG
GTGGTGGAGG CCCCCGAGGG CTTCGCCACG ACCGCCAGCA CCGCGGGGGC CCCGGTCGCC
GCGTTCGAGC ACACCGGCCG CAACCTCTTC GGCGTCCAGT TCCACCCCGA GGTCCTGCAC
ACCGAGCACG GCCAGGAGGT GCTGCGCCGC TTCCTGTACG AGGGCGCCGG GTGCCGCCCC
ACCTGGACGA TGGTCAACAT CGTCGAGGAG CAGCTGGAGC GCATCCGCGA GGACATCGGC
GACAAGCGCG TCATCTGCGC GCTGAGCGGC GGCGTGGACT CCGCGGTGGC CGGCGCGCTG
GTGCAGCGCG CCGTCGGCGA CCAGCTGACC TGCGTCTTCG TGGACCACGG CCTGCTGCGC
AAGGGCGAGG CCGAGCAGGT GGAGAAGGAC TTCGTCGCGA TCACGGGCGC CAAGCTCAAG
GTCGTGGACG CCGAGGAGCG GTTCCTGTCC GCCCTCGCGG GGGTCTCCGA CCCCGAGGAG
AAGCGCAAGA TCATCGGCCG CGAGTTCATC CGCGTGTTCG AGCAGGCGGC CCGCGAGGTC
GTCGCCGAGA GCGGCGAGAC CGGCGCCGAG GTCGAGTTCC TGGTGCAGGG CACCCTCTAC
CCCGACGTGG TGGAGTCCGG CGGCGGTACC GGGACCGCCA ACATCAAGTC CCACCACAAC
GTGGGCGGGC TGCCCGACGA CCTCCAGTTC ACGCTGGTGG AACCGCTGCG CGAGCTGTTC
AAGGACGAGG TGCGCAAGGT CGGCGAGGAG CTGGGCCTGC CCGCCGAGAT GGTCTGGCGC
CAGCCCTTCC CCGGCCCCGG CCTGGGCATC CGCATCATCG GCGAGGTCAC CCGCGAACGC
CTGGAGATCC TGCGCGAGGC CGACGCGATC GCCCGCGAGG AGCTGACCCG CGCCGGACTC
GACCGCGACA TCTGGCAGTG CCCGGTGGTG CTGCTCGCCG ACGTGCGGTC GGTGGGCGTG
CAGGGCGACG GGCGCACCTA CGGCCACCCG GTCGTGCTGC GCCCGGTCAG CAGCGAGGAC
GCCATGACCG CCGACTGGTC GCGCGTGCCC TACGACGTGC TGGCCAGGAT CTCCAACCGC
ATCACCAACG AGGTGCGCGA GATCAACCGG GTGGCGCTGG ACGTGACCAG CAAGCCCCCG
GGCACCATCG AGTGGGAGTA G
 
Protein sequence
MSVGAESTFD TVLVVDFGAQ YAQLIARRVR ECHVHSEIVP STMPVEEMLA KKPKAIILSG 
GPSSVYAEGA PNLGPELFET GVPTFGICYG FQAMTRALGG TVARTDLSEF GRTELSAVTD
SLLFAGMPAE QLVWMSHGDS VVEAPEGFAT TASTAGAPVA AFEHTGRNLF GVQFHPEVLH
TEHGQEVLRR FLYEGAGCRP TWTMVNIVEE QLERIREDIG DKRVICALSG GVDSAVAGAL
VQRAVGDQLT CVFVDHGLLR KGEAEQVEKD FVAITGAKLK VVDAEERFLS ALAGVSDPEE
KRKIIGREFI RVFEQAAREV VAESGETGAE VEFLVQGTLY PDVVESGGGT GTANIKSHHN
VGGLPDDLQF TLVEPLRELF KDEVRKVGEE LGLPAEMVWR QPFPGPGLGI RIIGEVTRER
LEILREADAI AREELTRAGL DRDIWQCPVV LLADVRSVGV QGDGRTYGHP VVLRPVSSED
AMTADWSRVP YDVLARISNR ITNEVREINR VALDVTSKPP GTIEWE