Gene Ndas_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1109 
Symbol 
ID9244956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1358972 
End bp1361338 
Gene Length2367 bp 
Protein Length788 aa 
Translation table11 
GC content72% 
IMG OID 
Productglycoside hydrolase family 65 central catalytic 
Protein accessionYP_003679056 
Protein GI297560082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.824966 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.328257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCT GGACGCTGGT CTACGAGGGG CGGGACCCGG ATGGCCAGGG CGTACGCGAG 
ACCCTGTGCA CCTTGGGGAA CGGTTACTTC GCCACGCGCG GCGCGCCGCC CGAGGCCCGT
GACGACGGAG TCCACTACCC GGGCACCTAC GTCGCCGGCT GCTACGACCG CGCCGTCTCC
GAGGTCGACG GGCACCGGGT GGAGAACGAG GACCTGGTCA ACGCCCCCAA CTGGCTCCCC
CTGACCTTCC GCGTCGGTGA CGGCGACTGG TTCGAACGAC CGGACCCGGC ACTCCCGCAG
CGCACCGAGC TGGACATGCG CCGAGGAGTG CTGACGCGGA CCTTCCACGT GGTGGACGAC
GGCAGGAGGA CACGGGTGGC CCAACGGCGC CTGGTGTCGA TGGACGCCCC GCATCTGGCC
GCCCTGGAGA CCACCTTGGT GCCCGAGGGT TGGAGCGGGA CCGCGGTCGT CCGGTCCGCC
CTGGACGGGA GGGTGGCCAA CCGCGGCGTG GCGCGCTACC GCGACCTGAA CGGACGTCAC
CTCCACCCGC TGGACACCGG CTCCGACGGC CCCGGCCTCG ACTGGCTGCG CTGCCGGACC
CTGTCCTCGG GCGTCGAGGT CGCGCTCGCC TCCCGGACCC TGGTCTCCCA GGGACCCCGG
CCCGCGGCCC GTGAAAGCCC CGCGGGCGAC GGCTGGGCGG CCACCGACCT GATCCTGGAC
CTGAGGAGCG GTGAACAGAC CACCGTGGAG AAGACGGTGG CCCTGTACAC CTCGCGCGAC
CGCGCCGTCG GCGACATCCT CGACGCGGCC CGCGACGCCC TGGAACGGGC GGGCGGATTC
GACGAACTGC TGCGCCGACA CACCACCGCC TGGCACCACC TGTGGCGGTC CTGCGCACTG
GAGGCCGGGG ACGAGGAGGA ACAGCGGGTC CTCAACCTGC ACCTCTTCCA CCTCCTGCAG
ACGCTGTCGC CCCACACGGC CGACCTCGAC GCGGGTGTGC CCGCGAGGGG CCTGCACGGT
GAGGCCTACC GCGGCCACGT CTTCTGGGAC GAGCTGTTCG TCCTTCCCTT CCTCAACCTC
CACTTCCCCG AGACCGCGCG AGCCCTGCTG CGCTACCGGT GGCGCAGACT GCCCCAGGCG
CGGGCCATCG CCCGCGCCGC GGGGCTGAGG GGCGCTCTCT TCCCCTGGCA GAGCGGGAGC
GACGGCAGCG AGGAGTCCCA GAGCACACAC CTCAACCCCC GCTCGGGGAG GTGGATCCCC
GACCACTCGC ACCTGCAGCG CCATGTCGGG CTCGCGGTCG CCTACAACGT CTGGCAGCAC
CACCAGGCCA CCGGCGACAC CGCCTTCCTG ACCGGGTTCG GCGCGGAACT GCTGTTGGAG
GTCGCCCGCG CCTTCGCGGA CATGGCCGTC TACGACAAGG CTTTGGACCG CTACGTGATC
CGCGGTGTGA TGGGCCCCGA CGAGTACCAC GACGGCTACC CCGGCCGCGA GGACCCCGGT
CTGGACGACA ACGCCTACAC CAACCTCATG GCGGTGTGGG TCATGCTGCG GGCGCTGGAC
ACCCTGCGGG CGCTGCCGGG ACCGAGCCGC AGGGACCTGG AGGAGTCCCT CGGGCTGGAC
GCGGACGAGG TCGAGCGGTT CGAGACCCTC ACCCGCAAGA TGCGCGTCCC CTTCCACGAG
GGAGTCATCA GCCAGTTCGC CGGTTACGGG GACCTGGAGG AGCTCGACTG GCGGGACTGC
CGCGGCGTCC GGCGCCTGGA CCGCTTCCTG GAGGCCGGGG GCGACAGCTG CAACCGCTAC
AAGGCGTCCA AGCAGGCCGA CGTGCTGATG CTGTTCTTCC TGCTACCCGC CGAGGAGATC
GCCGACATGC TGCGCCGCCT CGGCTACACC TACGATCCCG GGCTCATCCC CCGCACGGTC
GACTACTACC TTGCACGCAC CTCGCACGGG TCGACCCTCA GCTCCGTGGT GCACTCCTGG
GTGCTGGCCC GGACCAACCG GGAGGAGTCC TGGGACTTCT TCCGCAGGGC GCTGAGCACC
GACGTCGACG ACGTCCAGGG CGGGACCACG GCCGAGGGCA TCCACCTGGG GGCCATGGCG
GGCACCGTCG ACCTCCTCAC CCGGTGCTAC ACCGGCCTGA CCACACGCGG TGGAGCCCTG
CACCTGAGCC CCCTGCTGCC CGCCGAACTG GACCACCTCT CCTACGGACT GCGCTACCAC
GACCACTGGG AGGTGGGCGT GGACGTGCGC CGCGACCACG TGCGGGTCAC CCTGCCACCC
TCGGCCGGGC CGCCCGTCCG GGTCCGCGTC AAGGAACGCC ACGCCCTGGT CGCCCCCGGC
TCGTCCTGTG TCCTACCCCT GTGGTGA
 
Protein sequence
MSRWTLVYEG RDPDGQGVRE TLCTLGNGYF ATRGAPPEAR DDGVHYPGTY VAGCYDRAVS 
EVDGHRVENE DLVNAPNWLP LTFRVGDGDW FERPDPALPQ RTELDMRRGV LTRTFHVVDD
GRRTRVAQRR LVSMDAPHLA ALETTLVPEG WSGTAVVRSA LDGRVANRGV ARYRDLNGRH
LHPLDTGSDG PGLDWLRCRT LSSGVEVALA SRTLVSQGPR PAARESPAGD GWAATDLILD
LRSGEQTTVE KTVALYTSRD RAVGDILDAA RDALERAGGF DELLRRHTTA WHHLWRSCAL
EAGDEEEQRV LNLHLFHLLQ TLSPHTADLD AGVPARGLHG EAYRGHVFWD ELFVLPFLNL
HFPETARALL RYRWRRLPQA RAIARAAGLR GALFPWQSGS DGSEESQSTH LNPRSGRWIP
DHSHLQRHVG LAVAYNVWQH HQATGDTAFL TGFGAELLLE VARAFADMAV YDKALDRYVI
RGVMGPDEYH DGYPGREDPG LDDNAYTNLM AVWVMLRALD TLRALPGPSR RDLEESLGLD
ADEVERFETL TRKMRVPFHE GVISQFAGYG DLEELDWRDC RGVRRLDRFL EAGGDSCNRY
KASKQADVLM LFFLLPAEEI ADMLRRLGYT YDPGLIPRTV DYYLARTSHG STLSSVVHSW
VLARTNREES WDFFRRALST DVDDVQGGTT AEGIHLGAMA GTVDLLTRCY TGLTTRGGAL
HLSPLLPAEL DHLSYGLRYH DHWEVGVDVR RDHVRVTLPP SAGPPVRVRV KERHALVAPG
SSCVLPLW