Gene Ndas_3729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3729 
Symbol 
ID9247598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4477940 
End bp4479616 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content74% 
IMG OID 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003681633 
Protein GI297562659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGT CCGTTCCGCA CGCCCTGGCC CCGGCCTGCG TGGGACTCGC ACTGATCGCC 
ACCGCATGCA CCACAGAGGA GGTGGGTTCC TCGCAGGAGG ACCGCGCAAC CCAGCCGTCG
GCCGAGGCCG AGCCCTTCGA GCCCGTGCTC GCACCCCGGC TGCTGTCCGA GATGGACCTG
GACGACAAGA TCGGCCAGCT CCTCGTCCTG ACCGCGCAGG GCACCTCGGC CGCCGAGAAC
GCAGCCCAGA TCGAGGCCTA CCGGCCCGGC GGCCTCATCT ACTTCGACGC CAACCTCACC
GACGCCGAGC AGATCGCCAC CATGTCGGCG GGCGTGCAGG ACCTCGCCGC CGACCAGGGG
CGGGGCGTTC CGCTGTTCGT CGGCATCGAC CAGGAGCAGG GCCTGGTGGC CCGGCTGCCC
GTGGGCACCC GCTTCCCCGA CGCCATGGCC GTCGGCGCCA CCCGCGACAC CGAGCTGGCC
GAGCTGCGCG CCTCCACCAC CGCCGAGGAG CTCACCGCCC TGGGAGTCAA CCTCAACTAC
GCGCCCGACG CCGACGTCAA CACCGACCCC GGCAACCCGG TCATCGGCAT CCGCTCCTTC
GGATCCGACC CCGATCTGGT CGCGCAGATG GCCGTCGCCG AGTCGGACGC CTACTCCGGC
GCCGGTGTGG TGTCGGTGGT CAAGCACTTC CCCGGGCACG GCGACACCGA CGTGGACAGC
CACAGCGGCC TGCCCGTCAT CGACATGCCG CGCGAGCAGT GGGAGGCCGG GCACCTGCCG
CCGTTCCGGG CGGCCATCGA CGCCGACGTG GACGCCATCA TGACCGCCCA CGTGCTCATG
CCGCAGCTGG ACGGGAGCGA GGACCCCGAG CCCGCCACCA TCTCCCCGGA GCTGATCGAC
GGCATCCTCC GCGACGAGCT GGGCTACGAC GGCGTGGTGA CCACCGACGC CCTCAACATG
GAGGGTGTGC GCCAGCGCCA CTCCGACGGC GAGATCGCCG TGCGCGTGCT GGAGGCGGGC
GTGGACCAGC TGCTCATGCC GCCGGACCCC GCCGCCGCGG TGTCCGCGAT CCGCGAGGCC
GTCGAGCAGG GCCGCCTGAC CGAGGAGCGC ATCGACGAGT CCGTGCTGCG CGTCCTGGCG
CTCAAGGAGA AGCGCGGGAT CCTGGAGGCC GAACCGGTGG ACGCGCAGGG CGCCGCGGCG
GTCCTGGAGG ACCCCGAGCA CGCCGAGGCC GCCCAGCGCG TGGCGGACGC CTCCGCGACC
CTGCTGCGCA ACGAGGGCGA CCTGCTGCCC CTGGCCGAGG GCAGCGGAGT CCGCGTCCAG
GGGGTGGGCG CGGAGCAGAT CGCGGCCGCG CTCACCGAGG CCGGGATCGA CGTGGTGGAG
AGCGGGGCCG ACGCCGTGGT CGTGGGCACC GGGGGCGGCA GCGGGGCCTC GGAGCAGAGC
GGCCTGGTCC AGGCCGCGCG CGCCGAGGGG CTGCCGGTGG TCGTGGTGTC CCAGGGCACG
CCCTACGACC TGGCGGCCTT CCCGGAGGCG GAGGCGTTCG TGGCGGTGTA CTCGTCCATG
GACGTGTCGC GGGCGGCCGC CGCGCGGGTC GTCGCCGGAC AGGTGAAGCC CTCCGGCAAG
CTGCCGGTGG ACATCCCCGC GGCCGACGTG GAGATCGGGA CCGGGCTGAC CACCTGA
 
Protein sequence
MKLSVPHALA PACVGLALIA TACTTEEVGS SQEDRATQPS AEAEPFEPVL APRLLSEMDL 
DDKIGQLLVL TAQGTSAAEN AAQIEAYRPG GLIYFDANLT DAEQIATMSA GVQDLAADQG
RGVPLFVGID QEQGLVARLP VGTRFPDAMA VGATRDTELA ELRASTTAEE LTALGVNLNY
APDADVNTDP GNPVIGIRSF GSDPDLVAQM AVAESDAYSG AGVVSVVKHF PGHGDTDVDS
HSGLPVIDMP REQWEAGHLP PFRAAIDADV DAIMTAHVLM PQLDGSEDPE PATISPELID
GILRDELGYD GVVTTDALNM EGVRQRHSDG EIAVRVLEAG VDQLLMPPDP AAAVSAIREA
VEQGRLTEER IDESVLRVLA LKEKRGILEA EPVDAQGAAA VLEDPEHAEA AQRVADASAT
LLRNEGDLLP LAEGSGVRVQ GVGAEQIAAA LTEAGIDVVE SGADAVVVGT GGGSGASEQS
GLVQAARAEG LPVVVVSQGT PYDLAAFPEA EAFVAVYSSM DVSRAAAARV VAGQVKPSGK
LPVDIPAADV EIGTGLTT