Gene Ndas_5561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5561 
Symbol 
ID9249464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp758352 
End bp760625 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content76% 
IMG OID 
ProductAlpha-L-fucosidase 
Protein accessionYP_003683446 
Protein GI297564473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.56822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAG CACGACGGGT CCGCACCCGC CATCCGGTCG GGAACTGGGA GGACGCGCTG 
GTCACCGGCA ACGGGCGCCA GGGGGCGCTC GTGCACTCGA CGGCGGAGCG CGTCCGCCTC
ACGCTGGGCC ACGAGCGGCT GTTCCTGCCG GTCACGGAGC CCCTGCCCGC TCCCGCGACC
GCCTCCCTGC TACCCGAACT GAGGGAGCTG CTGCGCCGGG GCCGCTCCCG CGAGGCCGCC
GGGCGGATCA CGGAGTTCGC GGCGCGGGAG CACCCGGGCT ACGCCGACAC CCGGTGGATC
GACCCGCTGG TGGGGGCCGC CGTGCTCTCG TTCGCCCCGC GCGGACCCCG GCCGGGGCCG
GTCACGCGCA CGTGCGACCT GGACTCGGGC CTGGTCACGG AGGAGCTGCC GGACGGGACC
GTGCACCGGG CCTTCGCCTC CCGGCCCGAC GACGCCGTGG TCGTCGAGCT CGCCTCACCC
GGTGGCCTGG ACGGAACGCT GCGGCTCACC GCCTTGGAGG AGACGCCGCC GGTCCCGATG
GCGGTGGCCC CTGAGGCGGC CCCCGGCCTG TTGACCCTGC GGGCGGACTT CCCCGAGCGC
CCGCCCGGCG GCCTGTCCGG GTACACGGTG ACCTGCGCCG TCAGCGCCGA GGGCGGTCGG
GTCGGCGTCG GTGGTGAGGG GTTGGACCTG CGGGGTGTCC GGCGGCTCCG GCTGGTGGCA
CGCGTGCGGG TGGACGGTTT CGCCGCGTCC GCCGGGATCC CGGAGGACTT CGCCGGGGCG
CTCGAACGCC ACCGGGCGGT GCACGGCGAC CTCATGTCCC GCTTCCGGCT GGAGCTGGAG
GGCGGTCCGG CCCACCCCGA CGCGCTCGGC GAGGACCTCC TGGCGGCGGG GGCCGCCCCC
GACCTGATCG CACGCCTGGT CGACGCCGGG CGGTACGCGA TCATCTCCAG CTGCGGCGAC
CTGCCGCCCA CCCTCCAGGG GGTGTGGAGC GGCACCTACG ACCCACCGTG GCGGTCGGGG
TACACCTTCG ACGGCAACCT GCCCTCCGCC CTCGCGGCCC TGCACACCAC GGGCACGCCC
GAGCTGATGA CCGGCCTGTT CGACCTCCTC GACGGCATGG CCGACGACCT GGCGGAGAAC
GCGCGGCGGC TCTACGGCTG CCGGGGGATC CTGCTGCCCG CGCACGCCTC CACCTCCGGC
AGGCACAACC ACTTCGGCCC CGAGTGGTGC CTGACCTGCT GGACCGCGGG AGCGGCGTGG
ACGTCCCGCC TGTACTGGGA CCACTACTCC CACACCCGCG ACCGCGCGTT CCTGCGCGAG
CGCGCCCTGC CCTTCCTCAC CGCCGCCGCG GAGTTCCACG AGGACTTCCT CACCGACGAG
GGCTTCTCCC CCTCCTACTC GCCGGAGAAC ACCCCCGGGG ACGGCGGCGG CCAGGCCGCC
GTCAACGCCA CCATGGACGT CGCGGCCGTG CGCGACCTGG TGCGCAACCT ACTGCGGGCG
CACCGCGTGC TGGGCGTTCC CGGATCCCGC CGCTGGGCAC GGCTGGGGGC GCGGCTGCCC
GGATACCGGG TCGCCCCGGG CGGCGAACTC GCCGAGTGGG CGGGCCCGGC CGGGCCGGTC
GGACAGACCG AGCGGCACGC GCACCGGCAC GCCTCCCACC TGTACCCGCT CTGGTACGAG
ACCGACCCCG CCCTGGCCTC GCCCCGGCTG CGTGCGGCGG CGGTCGCCGC CGTGCGCGCC
CGCCTGGAGT GGTGGCGCGG GCAGGAGTCC GACGAGATGG CCTTCGGCCT GGTCCAGCTG
GGGCTGGCCG CGGCCAACCT GGGCCTGGCC GCCGAGGCGC ACGAGACCCT GTCCCTGCTG
GCCACCCGCT ACTGGCGCCC CACCCTGGTG CCCACGCACA ACCGGGGCAG CCTGTTCAAC
GTGGACATCG GCGGCGGGTT CCCGGCCGTG GCGGCGGCGA TGCTGGCCCG GTCCGCCGAG
GGGCGCCTGG ACCTGTTGCC CGCCCTGCCC CGGGAGTGGG CCTCGGGCCG GGTGGAGGGG
CTGCGGGCCC GGGGCGGGGT CGTCGTGGAA CTCCTGGAGT GGTCCGGCGA TCGGGCCCGC
GCGCGGCTGC GGGCCGTCGA GCCCCGCGAG ACGGTGGTGG CGCTTCCCGG AGGGGAGCGG
CGCAAGGTGG ACCTGGTACC GGACCGGACG GTGTCCCTGC AATTCCCGAT GGTCGCCGGG
GAAGAGTCGA ATCGCTTCGA AATTTCCTCC ACAACACCCC TTGCCGGGGG TTAG
 
Protein sequence
MSAARRVRTR HPVGNWEDAL VTGNGRQGAL VHSTAERVRL TLGHERLFLP VTEPLPAPAT 
ASLLPELREL LRRGRSREAA GRITEFAARE HPGYADTRWI DPLVGAAVLS FAPRGPRPGP
VTRTCDLDSG LVTEELPDGT VHRAFASRPD DAVVVELASP GGLDGTLRLT ALEETPPVPM
AVAPEAAPGL LTLRADFPER PPGGLSGYTV TCAVSAEGGR VGVGGEGLDL RGVRRLRLVA
RVRVDGFAAS AGIPEDFAGA LERHRAVHGD LMSRFRLELE GGPAHPDALG EDLLAAGAAP
DLIARLVDAG RYAIISSCGD LPPTLQGVWS GTYDPPWRSG YTFDGNLPSA LAALHTTGTP
ELMTGLFDLL DGMADDLAEN ARRLYGCRGI LLPAHASTSG RHNHFGPEWC LTCWTAGAAW
TSRLYWDHYS HTRDRAFLRE RALPFLTAAA EFHEDFLTDE GFSPSYSPEN TPGDGGGQAA
VNATMDVAAV RDLVRNLLRA HRVLGVPGSR RWARLGARLP GYRVAPGGEL AEWAGPAGPV
GQTERHAHRH ASHLYPLWYE TDPALASPRL RAAAVAAVRA RLEWWRGQES DEMAFGLVQL
GLAAANLGLA AEAHETLSLL ATRYWRPTLV PTHNRGSLFN VDIGGGFPAV AAAMLARSAE
GRLDLLPALP REWASGRVEG LRARGGVVVE LLEWSGDRAR ARLRAVEPRE TVVALPGGER
RKVDLVPDRT VSLQFPMVAG EESNRFEISS TTPLAGG