Gene Ndas_2942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2942 
Symbol 
ID9246794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3515766 
End bp3517190 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content68% 
IMG OID 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_003680858 
Protein GI297561884 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0858983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTGG TCGTCGGTGG TGCCGGTGCC GTGACGGTGC CGACGCCCGC CGCGGCGGCG 
ACGGTGGACA CCAACGCGTG GTACGTGCTG GTGAACCGCG ACAGCGGCAA GGCGTTGGAC
GTGTACAACC TGGCCACCGG TGACGGCGCG CGGATCACCC AGTGGACACG GAACGACCAG
TACCAGCAGC AGTGGCGGTT CGTCGACTCC GGCAACGGTT ACTACCGGTT GCGGTCACGG
CATTCGGGCA GGGTGCTGGA CGTCCACAAC TGGTCGACCG CCAACGGCGG CGGTATCGTC
CAGTGGACCG ACCACGACCA GGCCAACCAG CAGTTCCGGC TGGAGGACTC GCCCGGCGGC
CACATCCGTC TGGTCAACCG GCACAGCGGC AAGGCCGTGG AGGTCCAGGG CGCCTCGACC
GCGGACGGTG CCAACGTCGT GCAGTACGAC GACTGGGGCG GCGGCAACCA GCAGTGGCGG
CTCGTCCGCG TCGACGGCGC GGGACCGGGC GAAACGTGCG CCCTTCCGTC GAGTTACACC
TGGACGTCGA CCGGGCCGCT GGCGCAGCCG AGGCCGGGGT GGGCCTCGCT CAAGGACTTC
ACCCACGCCC CCTACAACGG CCAGCACCTC GTCTACGCGA CGACCCACGA CACCGGGACG
TCATGGGGCT CGATGAACTT CGGCCTCTTC TCGGACTGGT CCCAGATGGG CTCGGCCAGC
CAGAACCCGA TGCCCTTCTC AGCCGTCGCG CCGACGCTCT TCTACTTCGC CCCCAGGGAC
GTCTGGGTGC TCGCCTACCA GTGGGCCGGT CCCGCCTTCT CCTACCGGAC ATCGACCAAC
CCCGTCAACG TGAACAGTTG GTCGGCTCCG CAGACGCTCT TCTCCGGAAG CATCGGTGAC
TCCTCCACGG GGCCCATCGA CCAGGCGCTC ATCGGCGACA GCACGCACAT GTACCTGTTC
TTCGCCGGGG ACAACGGCCG CATCTACCGG GCCGGCATGC CCATCGGCGA CTTCCCGGGC
AGCTTCGGCT CGACCTCGAC GGTCGTCATG TCCGACAGCA CCAACAACCT GTTCGAAGCG
GTTCAGGTCT ACAGGGTCGA GGGCGAGAAC CGGTACCTCA TGATCGTCGA GGCCATCGGC
GCGCAGGGGC ACCGCTACTT CCGCTCGTTC ACGGCCACCA GTCTGGACGG CACGTGGACA
CCCCAGGCCG CGACCGAGGG CAACCCCTTC GCGGGTCGGG CCAACAGCGG CGCCACCTGG
ACCAACGACA TCAGTCACGG TGAGCTCATC CGCACCAACC CCGACCAGAC CATGACCGTC
GACGCCTGCG ACATGCGGTT CCTCTACCAG GGGCGCTCCC CCGGCTCCGG CGGCGACTAC
GGCCTCCTGC CCTACCGGCC CGCAGTGCTG ACGCTGCGGC GCTGA
 
Protein sequence
MALVVGGAGA VTVPTPAAAA TVDTNAWYVL VNRDSGKALD VYNLATGDGA RITQWTRNDQ 
YQQQWRFVDS GNGYYRLRSR HSGRVLDVHN WSTANGGGIV QWTDHDQANQ QFRLEDSPGG
HIRLVNRHSG KAVEVQGAST ADGANVVQYD DWGGGNQQWR LVRVDGAGPG ETCALPSSYT
WTSTGPLAQP RPGWASLKDF THAPYNGQHL VYATTHDTGT SWGSMNFGLF SDWSQMGSAS
QNPMPFSAVA PTLFYFAPRD VWVLAYQWAG PAFSYRTSTN PVNVNSWSAP QTLFSGSIGD
SSTGPIDQAL IGDSTHMYLF FAGDNGRIYR AGMPIGDFPG SFGSTSTVVM SDSTNNLFEA
VQVYRVEGEN RYLMIVEAIG AQGHRYFRSF TATSLDGTWT PQAATEGNPF AGRANSGATW
TNDISHGELI RTNPDQTMTV DACDMRFLYQ GRSPGSGGDY GLLPYRPAVL TLRR