Gene Ndas_0249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0249 
Symbol 
ID9244083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp308080 
End bp310050 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content71% 
IMG OID 
Productalpha amylase catalytic region 
Protein accessionYP_003678204 
Protein GI297559230 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.852341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGGAC GCCTGCCCAT CCTCGATATC TCTCCAGAGA ACGACCTCGG ACCGGTGAAA 
GCCGTTCCGG GCGAACGCTT CACCGTGGGA GCGACGGTGA TCCGCGAGGG CCACGACTCC
CTCGCCGCGG GCGTCGTCGT CTACTCCCCC GAGGGACGCC GCGAACAGCT CGTCCCCATG
CGCGAGCACG CCCCCGGCAC CGACCGCTAC GAGGCCTGCG TCAGCCTCCC CACGGAGGGG
ACGTGGTCCT TCGCCGTCGA GTCGTGGACC GACCCCTTCG CCACCTGGCA CCACGTCGCG
GGCGTCAAGC TCCCGCTGGG CCAGGACACC GAACTCGTCC TGGAGGAGGG CGCCCGCCTG
CTCGCCCGCG CCGGGCGACG GGTCCCGCGC CGCCCCTTCC TGAACGCCGC CGCCAAGGCG
CTGCGCGACA CCTCGATGAC CCCGGTGGAG CGCTTCGAGG CGGCGCTCAC CCCCGAGGTC
CTGGCCGAGA TGGAGCGCGC GCCCCTGCGC GAGCTGGTCA CCAGGTCCAA GCGCACCAGC
GTCGTCGTCC ACCGCGAGCG CGCGCTGTTC GGCTCCTGGT ACGAGTTCTT CCCGCGCTCG
GAGGGCGCCC AGGTCGACAC CGCGCCCGGC CAGGAGCTCT CGGGCACCCT CGCCACCGCG
GCCAAGCGCC TGCCCGCCAT CGCCGACATG GGCTTCGACG TGGTCTACCT GCCGCCCATC
CACCCGGTCG GCACCACCCA CCGCAAGGGC GCCAACAACG CGCTGACCGC CGGTCCCGGC
GACCCCGGTT CGGTGTGGGC CATCGGGTCG GCCGACGGCG GCCACGACGC GGTCCACCCC
GACCTGGGCA CCCTCGCCGA CTTCGACGCC TTCGTCGCCG AGGCCCGCGA GCACGGCATG
GAGATCGCCC TGGACCTGGC CCTGCAGTGC TCCCCCGACC ACCCCTGGGT GACCGAGCAC
CCCGAGTGGT TCACCGCCCG GGCCGACGGC TCCATCGCCT ACGCCGAGAA CCCGCCCAAG
AAGTACCAGG ACATCTATCC GCTCAACTTC GACCGCGACT TCGAGGGCCT GTACGCGGAG
GTCCTGCGGG TGGTCGAGCA CTGGATCGCG CACGGCGTAC GCGTCTTCCG CGTCGACAAC
CCGCACACCA AGCCGGTCGC TTTCTGGCAG AAGCTGCTCG CCGACGTCGC CGACAGGCAC
CCCGACGTGC TGTTCCTCGC CGAGGCCTTC ACCCGCCCCG CCATGATGCG CACGCTGGCC
AAGGTCGGCT TCCACCAGTC CTACACCTAC TTCACCTGGC GCAACGGCAA GGACGAGCTG
ACCGACTACC TCACCGAGCT GAGCCGGGAG AGCGCCCACT ACCTGCGCCC CAACCTCTTC
GCCAACACCC CGGACATCCT GCACGCCTAC CTCCAGCACG GCGGTCGGCC CGCGTTCGCC
GTCCGCGCCG TGCTGGCGGC CCTGCTCTCC CCCACCTGGG GCGTCTACTC CGGCTTCGAA
CTGTGCGAGA ACACCCCCGC CGGGCCGGGC AGCGAGGAGT ACCTCGACTC GGAGAAGTAC
CAGTACCGCC CCCGCGACTG GGCCGCGGCC GAGGCCTCCG GCGAGACCCT CACCGGCCTC
ATCACGCTGC TCAACCGGCT GCGCCGGGAG CACCCGGCCC TGCGGGAGCT GCGCAACCTG
CGCTTCCACC ACGTGGACCG GCCCGAGATC GTCTGCTTCT CCAAGCACCG GCCCGGAACC
GGCCCCAAGG ACCCCGACGA CGCCGTGATC GCCGTCGTCA ACCTCGACCC GCACCACGCA
CGCGAGGCGA CGGTGCACCT GGATCTGCCG TCCATCGGCC TCACAAGGGA GGAGGAGTTC
AGGGTGACCG ACGAGCTGAC CGGCCGTTCC TACACCTGGG GTGCGGACAA CTACGTCCGT
CTCGACCCCG CGGCCGGTCC CGCGCACGTG TTCACCGTCA GCGGCAGATA G
 
Protein sequence
MIGRLPILDI SPENDLGPVK AVPGERFTVG ATVIREGHDS LAAGVVVYSP EGRREQLVPM 
REHAPGTDRY EACVSLPTEG TWSFAVESWT DPFATWHHVA GVKLPLGQDT ELVLEEGARL
LARAGRRVPR RPFLNAAAKA LRDTSMTPVE RFEAALTPEV LAEMERAPLR ELVTRSKRTS
VVVHRERALF GSWYEFFPRS EGAQVDTAPG QELSGTLATA AKRLPAIADM GFDVVYLPPI
HPVGTTHRKG ANNALTAGPG DPGSVWAIGS ADGGHDAVHP DLGTLADFDA FVAEAREHGM
EIALDLALQC SPDHPWVTEH PEWFTARADG SIAYAENPPK KYQDIYPLNF DRDFEGLYAE
VLRVVEHWIA HGVRVFRVDN PHTKPVAFWQ KLLADVADRH PDVLFLAEAF TRPAMMRTLA
KVGFHQSYTY FTWRNGKDEL TDYLTELSRE SAHYLRPNLF ANTPDILHAY LQHGGRPAFA
VRAVLAALLS PTWGVYSGFE LCENTPAGPG SEEYLDSEKY QYRPRDWAAA EASGETLTGL
ITLLNRLRRE HPALRELRNL RFHHVDRPEI VCFSKHRPGT GPKDPDDAVI AVVNLDPHHA
REATVHLDLP SIGLTREEEF RVTDELTGRS YTWGADNYVR LDPAAGPAHV FTVSGR