Gene Ndas_0808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0808 
Symbol 
ID9244653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp996957 
End bp998357 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content72% 
IMG OID 
Productbeta-galactosidase 
Protein accessionYP_003678758 
Protein GI297559784 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.201383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAACC GCAACGATTT CCCTGAGGGC TTCCTCTGGG GGGCGGCCAC CGCCTCGTTC 
CAGATCGAAG GCGCCACGAC CGCCGACGGC CGCGGCCGCA GCATCTGGGA CACCTTCGCC
GAGACGCCCG GCAAGGTCCT GGGCGGAGAC ACCGGCGACC CCGCCGACGA CCACTACAAC
CGCTACGCCG ACGACGTCGC GCTCATGACC CAGCTCAACC TGGGCGCGTA CCGCTTCTCC
GTCGCCTGGC CGCGGATCAT CCCCGAGGGC ACCGGCGCCG TGAACCAGGC CGGGATCGAC
TTCTACGACC GCTTGGTGGA CACCCTCCTG GCGGCCGGGA TCCAGCCCTG GGCGACCCTC
TACCACTGGG ACCTGCCCCA GCCGCTGGAG GACGCGGGGG GCTGGCCCCA CCGCGACACC
GCCTACCGCT TCCGCGACTA CGCCCAGGTC GTGGCCGAGG CCCTCGGCGA CCGCGTCAGC
AACTGGATGA CCATCAACGA GCCCTGGTGC TCGGCCTTCC TCGGCTACGA GAACGGTCAC
CACGCGCCGG GCCACAAGGA CCCGGCCGCG GCCCTGGCCG CCGCCCACCA CCTGCTGCTG
GGCCACGGCC TGGCCGCCGA GGCGATCCGC TCCACCGGCC ACCCCGCCCG GGTGGGCCTC
GCCCACAACC AGGCGGTCAT CCGGCCCCAC GGCCCCAGCG CCGCGGACGC CCGCGCCGCC
CGCCGCGCCG ACGGCGTGCG CAACCGCATC TTCACCGACC CCCTGCTCAA GGGCCGCTAC
CCCGCGGACG TCAGGGAGGA CCTCGCCGGC GTCAGCGACT TCTCCTTCAT CCAGGACGGC
GACCTGGAGA TCACCTCCGC CCCCCTGGAC TTCCTGGGCG TCAACTACTA CTCGCCCGAG
TTCGTCGCCG CCTCGGCCAA GGGCCTGGAC CCCGCGCTGG TCAGCGGCGA GGGCGGCGCG
TGGCTCGGCG CCGAGCCCGA GGAGGTGCAC GTCTCGCAGG GGCTGCCCGT CACGCACATG
GGCTGGGAGA TCGACCCCAC CGGCCTGTAC GACGTGCTCT CCCGCCTGGC GGGGGAGAGC
GGCGGCATCG ACCTCTACGT CACCGAGAAC GGGTGCGCCT TCGAGGACAC CGTCACCGAG
GCCGGCGAGG TCAACGACAC CGACCGGATC GACTACTACG AGGGCCACCT GAGGGCGGCC
AAGGAGGCCG TCCTGGCCGG AGTGCCCCTG CGGGGTTACT TCGCCTGGTC GCTCCTGGAT
AATTTCGAGT GGGCGTGGGG GTACTCCCGC CGGTTCGGCA TCGTGCACGT CGACTACGAG
ACGCAGCGCA GGATCATCAA GGACAGCGGT CACTGGTACG CCGAACTGGC CAAGACGGGG
CAGTTCCCCG AGCGCCAGTA A
 
Protein sequence
MNNRNDFPEG FLWGAATASF QIEGATTADG RGRSIWDTFA ETPGKVLGGD TGDPADDHYN 
RYADDVALMT QLNLGAYRFS VAWPRIIPEG TGAVNQAGID FYDRLVDTLL AAGIQPWATL
YHWDLPQPLE DAGGWPHRDT AYRFRDYAQV VAEALGDRVS NWMTINEPWC SAFLGYENGH
HAPGHKDPAA ALAAAHHLLL GHGLAAEAIR STGHPARVGL AHNQAVIRPH GPSAADARAA
RRADGVRNRI FTDPLLKGRY PADVREDLAG VSDFSFIQDG DLEITSAPLD FLGVNYYSPE
FVAASAKGLD PALVSGEGGA WLGAEPEEVH VSQGLPVTHM GWEIDPTGLY DVLSRLAGES
GGIDLYVTEN GCAFEDTVTE AGEVNDTDRI DYYEGHLRAA KEAVLAGVPL RGYFAWSLLD
NFEWAWGYSR RFGIVHVDYE TQRRIIKDSG HWYAELAKTG QFPERQ