Gene Ndas_3559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3559 
Symbol 
ID9247428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4268334 
End bp4269584 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681466 
Protein GI297562492 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.476897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.877312 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACA CACCCCCCGC ACCCACGAAG GTGCGCCCCC GTGTCGGGCG GATCGCGATC 
AGCAGCAGCC TGCTCGGTGC GATGTTCCTC ATGGCCACCA GCGCCATCGG CCCCGGCTTC
ATCACCCAGA CCACCACGTT CACCGCCGAG CTGGGCGCGA CCTTCGCGGT CGCCATCCTC
ATCTCGATCC TGGTCGACAT CGCGGTCCAG CTGAACATCT GGCGCGTCAT CGGCGTCGCC
AACACCCGCG CCCAGGACCT GGCCAACAAG GTCGTCCCCG GCGCGGGCTA CCTGCTGGCC
GCGCTCATCG TCTTCGGCGG CCTGATCTTC AACGTCGGCA ACCTCGCGGG CACCGGCCTG
GGCCTGAACG CCATGGTCGG GTTCGACACC CGGCTCGGCG CCGTCCTGTC CGCGGTGCTG
GCCGTGGTCA TCCTGGCCGT CAAGAGGCTG GGCGTGGCCC TGGACCGGAT CCTGGTCGTG
CTGGGCGTGG CCATGATCGC GCTGACCGTC TACGTGGCCT TCGTGTCCCA GCCCCCGGTC
GGCGAGGCCC TGCGCCAGGC CGTCCTGCCC GAGCGGTTCG GCGCGGACGT CTTCCTGGCC
ACCGTCACCA TCATCGGCGG CACCGTCGGC GGCTACATCA CCTACGCGGG CGTGCACCGA
CTCGTCGAGG CGGGTCAGGG CGGGCGCGAG AACGTCCGTG CCATCACGCA GACCTCCGTG
ACCGGCATCA TCGTCACCGG CGTCATGCGC GTGGTGCTGT TCCTGGCCAT CCTCGGCGTC
GTCGCGGGCG GCGCCGACAT CCTGGCGTCC GCCAACCCCT CCGCCGAGGC CTTCCGCCAG
GCCGCGGGCG AGGCCGGTGT GCGCCTGTTC GGCGTCATCC TGTGGGCCGC GGCGGTCAGC
TCCGTCATCG GCGCCTCCTA CACCTCGATC TCCTTCGTCA CGACCTTCCA CCCGTGGTTG
GAGAAGCGCC GGGGCCTCCT GGTCACCGTC TTCATCGGTG TCTCGCTGGC GATCCTGCTG
CTGTCCGGGC AGGCTCCGAA CACGCTGCTG ATCCTGGCCG GAGCCCTCAA CGGCGTGATC
CTGCCGGTGG GCCTCGGCAT CCTGCTGTGG GTCGCGGCCC GCCGCTCCGG CGACCTGCTG
GGCGGGTACC GCTACCCGCG CTGGCTGATC GTCATCGGCG TCGCCGCCTG GGCGCTCACC
GTGTACATGG CCATCGGCTC CCTGGGCGGC ATCGCCGACC TCTGGCAGTA A
 
Protein sequence
MGDTPPAPTK VRPRVGRIAI SSSLLGAMFL MATSAIGPGF ITQTTTFTAE LGATFAVAIL 
ISILVDIAVQ LNIWRVIGVA NTRAQDLANK VVPGAGYLLA ALIVFGGLIF NVGNLAGTGL
GLNAMVGFDT RLGAVLSAVL AVVILAVKRL GVALDRILVV LGVAMIALTV YVAFVSQPPV
GEALRQAVLP ERFGADVFLA TVTIIGGTVG GYITYAGVHR LVEAGQGGRE NVRAITQTSV
TGIIVTGVMR VVLFLAILGV VAGGADILAS ANPSAEAFRQ AAGEAGVRLF GVILWAAAVS
SVIGASYTSI SFVTTFHPWL EKRRGLLVTV FIGVSLAILL LSGQAPNTLL ILAGALNGVI
LPVGLGILLW VAARRSGDLL GGYRYPRWLI VIGVAAWALT VYMAIGSLGG IADLWQ