Gene Ndas_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3083 
Symbol 
ID9246939 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3690362 
End bp3692752 
Gene Length2391 bp 
Protein Length796 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680998 
Protein GI297562024 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCATT CATCCGCCGC CGGTACCGTT CACGAGCTTC TGCGAGAGCG TGTCGCCGAC 
TCCGCCCTCT CCTCCCAGGC CCAGCGGCTT CTCTGGGAGG CCTTCACCGA AATAGCGTCG
GAACCGGAAC ACGGGTCTAT GGAGCCCGCC TTCCTTTCGT CGGTCAAGGT CACCGGGTTC
CGCGGTATCG GCGGTGAGGC GGAGCTGAAC CTTCCTCCCG GGCCAGGCCT GACGATGGTG
TTCGGTGCGA ACGGCTCTGG GAAGTCGAGT TTCGCCGAGG GCATCGAAGC CGCCGTCACC
GGTGACAACG CACGGTGGCA CACCGCCAAG TCCAATGTGT GGTCCGGCAG CTGGCGCAAT
GTCCACACGT CGAAGCCCTC CCGAATCGAC GTGGAGTTCT CCACGGCCGA CGGCGGCGGC
AGCCATACCC TGACCCGCAC CTGGCACGGG CAGAACGCCT CCGACTCGTC GGCGAAGGTG
TCCGCGCCCG ACGGAGCACA GCGACCGCTG GAGGAGCTGG GGTGGAAGCG CGCCCTTCAG
CTATACCGCC CCTTCCTGCC CTACGCGGAG CTCGGGACGG CCTTCACGGG TGCCCGCAGC
GAGGCCCACG ACCGGATCGC CGACATCCTC GGGCTGGAGG AGCTGACCGA GGCGGACGGG
CGGCTGCGGC AGTTGGTCTC CGGGAAGGAG GCGGCGGCCA AGCGGGTACG CGAGGAATGC
GTGTCACTGC GCACCGAACT CGAAGCCATG GACGACCCCC GAGCTCAGGC CGCCGCCGAC
GCGCTGGACG GGCGCAGCCC TGATCTGGCG GCACTGCGGG AGATTCTGGA GAACCAGCGG
GTCCTGGACG AGACCGGGCT GGAGCGCGCA CGTCGCATCT CGCTCCTCCA GACGCCCGAC
GCCTCGGCGA TCACTTCGGC GGCTCGGGAA CTGCGTCGGG CCGCGGAGGA AACTGCACGC
GTGTCGGGTA CGTCGGCCGC CGCCGCGCAA CGACGAGCCG ACCTTCTCGA ATCTGTCCTG
GGACTGCATA TCGATCATCC GCAGGAACAC ACGTGCCCCG TGTGCGGTAC GGCGGATCAG
ATCACCGAAG GGTGGGCCCG CAAGGCCAAA CAGGAGATCG ACACCCTGCG GGAGGAGGGT
GAGCGCGCCC GAGCGGCCCA GCAGCGGTGC GTCTCAGCGG TCCGCAGCGC CCAAGCCTTG
CCGCAGAACG CTCCCGTGTG GCTGCCTGAG GCTCTGAACG CACCGTGGAC GGCCTGGACC
GACTGTCGGA GGATCTCGGA TCCACAGGCG CTGGCCGAGG CTCTGGAGCG TACTGGAAGC
GATCTGGCGA CGGCCAGCGG GCCGGTCATC GAGCGGGCCA AGAGCGAACT GGCCGAGGCT
GACAGCGCAT GGCTGCGCGT CGCCGGGGAT CTCGCCGCGT GGGTCACCCG GGCCGAAGAG
GTGGAACGGG CGCGTCCGGT CGTGCGCGAG GCGAAGAAGG CCCGGGAGTG GCTGAAGTCC
GCCCACGGCG ATCTTCGATC GGCCCGACTC GCCCCTTTCG CCGACAGGAC TCAGGCGATC
TGGGGGGAAC TGCGCCAGGA GAGCAGCGTG TCGTTGAACT CGATCGAACT GATGGGCACC
AACACCACCC GGCACCTGGC GTTGGACGTG TCGGTGGACG ACAAGGCGGC ACAGGCCCTG
GGGGTGATGA GCCAGGGAGA ACTCAACTCC CTGGCCCTGG CGCTCTTCCT GCCCCGCGCG
TGTTCAGAAG AGAGCCCGTA CCGCTTCATC GTGCTCGACG ACCCCGTGCA GTCGATGGAC
GCCGACAAGG TCGCCGGATT CGCCCGGGTT CTACAGGACT ACGCGGCGAG TCGACAGGTC
ATCGTCTTCA CGCACGACAT GCGACTGGTC GATGCGGTCC GATGGCTCCG AATCCCCGCT
ACCGTGATGA ACGTGGACCG CGGGAGTTCG TCCCAGGTGC GGTGTAGGCC GTGCACCTCT
CCGGTCGAGC AGGCTCTGGA AGACGCGTCA GTGATCGTCC ACGACCGCCA TGCCGGACCG
TGGGCGGCCG ATATCGTCCC AGGGCAGTGC CGGATCGCCC TTGAAGCCGC TTTCAAGGGA
GCGGCGGTGG GAAAGCTCAC CGGTCAGGGC CAGTCGTGGT CGGACGCACA GGAGCGGGTC
ACAAAGGCCA CCAAGCTCAC CGACCTCGCA TCGCTCGCAC TGTTCGGCAC CGCCCGCCGG
GCGGCGAGGG ACGTGTACGC GAGATTGGGA GAGCTCTTCG GGCAGTGGGC AGCCGACACG
GTCAGCGCAT GCAACCGCGG CTCCCATGCC CCCGGGACGG TGTCGATGGA CCCCCAGGAG
CTCATCGAGC AGACTCGGCG TCTCGCTCAG CGGATCGGCG GATGGCGATG A
 
Protein sequence
MSHSSAAGTV HELLRERVAD SALSSQAQRL LWEAFTEIAS EPEHGSMEPA FLSSVKVTGF 
RGIGGEAELN LPPGPGLTMV FGANGSGKSS FAEGIEAAVT GDNARWHTAK SNVWSGSWRN
VHTSKPSRID VEFSTADGGG SHTLTRTWHG QNASDSSAKV SAPDGAQRPL EELGWKRALQ
LYRPFLPYAE LGTAFTGARS EAHDRIADIL GLEELTEADG RLRQLVSGKE AAAKRVREEC
VSLRTELEAM DDPRAQAAAD ALDGRSPDLA ALREILENQR VLDETGLERA RRISLLQTPD
ASAITSAARE LRRAAEETAR VSGTSAAAAQ RRADLLESVL GLHIDHPQEH TCPVCGTADQ
ITEGWARKAK QEIDTLREEG ERARAAQQRC VSAVRSAQAL PQNAPVWLPE ALNAPWTAWT
DCRRISDPQA LAEALERTGS DLATASGPVI ERAKSELAEA DSAWLRVAGD LAAWVTRAEE
VERARPVVRE AKKAREWLKS AHGDLRSARL APFADRTQAI WGELRQESSV SLNSIELMGT
NTTRHLALDV SVDDKAAQAL GVMSQGELNS LALALFLPRA CSEESPYRFI VLDDPVQSMD
ADKVAGFARV LQDYAASRQV IVFTHDMRLV DAVRWLRIPA TVMNVDRGSS SQVRCRPCTS
PVEQALEDAS VIVHDRHAGP WAADIVPGQC RIALEAAFKG AAVGKLTGQG QSWSDAQERV
TKATKLTDLA SLALFGTARR AARDVYARLG ELFGQWAADT VSACNRGSHA PGTVSMDPQE
LIEQTRRLAQ RIGGWR