Gene Ndas_0260 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0260 
Symbol 
ID9244094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp324192 
End bp325856 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content68% 
IMG OID 
Productprotein of unknown function DUF1023 
Protein accessionYP_003678215 
Protein GI297559241 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.160803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGACC TCTCCTCCAT GTCCCTCGGT CTGCCCGAGG ACATCGCCGC CGACGTCGGC 
GCCATCGAGA CGGGCGCCGA CCAGCTCGAC GCCGTCCGAC AGAACATGCT CGGCCAGTCC
GAGGGCACCC ACAGCCGGTT CCGGTCCTCC GCCGGGGAGT TCACCGACCT CGTCGCCTGG
AACATCGTCT CCAGTTCGTC CCAGGAACTG TCCTCCTGGC AGGAGGCCGC CGCTTCCCTG
ACCTACGGCG CGGCGGTCCT GCGCCAGTGG GGGCTCGACA TCGAGACCTA CCGTGCCGAG
CGCGCCAAGC TGGAGACCCG CTGGGAGGAG GAGAAGGCCG ACGCTGAAGC GGCCGTCGGC
GCCTCCGAGG GGAGCGGCTC CATCCTCGGT GAGGGAACCC GTGAGGGGAT GAAGGTCGCC
CAGCTGGAAA CGCTGCGCGC GGAACTGCTC TCCGAGCACT CCGGGCACTG GGAAACCCTG
ATGGAGCAGG CCGAGCAGAC CGAGAGGGAC CTGCGCGACG GCCCCAGCCA GGACAGCCTG
GAGCGTCTGA TCGAGTCGTC CCTGCTCACC GGCGGCCAGC TGTCCTACTT CGGTGACGCG
GTCCCCAGCA TGGTCCCGGA CGAGCTGAGC GGTGACGAGC ACCCGTCCGT CGTCAACCTG
TGGTGGACCT CGATGACCGA GGAGGAGCAG GACCAGGCCG TGCGGGACCA TCCCGAGCTG
CTGCGCGAGC TCGACGGCAT CCCGGCGGCC GTGCGCGACC GACTGAACCG CGACCATCTC
GACGACGAGA TCGAGCGGTT CGAGGAGGAG ATCGCCGAAC GGGACGAGGA GATCGGGGAG
GCGGCCGCCC GGGGCAGCAA CGGGTCCGAC GCGATCGCTT TGGCCATGGC GAACGACGAC
ACGCTCGACA ACCAGCTCCA GGAGCTGAAG GAGCTGCGGG AGAGCCTGGA GGACGAAAGC
GCTGACAGGT ACCTGCTGGC TCTGGACACC GGGGGCGACG GACGGGCGAT CGTGGCCAAC
GGCAACCCCG ACACCGCCGA CAACGTGGCG ACCCTGGTGC CGGGCACCAC GACGACCTGG
GAGAGCATCA ACGACCAGAT GGGACGCGCG GACGCTTTGG CGGACTCTGC GAACCGGGTC
AGCCGCGACC AGGACCACTC CGTCATCAGC TGGATCGGCT ACGACGCCCC CAACGTCCCC
GAGGCGGCCT TCGAAGGACG GGCGGAGGAC GCGGTCAGCG AGCTGAGCAG TTTCCAGGAC
GGACTGCGCT CCACCCATCA GGGGCCGCCG TCCCATAACA CCGTCATAGG CCACAGTTAC
GGTTCCACGG TGGTCGGGCA CACCGCGCAG AGCGACGCCG GGCTCGACAC GGACGAAGTG
ATACTCGTGG GCAGCCCCGG AACCAACGCC GACCACGTGA CCGACCTGAA TCTTCCCGCC
GAGAACGTGC ACGTCTCAAC GGCGGAGAAT GACGGCATCA CCAACCTGAC GGGCCTCACG
CACGGCATGG ACCCGACCGA TCCGGAATTC GGAGCGAACG TGTTCGAGTC CGACCCTGGC
AGCGAGGGTG GCACGTGGCC CCTCGGTGAC GCCCATTCGG AGTACTTCGA CGAGAACACG
AGTTCGCTGA GGCACATGGG CTCTGTCATC GCGGGACAAG AGTAG
 
Protein sequence
MSDLSSMSLG LPEDIAADVG AIETGADQLD AVRQNMLGQS EGTHSRFRSS AGEFTDLVAW 
NIVSSSSQEL SSWQEAAASL TYGAAVLRQW GLDIETYRAE RAKLETRWEE EKADAEAAVG
ASEGSGSILG EGTREGMKVA QLETLRAELL SEHSGHWETL MEQAEQTERD LRDGPSQDSL
ERLIESSLLT GGQLSYFGDA VPSMVPDELS GDEHPSVVNL WWTSMTEEEQ DQAVRDHPEL
LRELDGIPAA VRDRLNRDHL DDEIERFEEE IAERDEEIGE AAARGSNGSD AIALAMANDD
TLDNQLQELK ELRESLEDES ADRYLLALDT GGDGRAIVAN GNPDTADNVA TLVPGTTTTW
ESINDQMGRA DALADSANRV SRDQDHSVIS WIGYDAPNVP EAAFEGRAED AVSELSSFQD
GLRSTHQGPP SHNTVIGHSY GSTVVGHTAQ SDAGLDTDEV ILVGSPGTNA DHVTDLNLPA
ENVHVSTAEN DGITNLTGLT HGMDPTDPEF GANVFESDPG SEGGTWPLGD AHSEYFDENT
SSLRHMGSVI AGQE