Gene Ndas_3394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3394 
Symbol 
ID9247259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4057904 
End bp4059316 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content69% 
IMG OID 
Productsecreted protein 
Protein accessionYP_003681305 
Protein GI297562331 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.118882 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCAT CCCCCCCGAC CCCCACCTCC CGGCGCGCCA GACGCGGACT GACCGGAGTC 
TCCCTCGCCG GAGCGGCGGT GCTCATGGTC GGGCTCCTCG GCCCGGCCTC GGCCTCGGCG
GACGACGAGT TCGCCAGTGA CGCCGACACC AGTGCCAACA TCACCCATGT GACCAACGTC
CCGAAGACCG AGGCCATGAG TGACTTCAAC TCCGACCTCG CCTTCAGCGG CGACTACGCC
ATCGGCGGCA ACTACAACGG CTTCGTCATC TACGACATCT CCGAGCCGGA GGAGCCGCAG
GTCGTCTCCG AGGTCCTGTG CCCGGGCGGA CAGGGCGACG TGTCGGTCAG CGGCGACCTG
CTCTACTTCT CGGTGGACTA CCCGCGCGCG AGCACCGAGT GCGGCGCGCC CTCCGTGCCG
ACCACCGACC CGGACGGCTT CGAGGGCATC CGGATCTTCG ACATCTCCGA CAAGGCCAAC
CCCCAGTACG TGTCGGCGGT GCGCACCGAC TGCGGCTCGC ACACCAACAC CCTGGTGCCG
GGCAAGGACG GCGAACACGA GTACGTGTAC GTCTCCTCGT ACTCGCCCTC GGAGCGCTTC
CCGAACTGCC AGCCGCCGCA CGACAAGATC TCCGTCATCG AGATCCCCGT CGACGAACCG
TCCGAGGCCA AGGTCATCGA CACCCCGGTG CTCTTCCCCG ACGGCGGCAA CACCGGCACC
GCGGGCTGCC ACGACATCAC CGCCTACGCC GAGCGCGACA TCGCCGCGGG CGCCTGCATG
GGCGACGGCG TGCTGATGGA CATCTCCGAC CCGGCCGCGC CGGTCGTCAC CGAGGTGGTC
CAGGACGACA ACTTCGCGTT CTGGCACTCG GCCACCTTCA CCAACAACGG CGAGACCGTG
CTGTTCACCG ACGAACTCGG CGGAGGCGGC GGCGCCACCT GCAACGAGGA GATCGGCTCC
GAGCGCGGCG CCAACGCCAT CTACGCCATC GGCGGCGGCC AGGACTCCCC CGACCTGGAG
TTCGAGGGCT ACTACAAGCT GCCCCGCCAC CAGGCCGACA CCGAGAACTG CGTCGCGCAC
AACGGCTCGC TGATCCCCGT CCCCGGACGC GACTACTTCG TGCAGTCCTG GTACCAGGGC
GGCGTCTCGG TGATCGACTT CAACGACCCG GCCGACCCCC GGGAGATCGG CCACTTCGAC
CGCGGCCCGA TCAGCGAGGA CAACCTCGTC CTGGGCGGTT CCTGGTCGGC CTACTACTAC
AACGGCTACG TCTACTCCTC CGACATCACC CGCGGGCTCG ACGTCCTGAA ACTGGAGGAC
CCCCGGTTCC GGGTGGCCGA GCAGGTGCGG ATGGACGAGT TCAACCCCCA GTCCCAGCCC
GAGTACCGGC CGGGCCGCGG CAACGGCCGC TGA
 
Protein sequence
MASSPPTPTS RRARRGLTGV SLAGAAVLMV GLLGPASASA DDEFASDADT SANITHVTNV 
PKTEAMSDFN SDLAFSGDYA IGGNYNGFVI YDISEPEEPQ VVSEVLCPGG QGDVSVSGDL
LYFSVDYPRA STECGAPSVP TTDPDGFEGI RIFDISDKAN PQYVSAVRTD CGSHTNTLVP
GKDGEHEYVY VSSYSPSERF PNCQPPHDKI SVIEIPVDEP SEAKVIDTPV LFPDGGNTGT
AGCHDITAYA ERDIAAGACM GDGVLMDISD PAAPVVTEVV QDDNFAFWHS ATFTNNGETV
LFTDELGGGG GATCNEEIGS ERGANAIYAI GGGQDSPDLE FEGYYKLPRH QADTENCVAH
NGSLIPVPGR DYFVQSWYQG GVSVIDFNDP ADPREIGHFD RGPISEDNLV LGGSWSAYYY
NGYVYSSDIT RGLDVLKLED PRFRVAEQVR MDEFNPQSQP EYRPGRGNGR