Gene Ndas_3795 details


Gene Information

Locus tag: Ndas_3795
Symbol:
ID: 9247666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4556517
End bp: 4559540
Gene Length: 3024 bp
Protein Length: 1007 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: protein of unknown function UPF0182
Protein accession: YP_003681699
Protein GI: 297562725
COG category:
COG ID:
TIGRFAM ID:
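
The coordinates and lengths above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that does not appear in the protein. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4556517, 4559540)
print(length)                  # 3024
print(protein_length(length))  # 1007
```

Both values agree with the Gene Length and Protein Length fields of the record.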


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.214749
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0401617
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCTTCC GATCGCCCGG CGCACCACCT GCGCGTATGC CTCGCCGATC AAGGTTGCTC 
GCGCCAGTCG CGGCGACCGT GGTCGTCATC ATCGCGGGGC TGATGCTCGC CGCGAACTTC
TGGACCGACT TCAAGTGGTT CGAATCCGTC GGCTACACCT CGGTCTTCCT GACCGAACTG
TGGACCCGCG TGCTGCTGTT CGCCGTCGCC GGACTGGTGA TGGCCGTCAT CGTCGGCGCC
AGCATCTTCT TCGCCTACCG CTCCCGGCCC GGCATCCGAC CGATGAGCCT GGAGCAGCAG
GGCCTGGACC GCTACCGGCA GTCCATCGAC CCGCACCGCA AACTGTTCTT CTGGATCGCC
GTGGGCGCCC TCGCGCTGCT GGCCGGAGCC GCGGCCAGCG GTGACTGGCG GTCCTACCTC
CAGTTCGTGA ACAGCACCGA CTTCGGGGTG AACGACCCCG AGTTCGGCAT GGACGTGGCC
TTCTTCGCGT TCACCTACCC GTTCCTGCGC ATCCTGCTCG GCTACCTGTA CGCGGCGGTC
ATCCTGGCGT TCATCGCCGC GGTGATCGTG CACTACCTGT ACGGCGGGGT CCGCCTCCAG
AACGACTCCG GCCAGCGCGC CACCCCCTCG GCCCGCGTGC ACCTGTCGGT GCTGCTGGGC
CTGTTCGTGC TGCTGCGCGG CGGTTCCTAC TGGCTGGACC GCTACGGCCT GGTCTTCTCC
GAGCGCGGCT ACACCTTCGG CGCCTCGTAC ACCGACGTGA ACGCCGTCAA GACCGCGCTG
CTCATCCTCA CGGTCATCTC GGTCATCTGC GCCGTGCTGT TCTTCGCCAA CATCTACTTC
AAGAACATCA TGGTGCCCAT GGCCAGCCTC GGACTGCTGG TGCTGTCCGC GGTGCTGGTC
GGCGTGGCCT ACCCCGAGAT CGTCCAGCGC TTCCAGGTCG CGCCCAACGA GCAGCGCCTG
GAGAGCCCCT ACATCGAGCG CAACATCGAG TACACGCGCC AGGCCTACGG CATCGACGAC
GCCGAGGTGC AGGCCTACGA CGCCACCACC GAGCTCAGCG CCCAGGACCT GGTGGAGGAG
TCCCAGGACA CCACCGTGCG CCTGGTCGAC CCCTCCGTCG TCTCGCAGAC CTTCCAGCAG
ATGCAGCAGG TCCGCGGCTT CTACCAGTTC CCCGAGGTGC TGGAGGTCGA CCGCTACCCG
GACTCCGAGG GCAACCTGAT CGACACGATC GTGGCCGTCC GAGAGCTGGA CGGGCCGCCC
GCCGACCAGG ACAACTGGCT CAACCGGCAC CTGATCTACA CCCACGGCTA CGGCATGGTC
GCCGCCGCGG GCACCCAGAT CGACGCCGAG GGGCGCCCGG TCTTCACCGA GTACAACATC
CCTCCGCGCG GTGAGCTGAG CGACGTCGTC GGCGAGTACG AGCCGCGCAT CTACTACGGC
CGCGAGGGCG CCGAGTACGC GATCGTGCAG GCCGAGGAGG AGTACGACTA CCCCCTCGAC
GCCGAGGAGG AGACCGGCGA CGTCCCGACC CCGGAGGACG CGGTCGAGCC GGAGGTCTCC
CCGAGCCCGG ACGAGGCCCG CGCCCCGTCC GACGCGGACC AGGAGGCCTC GGAGCAGACC
GCAGAGGAGG CGCCCGCCGA GGGCGGCGGC GAGGGCGCCG GAGGCGGTGG GGAGGGCAGC
GACTCGCAGG CCTACAACCG CTACGACGGC GACGGCGGCG TCCAGCTGGC CAGCTTCTTC
GACCGGATCC TGTACGCGAT CAAGTACCAG GAACCCAACA TCCTGCTCAA CAGCGCCATC
ACCAACGACT CGCGCATCAT CTACGAGCGC GACCCGGTGG AGCGCGTGGA GAAGGTGGCC
CCGTACCTGA CCACGGACAG CCGACCCTAC CCGGCGGTCG TCGACGGCCG GGTCGTGTGG
ATCGTGGACG CCTACACCAC GTCCGACGGC TACCCGTACG CCAACCGGAT CGACTTCACC
CAGGCGGTGA CCGACACCTT CACCGACGGC TCGGCCCAGC AGGTGGGCGC GCTGCCGGGC
AACGAGGTCA ACTACATCCG CAACTCGGTG AAGGCCACCG TCGACGCCTA CGACGGCACC
GTCACCCTGT ACGCGTGGGA CGAGGCGGAC CCGGTCCTCC AGACCTGGAT GGACGCCTTC
CCCGGGACCG TCGTCGGCAG GGACCAGATG AGCGAGGAGC TCGTCGACCA CCTGCGCTAC
CCGGACGACC TGTTCAAGGT GCAGCGCCAG ATCATGCGCG AGTATCACGT GACGGACGCC
GCGGCCTACT ACGGCGGTCA GGACTTCTGG TCGGTCCCCA GCGACCCGAC CAGCGAGACC
GATGCCCCCG AGCCGCCCTA CCGGCAGACC ATCCAGTACC CGGGTGAGGA GTCCACCTTC
TCGCTGACCA GCACGTTCGT GCCGCGCGGC CGTGAGAACC TGGCGGCCTT CATGGCGGTG
GACAGCGATC CGCGCTCCGA GGAGTACGGC CAGCTCAAGC TGCTGGAGCT GCCGCGGAGC
ACGGTGATCC TCGGCCCGGG GCAGGTGCAG AACGCCTTCG ACGCCGACGC CGACGTCCGC
GAGGTGCTGC TGCCGCTGGA GCAGTCCAAC GCGGAGGTGA CGCGGGGCAA CCTGCTCACG
CTGCCCTTCG CCGGGGGCCT GCTCTACGTC GAGCCGCTGT ACGTGCAGGC GGGCGGCGGC
GGAGCCGCCT CCTTCCCGCT GCTCCAGCAG GTCATGGTCG GCTTCGGTGA CGAGGTGGCC
ATCGGCAACA GCCTGCCGGA CGCGCTCAGC AACCTCTTCG ACGGAGAGGG CGCGGCCCCC
GAGGAGGGCG TCGAGCAGAC TCCCGAGGAG GAGTCCGAGG CCGGTGGCGG CGGCGGAGGC
GGTGGTGGGG GCAACGAGGA CCTCACCGAG TCCCTCAACG AGGCCGTCGA GGCCTGGGAA
GAGGCCCAGC AGTCCAACGA GGAGGCCAAC GACCGCCTGC GCGAGGCGCT GGAGGACATC
CAGCAGCAGC TGGACGAGAA CTGA
 
Protein sequence
MSFRSPGAPP ARMPRRSRLL APVAATVVVI IAGLMLAANF WTDFKWFESV GYTSVFLTEL 
WTRVLLFAVA GLVMAVIVGA SIFFAYRSRP GIRPMSLEQQ GLDRYRQSID PHRKLFFWIA
VGALALLAGA AASGDWRSYL QFVNSTDFGV NDPEFGMDVA FFAFTYPFLR ILLGYLYAAV
ILAFIAAVIV HYLYGGVRLQ NDSGQRATPS ARVHLSVLLG LFVLLRGGSY WLDRYGLVFS
ERGYTFGASY TDVNAVKTAL LILTVISVIC AVLFFANIYF KNIMVPMASL GLLVLSAVLV
GVAYPEIVQR FQVAPNEQRL ESPYIERNIE YTRQAYGIDD AEVQAYDATT ELSAQDLVEE
SQDTTVRLVD PSVVSQTFQQ MQQVRGFYQF PEVLEVDRYP DSEGNLIDTI VAVRELDGPP
ADQDNWLNRH LIYTHGYGMV AAAGTQIDAE GRPVFTEYNI PPRGELSDVV GEYEPRIYYG
REGAEYAIVQ AEEEYDYPLD AEEETGDVPT PEDAVEPEVS PSPDEARAPS DADQEASEQT
AEEAPAEGGG EGAGGGGEGS DSQAYNRYDG DGGVQLASFF DRILYAIKYQ EPNILLNSAI
TNDSRIIYER DPVERVEKVA PYLTTDSRPY PAVVDGRVVW IVDAYTTSDG YPYANRIDFT
QAVTDTFTDG SAQQVGALPG NEVNYIRNSV KATVDAYDGT VTLYAWDEAD PVLQTWMDAF
PGTVVGRDQM SEELVDHLRY PDDLFKVQRQ IMREYHVTDA AAYYGGQDFW SVPSDPTSET
DAPEPPYRQT IQYPGEESTF SLTSTFVPRG RENLAAFMAV DSDPRSEEYG QLKLLELPRS
TVILGPGQVQ NAFDADADVR EVLLPLEQSN AEVTRGNLLT LPFAGGLLYV EPLYVQAGGG
GAASFPLLQQ VMVGFGDEVA IGNSLPDALS NLFDGEGAAP EEGVEQTPEE ESEAGGGGGG
GGGGNEDLTE SLNEAVEAWE EAQQSNEEAN DRLREALEDI QQQLDEN
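
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below is one hedged way to do that in plain Python: it strips the whitespace, computes a GC fraction, and translates the DNA. It assumes the amino acid assignments of NCBI translation table 11 (the same as the standard code) and that the first codon is a valid initiator, so it is reported as M even when it is GTG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as sample input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is reported as M.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "GTGAGCTTCC GATCGCCCGG CGCACCACCT"
print(translate(first_blocks))    # MSFRSPGAPP
print(gc_fraction(first_blocks))  # 0.7
```

The translated fragment matches the first ten residues of the protein sequence. Running `gc_fraction` on the full gene sequence would be the way to compare against the GC content field in the record.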