Gene Ndas_4142 details

Gene Information

Locus tag: Ndas_4142
Symbol:
ID: 9248016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4941749
End bp: 4944889
Gene Length: 3141 bp
Protein Length: 1046 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: YD repeat protein
Protein accession: YP_003682043
Protein GI: 297563069
COG category:
COG ID:
TIGRFAM ID:
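
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4941749
end_bp = 4944889

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints "3141 1046"
```

Under these assumptions the result agrees with the listed gene length of 3141 bp and protein length of 1046 aa.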


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAAAGC ACATCCCCTG GTCCCTGACG GCCGCGACGG CCGCCTCCAC CCTCGTGCTG 
GGGCTGCTGT CCGCGCCCCC GGCCCTGGCC GACACCGACC CCCACTCACC CGCCGACCGC
ACCGTTCCGG TCCCCGACAC CTGGGAGCCG GGCGAGGCCA ACACCGAACC CGTGGGCGGA
GAACCACCCG ACGAACCCTG GGCTCCGCCC GGGGCCGAGA GCGCACCCGA GGCCCAGAAC
CGTTCGGAAG AACCCGAAAC CCGGGTCGGC ACCTGCGCGG ACGGCGTCAA CCGCGGTATC
CAGCCCTGGT ACCCGCTGGA ACGCCACCAG ATCAGCGACC GCATGGAACT GACGGTCAAC
ACCCAGGGCG GCAACCTCGT CGTGCGCCAC CGGGACCTGA CCGTGGCCGG AACGGGGCTG
GACCTGTCGG TGTCCAGCTT CTACAACTCC GCGCCCTCCT ACGACGGCTG GACCCTCTCC
CACGGCCAGG ACGTGGGACT GAGCATCTTC AGCAACAGCA TCATCTTCCA GGGCCCCTCC
GGCTACTGCG AACGCTTCGA CATCGCCGAG GACGGCTCCT TCACCCCGCC TGCGGGGCTC
AACGCCGAAC TGGAGGAGCT GTCGAATGGC CACTACGCGC TGACCTTCCA CCGAGGCGAG
TACGCCGACC AGGTGTGGAC CTTCAACGCC AACGGGTGGC TGTACTCCCA GGCCGACCGC
AACGGCCACA CTCACCGGAT GCGCTACGAC ACCGAGGGAG ACCTGGCCTC GATCGTGGAC
ACCCAGGACC GGGTCACCAC CTTCGAGTGG GACCCCGACG CCTGGGAGCA GCCGACCCGG
ATCACCGACC CGGTCGGCGA GACCGCCACG GCCTTCACCT GGAACGGCGA CGGCGGCATG
GTCGCGCTGA CCGACCGCGC GGGCCAGGGC ATCGAGTTCG GCTACGACGG CGACGGCCTG
CTGACCTCGA TCACCGACGC CACCGGCGCG GTGTGGGAGA TCGCCTACAA CGCCGACGAC
CAGGTCTCCA GCCTGACCGT GCCAGACGGC ACCGAGGACG GGGCCACCAC CGCCTACTCC
TACGCCGACG GTGCCTGGTC CAACACCCAG ACCACGGTCA CCGATCCGGG CGGCGGTGAG
TCCACCCTGG AGTTCGACGA CCAGGGAAGG CAGACCTCCG CGACCGACCA GGTCGGCAAC
ACCCGCTCCC AGACCTGGAC GGCGAACTCG GACGTGGCCA CCACCACCGA CGCCCTGCAA
GCCTCGGTGA CCTACGACTA CGACGAGTTC AACAACCTCA TCGGCACCGA ACTGCCCACA
GGGGCGGCCA CTTCGGTCGG GTTCGCGGAC ACGGCCAACC CGGCCAAGCC CACCAGCGTG
ACCACCCCGG ACGGGGACAC GCTTCAGATG TCCTACGACG ACGCCGGCAA CCTCACCTCG
GCAGTCCAGG AGGAGATGGA CATCGAGGTG GCCGACATCC GTTACCACTC CAACGGGTTG
GTCAACCAGG TCATCGATGC CAACGGCAAC TCCACCAGCT ACTCCTACGA CCGCGCCGGG
AACATGACGG CGATGGACGA GCCGGGGCCG ATGGGCACCA CCCAATACGG CTACGACGCG
CTGTCACGGG TCACCTCGGT CACCGACGGC AACGGTGTGA GGCTGGAGTA CGGCTACGAC
CGGCTGGACC GGATCGTGTC CATCTCCCAG GGCGGCGACC TGCTCCAGGC CATCGCCTAC
GACGGCAACG GCCGCCAGGT CGCCACCCAC ACCGACCAGG CCAGCGTGGA GCACTCCTAC
AACGGCCGCG GCGACCTGCT CGAAACCGTG CGCACCGACT CGGCCGGAAG CGAGAGGACC
ACCTACGCCT ACGACACCGC GGGCAACGTC ACCGAGATGG TCGAACACGG TAAGACCACG
ACCTACGGCT ACGACGCCGC GTTCCGGCTC ACCTCGCTCA CCGACCACAC CGGGGCCGAG
ACCACCTTCA CCAACGACGC CAACAACCGC CGCACCAGCA TCACCCACCC CGACGAGGCC
GTGGAGGAGC GCACCTACGA CGACTCCGGG CGCCTGACCG GCATCACCAC CACCGGGGCC
GCCGACCAGG CTCTGGTGGA GGCTGTCTAC TCCTACGACA ACGACGGCAC CGACAGCGAC
CAGTTGCAGT CGCGCACCAT CGGCGGTGAG ACCGAGACCT TCACCTACGA CGGCCTGGAC
CGACTCACCA GCGACGGCAC CACCGACTAC ACCTACGACG ACGTCGGCAA CCTGCTCACC
GCCGGGAATG AGGAGTTCGC CTACAACGAC GCCGACCAGC CCACCTCGGC TCGGGGCCAG
GAGGTGGGCC ACGACGAGGC CGGAAACATG ACCTCGCGGG GTGAGTACGT CTACGAGTAC
TCGGTGACCA ACCAGACGCT GCGCTCCAAC GACGGTCAGG GCGAACTGGC CTCGTGGCTG
AGCTACGACA CCACCGACCA GACCCAGATG CGCGGCGTCA CCGACGTGCA CGAGGGCAGC
CGGGTGGAGC GGCAGCTGTC CAACACCGCG CTCGGGGTCA CCAACATCGC CTCCGAGGGT
GAGCGCACCA GCTTCGTGCG CGACCCGGAG GGGCGTCTGG TCTCCATGGT GGCCTGGGAC
GGCGACGAGC GGTTCCACTA CACCCTCGAC CAGCAGAACA CCGTGCTGGC GCTGACGGCT
GAGGGTTCGG AGGCTGAGTC CCCGGACGTG GTCTATGACT ACAGCCCCTA CGGTGAGCGC
ACCTCTGAGA GCCTGGAGGG GACCGAGGCG GCGGCGTTGA GCCCGTTCGG GTTCACCGGC
GCCTACCAGT TCCAGGACGG CACGGTGCAT CTGAACCACC GCTTCCACAG CACGTTCACG
TTGGGCTTCA CCCAGCCGGA CCCGTCCCGG CAGGAGTTGA ACAACTACGC CTACGCGGCC
TGCGACCCCA TCAACAACAC CGACCCCACC GGGCTTTCGG CCGTTCCGTG GCCACAGGCA
GCGTGCTGGG GTGCAACCAT CTACATGCTG AAGCTGTCTA CCGCATCCTT GGTTCTCAAT
GTGTGGGGAA CACCAGTGCA CATCGCTACA GCTGTAGTGA CTGTCGGATG CCTGGCTTAT
GACATGACTG TCTCCACCTG A
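
The GC content reported in the Gene Information table can be recomputed from a sequence printed in this 10-base block layout. The helper below is a minimal sketch: the function name `gc_fraction` is hypothetical, and it assumes the input contains only A, C, G, T and whitespace.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring all whitespace."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks between blocks
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: pass the gene sequence as one string; only the first line is shown here.
first_line = "GTGAGAAAGC ACATCCCCTG GTCCCTGACG GCCGCGACGG CCGCCTCCAC CCTCGTGCTG"
print(f"{gc_fraction(first_line):.0%}")
```

Applied to the complete gene sequence rather than a single line, the value would be expected to round to the 68% given in the record.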
 
Protein sequence
MRKHIPWSLT AATAASTLVL GLLSAPPALA DTDPHSPADR TVPVPDTWEP GEANTEPVGG 
EPPDEPWAPP GAESAPEAQN RSEEPETRVG TCADGVNRGI QPWYPLERHQ ISDRMELTVN
TQGGNLVVRH RDLTVAGTGL DLSVSSFYNS APSYDGWTLS HGQDVGLSIF SNSIIFQGPS
GYCERFDIAE DGSFTPPAGL NAELEELSNG HYALTFHRGE YADQVWTFNA NGWLYSQADR
NGHTHRMRYD TEGDLASIVD TQDRVTTFEW DPDAWEQPTR ITDPVGETAT AFTWNGDGGM
VALTDRAGQG IEFGYDGDGL LTSITDATGA VWEIAYNADD QVSSLTVPDG TEDGATTAYS
YADGAWSNTQ TTVTDPGGGE STLEFDDQGR QTSATDQVGN TRSQTWTANS DVATTTDALQ
ASVTYDYDEF NNLIGTELPT GAATSVGFAD TANPAKPTSV TTPDGDTLQM SYDDAGNLTS
AVQEEMDIEV ADIRYHSNGL VNQVIDANGN STSYSYDRAG NMTAMDEPGP MGTTQYGYDA
LSRVTSVTDG NGVRLEYGYD RLDRIVSISQ GGDLLQAIAY DGNGRQVATH TDQASVEHSY
NGRGDLLETV RTDSAGSERT TYAYDTAGNV TEMVEHGKTT TYGYDAAFRL TSLTDHTGAE
TTFTNDANNR RTSITHPDEA VEERTYDDSG RLTGITTTGA ADQALVEAVY SYDNDGTDSD
QLQSRTIGGE TETFTYDGLD RLTSDGTTDY TYDDVGNLLT AGNEEFAYND ADQPTSARGQ
EVGHDEAGNM TSRGEYVYEY SVTNQTLRSN DGQGELASWL SYDTTDQTQM RGVTDVHEGS
RVERQLSNTA LGVTNIASEG ERTSFVRDPE GRLVSMVAWD GDERFHYTLD QQNTVLALTA
EGSEAESPDV VYDYSPYGER TSESLEGTEA AALSPFGFTG AYQFQDGTVH LNHRFHSTFT
LGFTQPDPSR QELNNYAYAA CDPINNTDPT GLSAVPWPQA ACWGATIYML KLSTASLVLN
VWGTPVHIAT AVVTVGCLAY DMTVST
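
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted alternative initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is a simplified stand-in for a full translator: the function name `translate_cds` is hypothetical, the codon table is the standard amino-acid assignment (which table 11 shares), and the first codon is assumed to be a valid initiator without being checked.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading the first codon as Met.

    Assumption: the first codon is a valid initiator (for example ATG or GTG).
    Stop codons are emitted as '*'. Trailing bases that do not fill a codon
    are ignored.
    """
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons:
        return ""
    return "M" + "".join(CODON_TABLE[codon] for codon in codons[1:])


# First three 10-base blocks of the gene sequence.
print(translate_cds("GTGAGAAAGC ACATCCCCTG GTCCCTGACG"))  # prints "MRKHIPWSLT"
```

The output matches the first ten residues of the protein sequence listed above.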