Gene Ndas_1886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1886 
Symbol 
ID9245736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2297679 
End bp2300195 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content76% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003679820 
Protein GI297560846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.292841 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.495378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGAAT CTGAAGCAGG AACGGCCGGC TCGGGCGGCG GCACGGACAC CGCTGCCCGG 
GTGGCGGCGG CGGTGCGGGA CTGCCTCGCG CCGCTGCGGC TCTCCGAGGC CCACGAACCG
GTCGTCGAGC ACGTGCTGAG CGGCACCCGA CCGGAGGCCC TGGCGGCGTT GCGGGAGCGG
CCGACCGGCG CCGACATGGT CGCCAAACCG GACGCGGTCT GGCGGACCGA CCGGCTGACC
GCCGTCGCCG ACGCGCATCC CGGCTGGTCA CTGCGGGAGG CCGACGCGGC ACGGCTGGTG
CTCTACCGGC TCGCGCCGAT CGACCTGCTC GTCCGGTTCG GGCAGGTGCT GCACGCGGTC
ACCGGCAACG CCCCGACGTC CGGGGAGCCG TCGTCGCTCC TCGTGCTGGC CGACGACGTG
CTCCGTGTCC ACGGCGCCGC CGACGGGACC GACGCCGACG ACGTCCGGCG GCGGTGGGAC
CTCCACACCC TCACCGAGGT CGCCCGGGCG GGCGGCGCAC CCGGCCGGAC TCCCGTGCAC
GCCGCCCTGT CGGCGCTGCT GTACTCGGGC TCCGGCCACT GGCCGTTCCG TCGGCACCGG
CTTCTGGAGA GCGAGGCGGG CGTCGCGTTC CTCGCCCGGC ACGCCGACGC GCTCGCGGAC
GTCGTCACCG GGTCCGGTCC GAACACTCGC AGGTACGTGG CGGACCGGTG CGCGCACCGG
CCGGAGGCGC ACGCGGAACT CGCCGCCGAA CTGGCGGTGG ACGCCGAGGC GAGCGTTCGC
GCTCAGGTGC TGTCCGCGCT GGCCCGGACC GACGGCCCCC GGCAGGTGGA CCTGCTGCGA
CGGCACCTGC GGACCGCGCC GCCGGACCGT CTGCCGGACG TCCTGGCCCG CCTGGCCGAC
CTCGACGGCG GCGTCGCGGC GATCGAGGAG GCCCTGGCCG ACGGCGGCGA CGGGACTCAG
GACCCCGGGC GCGAGGGGCT GCTGCGCCGG GCCGCCTCCC GGGTGCGGGC GCTGAGAACC
GCGGAGGCCG CCGTCCCCGT GCCCGACGTC GCCGCACCGC AGGACGCGGA CCTGGCGGAG
GAACTGCGGA CGCTGGGGGC CGGGGGCGGG TCCGACGGGG ACCGTTCCTG GAACGGCGTC
GAGGGGAGGG TGGCGCTGAT GCCCGACGTC CGCGCCCTGC GGGACGCGTT CCGCGCCGCC
GGGATGTCCG ACGCGGACCG GCGGACCGCC TCGCTCCTGG TCACCCGCAC GGACTCCCGC
GGACGCAGGA TCGGCGCCTT CCTCACCCCG GAGGACGCCG AGCGCTGGTG GCCCCTGTTC
GCCGAGCGCC TGGACCTGGC GGACGAGTAC CTCGACGGCG GCGACGGGAG GCGGCACCCG
GACCAGCCCG CCGTGGACAC GAGGACGATG ATCCTGACCG TCCTCGAAAG CTTCCCCGCC
GCCCCCGAGG CTCTGGTCCC GCGCCTGACC TCGCTCGCGC TCGGCGCGAA CCGCCACCGG
CTGGCGGCCC GGCGCGTGCT CGGCGACCAC CCGGACGCCC GGGCGGCAGC GGCGGCCGCG
CTGTCGGACG CCGACGCCCG GACGCGGTCG TCCGCAGCGG AGTGGCTCGC GGGGCTGGGC
GAGCCGGGCG TGGTGGCGCC GGAGCCCGGC TGGGAGTTCG GCGCGGGGGT GCTGCACCCG
TCGGTGAGGG CCCTGCCCGC GTCGGTGCTC TCCTGGCTGG ACCGGTTCCG GGAGCAGGCG
CTCGACAAGG GGGTTCCGGC GGACGACGTG GACCGGTGGC TCGGGCTGGC CCGCCCGAAG
CTGCGGACGG CGCGCGACGG CGGCGGCACG GTCGTCGGCA GGCTCGGCAG CCCGCTGATG
CTCCCGCCGG ACGCGCCCAC CCCGGGCACG GTGTGGGACG ACGACCCCGG CAACCGCGAC
GATCACCAGC TCATCGCGAC GCTCGACCTG GCCGCGATCC CGCCGGAGGC CACCGACATC
CCGCTGCCGC CCGACGGCCA CCTGCTCCTG TTCGCGAACG TCGAGTTGGA CGAGTTCGTC
ATCCCGGGCG GCGCCGCGTA CGTACCGGCG GGGACGCCGG TGGAGGAGCG GGAGTCCTCA
CCGAGCTACG AGCCCTACGA GTACGACTCC CCCGAGGCCC TGGACGAGGA GCTGCGGCGC
ACCGGTGACC TGCGGCTGGT CCCCGGCGTG GGGCTGCCCT CCTGCCCCGT CGAGGACGGG
GACCTCGCCC TGCACCCCCA CGCGGAGACG TTGCAGGAGG TCTGGTCCGA GCAGACCGAC
GGGGGCGGCG AGTGGCAGAT CGGCGGCTAC GCCGCGGACT TCGACGGCTA CGGGGACCCG
GCGCGCGCCT CGGCGTTCCC GGAGGAGGGC GAGCAGTGGT CCAGCCCTGA GGACTGGGTC
CTGCTCGCGC AGTGGGTCGG GGTGCCGATG GGGATCCTGT ACTGGACCAT CACCCGTGAG
GACCTCAAGG CGCGCCGTTT CGACCGCGTC GTGGTGCAGA TGTACTCCAA CCCCTGA
 
Protein sequence
MYESEAGTAG SGGGTDTAAR VAAAVRDCLA PLRLSEAHEP VVEHVLSGTR PEALAALRER 
PTGADMVAKP DAVWRTDRLT AVADAHPGWS LREADAARLV LYRLAPIDLL VRFGQVLHAV
TGNAPTSGEP SSLLVLADDV LRVHGAADGT DADDVRRRWD LHTLTEVARA GGAPGRTPVH
AALSALLYSG SGHWPFRRHR LLESEAGVAF LARHADALAD VVTGSGPNTR RYVADRCAHR
PEAHAELAAE LAVDAEASVR AQVLSALART DGPRQVDLLR RHLRTAPPDR LPDVLARLAD
LDGGVAAIEE ALADGGDGTQ DPGREGLLRR AASRVRALRT AEAAVPVPDV AAPQDADLAE
ELRTLGAGGG SDGDRSWNGV EGRVALMPDV RALRDAFRAA GMSDADRRTA SLLVTRTDSR
GRRIGAFLTP EDAERWWPLF AERLDLADEY LDGGDGRRHP DQPAVDTRTM ILTVLESFPA
APEALVPRLT SLALGANRHR LAARRVLGDH PDARAAAAAA LSDADARTRS SAAEWLAGLG
EPGVVAPEPG WEFGAGVLHP SVRALPASVL SWLDRFREQA LDKGVPADDV DRWLGLARPK
LRTARDGGGT VVGRLGSPLM LPPDAPTPGT VWDDDPGNRD DHQLIATLDL AAIPPEATDI
PLPPDGHLLL FANVELDEFV IPGGAAYVPA GTPVEERESS PSYEPYEYDS PEALDEELRR
TGDLRLVPGV GLPSCPVEDG DLALHPHAET LQEVWSEQTD GGGEWQIGGY AADFDGYGDP
ARASAFPEEG EQWSSPEDWV LLAQWVGVPM GILYWTITRE DLKARRFDRV VVQMYSNP