Gene Ndas_1122 details


Gene Information

Locus tag: Ndas_1122
Symbol:
ID: 9244972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1377815
End bp: 1379476
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003679069
Protein GI: 297560095
COG category:
COG ID:
TIGRFAM ID:
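
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1377815
end_bp = 1379476
gene_length_bp = 1662
protein_length_aa = 553

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not
# counted as an amino acid.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)  # 1662 553
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.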


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.106292
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGGCC TCCTGATCCG GTTGAAGTTC ACCCTGCTGC GGCACTCGGT GACGGGGATG 
CGGCTGTTCG GGATCGCCCT GGTCGTCGGC GGGACCGCGC TGACCTGGTA CCTGGCGGTG
GCGGCGGCCT CGGACGGCGT GCGCGCGGAC CTGCTCTGCC TGGCGTTCGC GGTGTGGGCG
GTGGGCTGGA TGCTCGGCCC CACCGTCGCC AACGGCACGG GTGTGCTGCG CTCGCAGTAC
TTCGCGCTGC TGCCCCTGGA CCGGCGCCGG GTGGGCGCCC TGCTGCTGGT GACGATGTTC
GTGGACGTCG GTCCCGCCGT CACTCTGCTG GCGCTGGGCG CGCTCGTGTG GCACGCGCTG
GCGCTGGACC CGTCCGCGCT GGTGGTGGCG GTGCCGGGGG TCCTGGTGCT GTGGGTGTTC
GTGGTGACGC TGTCCCGGCT GGTGTTCCGG GCGCTGGGCG CGGCCATGCA CTCGCGGCTG
GGCATGGAGA TCGCGTCGGT GCAGTGGGGG CTGATCCTGG CCGGACTCTT CTTCGGGTGG
ATCGCGGTGC AGCCCGCGTT CCAGGCGACG CTGAGCCTGC GCGAGAACGG TCTGGGCGAG
GGCGTGGTGG GCACCGTGCT GGCCGCGCTG CCGACCTCGT GGCCGGTGCT GGCCGTGCAG
GCCGCCGCGG CCGGTTCCTG GCCCGCGGCC GCGGCCTGGC TGGGCGGGTT CGCCCTGCTC
ACGGTGGCGA TGGTGACGGT GACCTGCGTG CTGCTGGCGC CCCGGGTCGC GGCGCGCGGG
GTTCGGCGGC GCCGCGGCCC CGGGGGCGGT GCGCTCACGC GCCGGGCGTT CGCGCTGCTG
CCGGACTCCC CCCTGGGCGC GGTGGTCGCC AGGGAGCTGC GCCAGTGGTG GCGCGACCCG
TGGCGGGGCC TGGAGCTGCG GGCGTCGCTG TGGGCGGCGC TGTTCACGGG CCTGCTGGCC
TGGCCCACCG ACCTGTACGC GTTCTTCTCC CCGTTCGCGG GGGTGATCGC GGCCTTCGTC
ATGGCGCTGG CCACCTCGAA CATGTACGGG CACGACGGCA CGGCGCTGTG GCTGTCGGTG
GTCGGCCAGG ACCGGGACAC GCTGCGCGCG GACGTGCGCG GCCGCCAGAT CGCGATCCTG
CTGCTGATCG CCCCTGCCGC GACCGTGCTG AGCGCGGTGT TCATCGTGGC CGCCGGAGCC
CACTGGGCGT GGCCGCTGGT GCTCACGGGG CTGGCGGCGT TCTTCGGGGT GGGCAGCGGG
CTGGCGCTGC TGCTGTCGGT GGTGGCGCTC TCCCCCGGCG TGGACCCGCA GCTGCGGGTG
GACGCCAACG ACTCCGGCGA CAACACGGTG CAGGTGTGGA TCGCGCTCGC GGCGCTGCCG
GTGCTGTGCG CGCCCTCGGT GCTGGCCGCC GTGTTCCTGT CTCTGTGGGG CCTGCCGTGG
CTGGCGGTGC CGGTGGGTGT GCTCAACGGG GGGTTCGTGG CCTGGCTGCT GGGCCGCGTC
GCCTACCGGA GGTTGGAGGC CCGGCTACCG GAGACCTTCA CCCGGATCCG CTACGGCCGG
GAGGTCGCGC TCCAGACCGT GTCGGAGCGC GGCGGCTGGC TGGACCTCCT GGAGCGCTCG
GCCATCGAGG GCAACTCCGA GACCAAGCCC ACGGGCTCCT GA
 
Protein sequence
MAGLLIRLKF TLLRHSVTGM RLFGIALVVG GTALTWYLAV AAASDGVRAD LLCLAFAVWA 
VGWMLGPTVA NGTGVLRSQY FALLPLDRRR VGALLLVTMF VDVGPAVTLL ALGALVWHAL
ALDPSALVVA VPGVLVLWVF VVTLSRLVFR ALGAAMHSRL GMEIASVQWG LILAGLFFGW
IAVQPAFQAT LSLRENGLGE GVVGTVLAAL PTSWPVLAVQ AAAAGSWPAA AAWLGGFALL
TVAMVTVTCV LLAPRVAARG VRRRRGPGGG ALTRRAFALL PDSPLGAVVA RELRQWWRDP
WRGLELRASL WAALFTGLLA WPTDLYAFFS PFAGVIAAFV MALATSNMYG HDGTALWLSV
VGQDRDTLRA DVRGRQIAIL LLIAPAATVL SAVFIVAAGA HWAWPLVLTG LAAFFGVGSG
LALLLSVVAL SPGVDPQLRV DANDSGDNTV QVWIALAALP VLCAPSVLAA VFLSLWGLPW
LAVPVGVLNG GFVAWLLGRV AYRRLEARLP ETFTRIRYGR EVALQTVSER GGWLDLLERS
AIEGNSETKP TGS
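
The sequences above are printed in blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch, not a full translator: the function names are hypothetical, the codon table covers only the codons needed for the first ten codons of this gene, and it assumes that a GTG, ATG, or TTG codon in the first position is read as methionine, as is usual for bacterial start codons under translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Partial codon table: only the codons needed for this illustration.
PARTIAL_CODONS = {
    "ATG": "M", "GTG": "V", "GCT": "A", "GGC": "G", "CTC": "L",
    "CTG": "L", "ATC": "I", "CGG": "R", "TTG": "L", "AAG": "K",
    "TTC": "F",
}

# Assumption: these codons are read as Met when they start the gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(seq: str, n_codons: int) -> str:
    """Translate the first n_codons codons; unknown codons become 'X'."""
    residues = []
    for i in range(n_codons):
        codon = seq[3 * i:3 * i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(PARTIAL_CODONS.get(codon, "X"))
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = ("GTGGCTGGCC TCCTGATCCG GTTGAAGTTC "
              "ACCCTGCTGC GGCACTCGGT GACGGGGATG")
dna = clean_sequence(first_line)
print(len(dna))                   # 60
print(translate_prefix(dna, 10))  # MAGLLIRLKF
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.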