Gene Ndas_5523 details


Gene Information

Locus tag: Ndas_5523
Symbol:
ID: 9249426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 715325
End bp: 716794
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003683408
Protein GI: 297564435
COG category:
COG ID:
TIGRFAM ID:
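
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 715325
END_BP = 716794


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1470
print(protein_length(length_bp))  # 489
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.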


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.941713
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTCGCA CGTACGGAGG ATGGCGCCGC CGACGCAGCA TCGGGCTGTT CGGGCTGGGG 
CTCAACGCCA CCCTCGTCCT GCTGGGCGGC CTCGTCGTGC TGATGCTGTC CGGCATGTTC
GCGCCCCGGT TGGCCCTCTA CATCCTGCCC GTCATGACGG TGGCCTTCCT GCTCGCCGCC
GTGCCGGTCC AGGGCGAGTC AGTGCTCAGC TTCCTGCTGA CGCGTCTGCG GTGGTGGTGG
GCCAAGGTCC TCGGGCACAC CCGCTACCGG GCCGGTGTGA TGGTCGAGCA CCCGCGGGCC
TTCCAGCTGC CGGGGGTCCT GGCGCCCGTC ACCCTGCTCT CCTGCGAGGA CGGCAGGGGC
GGCCGGTTCG GCCTGGCCTG GAACCGGCGG CTCGGGCACA TGACCGCGAC CGTCCTGGTC
TCGTCCAACT CCGTGTGGCT GGCGGAGCCC TCCGACGTCG ACACGTGGGT GGCCAACTGG
CACCAGTGGC TGGCCTCGCT CGGTTACATG CCGTGGATCC AGTGGGTCAC CGTGACCGTC
GAGTCACGGC CCGCCCCCGG AAGCGCGCTG CGCGGCAGCG TCGAGGCCGA CATCGACCCG
CAGTCCCCCC CGGCCGCCCG GAGGATCATG CGCGACCTCG TGCTGAACGC CCCCTCGGTC
AGCGCCCAGG TCGAGACCCG GATCAGCCTC ACCTACGATC CGCGCAAGTT CCCTGTGGCG
CCCAAGGACC GCATGCAGGC GGCGGTCGAG ATCAGCGGCT ACCTCAACCA GATCGAGACC
AGCCTCGCCA GCACGGGCGT GTCGGTCCTG GGCCGGGCCA CCCCGGAACA GCTCGTCGCC
ATGGTGCGCA CCGCCTACGA CCCGACCGTG GTCAGCGACC TCAGTTCCGT GCTGAACGAC
AAGCCCCAGG ACATCGGAAC CGTCGACTGG ACGGACGCCA CGCCCGTCGA GGCCGAGGTG
CACCACGACC GCTACACCCA CGACAGCGGC ACGAGCGTCT CGTGGATCTG GCAGGAGGCG
CCGCGCACCC ACGTGACCAG CACGGTGCTC GCCGAACTGC TCTCGCCCAC CCGGTGGGTC
AAGAGGACCA CGCTGCTGTT CCGGCCCTGG CCCGCCGCGG CGGCCGCGCG CGCCCTGGAG
CAGCAGAACG CCGCCGCCCA GTACAAGAGC GAGCTGTCCG CCCGCGTGCG CCACAGGGTG
AGCGCGCGCG ACCGCCAGGA CGTCGCCTAC GCCCAGCAGG CGGCCCAGGA GGAGACGCGC
GGTGCGGGGC TGGTGCAGGT GAGCCTGTAC ACGACGGCGA CCGTCGACGA CGAGGAGCGG
TTGAAGCAGG CGGTCGCGCA CACCGAGTCC GCCGCCGAGA CCTCGCGGAT CCGGCTGCGC
CGCGCCTGGG GCAGTCAGGA CGTCGCGTTC GCCACGACCC TGCCCCTCGG GGTGTGTCCG
CCGTTCCTCA TTCAGCGCGG CATGGGTTAG
 
Protein sequence
MVRTYGGWRR RRSIGLFGLG LNATLVLLGG LVVLMLSGMF APRLALYILP VMTVAFLLAA 
VPVQGESVLS FLLTRLRWWW AKVLGHTRYR AGVMVEHPRA FQLPGVLAPV TLLSCEDGRG
GRFGLAWNRR LGHMTATVLV SSNSVWLAEP SDVDTWVANW HQWLASLGYM PWIQWVTVTV
ESRPAPGSAL RGSVEADIDP QSPPAARRIM RDLVLNAPSV SAQVETRISL TYDPRKFPVA
PKDRMQAAVE ISGYLNQIET SLASTGVSVL GRATPEQLVA MVRTAYDPTV VSDLSSVLND
KPQDIGTVDW TDATPVEAEV HHDRYTHDSG TSVSWIWQEA PRTHVTSTVL AELLSPTRWV
KRTTLLFRPW PAAAAARALE QQNAAAQYKS ELSARVRHRV SARDRQDVAY AQQAAQEETR
GAGLVQVSLY TTATVDDEER LKQAVAHTES AAETSRIRLR RAWGSQDVAF ATTLPLGVCP
PFLIQRGMG
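
The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base-count ratio. The following is a minimal sketch of both calculations using only the Python standard library. It assumes the sequence is pasted in as shown (space-separated blocks of 10 bases), it treats only ATG, GTG and TTG as initiator codons (a simplification of table 11), and the function names are hypothetical.

```python
# Amino acids for all 64 codons in TCAG order; translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplification: only the most common bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and
    stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTTCGCA CGTACGGAGG ATGGCGCCGC"))  # MVRTYGGWRR
```

Note that the gene begins with GTG, which would normally encode valine; it appears as M in the protein sequence because it is read as the initiator codon.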