Gene Ndas_3448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3448 
Symbol 
ID9247316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4132230 
End bp4133471 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content67% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681359 
Protein GI297562385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACC TGTCGCAGGC CATGGTCTCC CGAGTCCTGA GCGGGCAGCG CGAGTTGCGC 
AAGCGGGAGA CCGTCCAGCG TGTGCTGGAG GGTCTGACTT GCGGGCACGA CATCCCTGGG
CCGGCGATGG AACGCTTCCC GCCTGAACTC GGTTTGGGGC TGAACGGCTC CGGTCAGCAA
ACCGCGTCGG TTCTCGAACG GGTGACCCGG CTGGACCTGG AGGAAATCCC CGGAGCGGAT
GCACAACCGC CCTATTCAGC GATCAGTTCG GCCGTACTGT CGTGGTTGGT ATCCCACACC
CCGTCCGATT CCGAGGGGTT CTGGGGCGAG AAAGGTCCCG CAGCGAGCAT TCGTGCTGCC
GCGGCTGCGT TCGCAAGCCT GGACAACCGC TTCGGCGGCG ACCACGCGCG GATAGCCGCC
ACGCAGTACC TCAGCGGCAC GGTGGTCCCC CTGCTGCGTA CGTCATACCC GAGCAGTACA
GGGCGATCGG TGTTCGCGGC GTCTGCGGAG TTCGTGCTCT CGCTGGCGTG GATGTCTTAC
GATGCCCAGC GCAACGGGAC GGCCCGACGC TACTTCGTGC AGGCCTTGGA CCTGGCGGAC
CATGCCGAAG ATCGCCTGCT AGGGGCGAGT GTGCTGTCGG CTATGAGCCA CCAGGCCAAC
TACGTGGGCT CCTATACCGA AGCTCGGGAT CTGGCGCGAG CGGCGTTGAC CGGAGCCGGG
GAGCGGGCGA CGGCGACTCT GCGGGCGCAG TTCCTGATGA TGGAGGCGCG TGCCCATGCA
TCTCTGAGGG ACGAGGGCGC GTGTTCTCGG GCGATGGGCC ACGCCGAGCA GGCTTTCAAC
CAACGCGACC CTGATGCCGA TCCGGCGTGG ATCGGGTACT TCGACCAAGC GGAGTACTCC
GATGAAGTGG CCCACTGCCA CCGGGATTTG GGAGAATCCC GTGCCGCTCG ACGGTCGGCC
GAGCACAGCC TGTCCGCCTC CCGTGGTCAG GAGTACGCGC GCAGCCGGGT CTTCACCCGG
GTCGTGCTGG CGTCCGCGAT GCTCGGGCAG GGCGAGGTCG AAGAGGCCTG CCACTTTGGT
GCAGCAGTCG TGCCGCAGGT GCAGGCGACG TCCTCGGCCA GGTGCGCGGG GTACCTGGAC
ACGTTCGTCG ACAGTGTGCG CGCTTATCGG GGGCAGCCGG AGGCGGATCG GTTCCTCCAG
CAGGCCCGGG CGGCGAAGGT TGTGGGGTCC GCGGGTCAGT AG
 
Protein sequence
MGDLSQAMVS RVLSGQRELR KRETVQRVLE GLTCGHDIPG PAMERFPPEL GLGLNGSGQQ 
TASVLERVTR LDLEEIPGAD AQPPYSAISS AVLSWLVSHT PSDSEGFWGE KGPAASIRAA
AAAFASLDNR FGGDHARIAA TQYLSGTVVP LLRTSYPSST GRSVFAASAE FVLSLAWMSY
DAQRNGTARR YFVQALDLAD HAEDRLLGAS VLSAMSHQAN YVGSYTEARD LARAALTGAG
ERATATLRAQ FLMMEARAHA SLRDEGACSR AMGHAEQAFN QRDPDADPAW IGYFDQAEYS
DEVAHCHRDL GESRAARRSA EHSLSASRGQ EYARSRVFTR VVLASAMLGQ GEVEEACHFG
AAVVPQVQAT SSARCAGYLD TFVDSVRAYR GQPEADRFLQ QARAAKVVGS AGQ