Gene Ndas_4484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4484 
Symbol 
ID9248363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5321294 
End bp5322637 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682378 
Protein GI297563404 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCGTA CCCCCAAGCA GCCGAACGTC CAGCTCGTGG AACTCCTGGA CGAGGCTGGA 
ATGCCCGCCA AGGGGCTTGC CCGCAGGGTC GTGGATCGGG GCAGAGAGCA GGGGTTGTCG
CTCTCCTACG ACCACAACAG CGTCCGTCGC TGGATGTCGG GCGAACGTCC GCAGCGGCCA
GTCCCCGGAA TCATCGCGGC CGTCCTCTCC GAGGCGCTGG GGCGCCTCGT ACTTCCGGAG
GACTGCGGTC TTCCCGACGA CGAGGCACGG ACCCTGGAGT TTCCGCTGTC CTGGACGACA
GGGATCACCA CTGTCGGTCA GCTCTACCGG GCTGACGGCG AACGCCGTCG CGATGTCCTC
GGCGGATACT CGACGGCGGC CTACCCCAGC GCGACCGTAC GGTGGCTCAC CCAGCCTCCT
GCGTCCGGAC CCGCCCATCG CGGCCGGATC CGTGTCGGAC GGCCCGAGAT CTCGGCCATC
CGGCAGATGA CGCGTGCCTT CCGGGACTTG GACAACCGCG TTGGCGGGGG CCGCATCCGC
AGCACCGTCG TGCAGTACCT CGACGCCAAC GTCGCCCCGC TGCTGCGGGG CAGCTACACG
GAGGAGGTCG GCCGGGATCT GTTCTCGGCA GCCGCCGAAC TCACCAAGGC CGTGGGATGG
ATGGCCTACG ACTGCGAGGA GCACGGACTC GCCCAGCGGT ACCTGATCCA GGCCCTGCGC
ATGGCTCAGA CATCCGGGGA TGACGGGTTG TGCGCCGAGA TCCTCGCGGC GATGGGCCAC
CAGGCCACCT ATATCGGGCG GTCGGCAGAA GCCGTGGACC TGGCGAGGGC GGCGCAATCC
GCCGCCCACC GGGCCGGGCA CCCCGCGCTC GCGGCGGAGT GCCACCTCAT CGAAGCCCAC
GGGCACGCCG GACTCTCAGA CCCACGGGCC ACGTCCCGCT CCCTGCGGGC GGGGGTGAAG
GCCTTCGAAG CGGACGACCC CAGCCCGCCG GAGTGGCTGG CCTACTTCGA CAACGCCTAC
CTCGCGGCCA AGGTCGCGCA CTGCTTCCTC GCTCTGGGCA ATGACGCCCA GACGGCTGTG
TACGCCGAGC AGTCCCTGAA GATGAACACC GACTACGTAC GCGGCAGGAC GTTCAACCTC
CTGATGCTCG CCACCGCCCA CGCCACCGAC GAACCGGAGG AAGCGGTGCG GGTCGGTGGA
GTGGCTCTGA ACCTGGTGGA GGGGTTGCAG TCACAACGTG TGTTGTCCTA CCTGCGCCGT
CTGCGGCACC GGCTCCGGCC TCACGAGAAC CTGCCCGAGG TGGAGGAGTT CACGGCACGG
GTCAGGGAGG TCATACCAGG GTGA
 
Protein sequence
MARTPKQPNV QLVELLDEAG MPAKGLARRV VDRGREQGLS LSYDHNSVRR WMSGERPQRP 
VPGIIAAVLS EALGRLVLPE DCGLPDDEAR TLEFPLSWTT GITTVGQLYR ADGERRRDVL
GGYSTAAYPS ATVRWLTQPP ASGPAHRGRI RVGRPEISAI RQMTRAFRDL DNRVGGGRIR
STVVQYLDAN VAPLLRGSYT EEVGRDLFSA AAELTKAVGW MAYDCEEHGL AQRYLIQALR
MAQTSGDDGL CAEILAAMGH QATYIGRSAE AVDLARAAQS AAHRAGHPAL AAECHLIEAH
GHAGLSDPRA TSRSLRAGVK AFEADDPSPP EWLAYFDNAY LAAKVAHCFL ALGNDAQTAV
YAEQSLKMNT DYVRGRTFNL LMLATAHATD EPEEAVRVGG VALNLVEGLQ SQRVLSYLRR
LRHRLRPHEN LPEVEEFTAR VREVIPG