Gene Ndas_2475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2475 
Symbol 
ID9246325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2939185 
End bp2941329 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003680401 
Protein GI297561427 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGTT CCCTGACCCG TACGACGGCG GTCTCGGCCG CCTCGGTACT GGCCTTGTCA 
CTCCTCTCCC CGTCGCAGGC GCTCGCCGAC GACAACCGTC CTCCGGACCG GCCCGACACC
GCCACGCTCA CCGCGCGGGG AAGGGTGTGC TCCACCGACC AGGATGCCCC CACACTTATC
GGCAGCACAG CACCCCTCCT GAGAGGAGTC TTCCGCGATC CCGACTCCGA GACCGGGGCA
CAACGGATCA GGGCCGAGTT CGAGTGGAGC CTGGAGGGCA CCGACGAACT CCTCGGCGCG
GCCGAGTCGA CCCATACCTG GCACCACCCC GGCCGTGACC CCTATCCGCT GTCGGTAACC
GCGAGCGGCC TGCCCGAAGA CACCCTCATG CGCTATCGCG CGCGCGGCCA TGACAACCAG
GAGCCGGGTG AGTGGTCCGA GTGGTGCTGG ATCGAAGTCA ACACCGACGG ACCCGAAGCG
CCTCCGCTGG TCACCTCCGA CGACTACCTT CCCAACGGTG GTTTCCAGGG AGCACCCGGT
CGGACAGCGG AGTTCACCTT CGCCAACAAC GGAACGCTGG ACGCGGTCTC CTACGAGTAC
AACCTCGGCC TGGAACCCCT GTGCAACACC CGGGTGGAAC TCGATGAGCC CGGTGCCTCC
GCCACCGTCT CGGTCACACC GCAGAGGTCG GGACCCCAGT GGATCTACGC CCGGAGCGTC
GACGCCTACG GCAACACCTC CGCGTGCGAG TCGGTGTACG AGGTCCTGGT GGCTTCCCTG
GCCGACCCCG TGGCCTACTT CCTCCTCGAC GAAGGTGAGG GGACGAGCGC CTCGGACGTC
ATGTCAGAGG ACCGCTCTGC TACGGGGGAC GGCGGCATCG CCTGGACCCG TGGCCGCGTG
GGCGAGCGCC AGGGCAGCGG TTACCGGCTG GAGGGCACGG CCGTGGCCAC GGCTGACGGA
CACCTGCGCA CCGACACGGC GGTCGTCGAC ACCTCGGAGG CCTTCGCGGT GTCGGCCTGG
GTCCGGTTGG ACGAGACCGG CACCGACGCC GTCGCCCTCT CCCAGGACGG CGAACATCTG
AGCGGCTTCC AGCTCGGTTA CGACGCCTCC GAGGAGGCAT GGGTGTTCCA GACAGCCTCT
CAGGACGGAC CGCAGGCCGG GTTCGACCAG CGCGTGGTCT CCACCGTTCC CGCTCAGGCG
GGCGTGTGGA CACAGCTCAG CGGTCAGCAC GATCCGGAAA CCGGAGAGAT CGCCCTCTAC
GTCGAGGGCG CCCATCAGGG AAGCGCCGCG TGGGACTCCG CCTGGAACGC TGAGGGCCCG
TTCGTCATCG GAGGCGGCCG GGAAGCGGAC GCCTTCTCGG GCAGTTGGCC GGGAGCGGTC
GATCACGTCA AGGTATGGGA CCGCCTCCTC ATCGTCGAGG ACGCGCCCTA CACCAGCACC
AAGCGGTCGG AGGTGTGGCA GCACGCCAAC CTGCCGCTGG CCCTGGAGGG CCGGTGGATG
TTGGAGGAGA GCGGTGGTGC GGCCGCCGCG GACGGTTCCG ACCACGGCCT GGACGCGACA
CTGCACGGTG ACCCCGCGAC GGTGTGGGAG GGGGCGTTCA ACGACTGGAC CTACACCTCC
GCGATCCTTC TCGACGGAAC CGCGCAGGAG CACCTGCGCA CGGACGGAGC GGCTGTGCGC
ACCGACCGCA GCTTCACCGC CTCGGTGTGG GTGCGTTTGG ACGAGGGCGG CTCCGACGCC
GTCGCGCTCT CCCAGAGCGG CGAGCACACC GGTGGGTTCG TGCTGGGGTA CGACGCCGAA
CTGGAGGCGT GGGTCTTCGA GACATCGGCC GGGGACTCCG AGGGAGCTGA GGTCAGCCGC
GTCGCCTCCG CCTGGGCGGA GACGGGGAGG TGGACCCATT TGACCGGTAT CTACGACCAC
GTGGACGGAA CACTCGCCCT CTACGTCGAC GGTGTCCGGC AGGAGGACGC GGAACGCGAG
GGCGCCTGGC ACGCCGACGG CGACGTGGTG ATCGGAGGAG CCGGATACAC GGACGGCGTC
GACCGTCCCT GGACGGGCGC TCTCGGCACG GTCTTCCTCC TCCAGGGAGT CGCGTTCCCC
CACGACGTCT ACACCGTGAT GGAAGGTCTC CTGCCGAGGG TTTAG
 
Protein sequence
MRRSLTRTTA VSAASVLALS LLSPSQALAD DNRPPDRPDT ATLTARGRVC STDQDAPTLI 
GSTAPLLRGV FRDPDSETGA QRIRAEFEWS LEGTDELLGA AESTHTWHHP GRDPYPLSVT
ASGLPEDTLM RYRARGHDNQ EPGEWSEWCW IEVNTDGPEA PPLVTSDDYL PNGGFQGAPG
RTAEFTFANN GTLDAVSYEY NLGLEPLCNT RVELDEPGAS ATVSVTPQRS GPQWIYARSV
DAYGNTSACE SVYEVLVASL ADPVAYFLLD EGEGTSASDV MSEDRSATGD GGIAWTRGRV
GERQGSGYRL EGTAVATADG HLRTDTAVVD TSEAFAVSAW VRLDETGTDA VALSQDGEHL
SGFQLGYDAS EEAWVFQTAS QDGPQAGFDQ RVVSTVPAQA GVWTQLSGQH DPETGEIALY
VEGAHQGSAA WDSAWNAEGP FVIGGGREAD AFSGSWPGAV DHVKVWDRLL IVEDAPYTST
KRSEVWQHAN LPLALEGRWM LEESGGAAAA DGSDHGLDAT LHGDPATVWE GAFNDWTYTS
AILLDGTAQE HLRTDGAAVR TDRSFTASVW VRLDEGGSDA VALSQSGEHT GGFVLGYDAE
LEAWVFETSA GDSEGAEVSR VASAWAETGR WTHLTGIYDH VDGTLALYVD GVRQEDAERE
GAWHADGDVV IGGAGYTDGV DRPWTGALGT VFLLQGVAFP HDVYTVMEGL LPRV