Gene Ndas_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1668 
Symbol 
ID9245518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2038455 
End bp2039717 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content74% 
IMG OID 
Productprotein of unknown function DUF418 
Protein accessionYP_003679603 
Protein GI297560629 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.620816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG ACCGCGCCGA AGGCCCGCCC TCCGTCCCCG GAGACCCGCC CGCAGCCGCC 
GCCGACACCC GGGCCCCGGC GGGTTCGACG CCGGTGGCCG AACGGGCCCT GGCCCCGGAC
CTCGCCCGCG GCATGATGCT GCTGCTGATC GCGCTGGCCC ACGTGCCGTG GTTCCTGTAC
CAGGCGCCCA CCGGCCTGGC CATGCTGCAC CCCGTCGACG GCAACCTGGC GGACAGGGCC
GCGCAGTTCA TGACGATCGT CGTCGTGGAC GCCCGCACGC ACACGATGTT CGGCTTCCTC
TTCGCCTACG GCATCGGGCA GATGTACCGC CGGCAGAGGG CGCGCGGCAC CGGTGAGAAG
GAGGCCCGCG GGCTCCTGCG CAGGCGCCAC CTGTGGATGC TCGTCTTCGG CGCGGTCCAC
GCCGCCCTGC TGTGGCAGGG CGACATCCTG GGCACCTACG GCCTGATCGG GCTCATCATG
GTGCCGCTGT TCCTCAACCG CAGCGACCGC ACCCTCAAGA TCTGGCTGTC CGTCCTGCTG
GCCCTGGGCG CCCTGGTCAC CGCCGTCTCG GCGGCCTCGG TCCTGCTGGC GCCGGACGCG
GTGTCCACGG CCGCGGCGAC CGACATGCAA AGGGCCAGCA TCGCCGAGAC GAGCTACCTG
CTCTCCGCCG TGTTCCGCCT CCCGGCCTGG TTCTTCGGGC TCTTCTCCGG CCTGTTCACC
CTGGCCCTGC CGACGGTGTT CCTGATCGGC CTGCTCGCGG CGCGGCACCG GTTCCTGGAG
GACCCGGCGC GACACCTGAC GCTGCTGCGC CGGGTCGCGG TCCTGGGCAT CGCCGTGGGG
TGGGCCGCCG GAGCGGTGCT GGGCCTCCAG CACGTGGGCG TCCTGGACGC CACCCACATC
TCGGCGGTCT CCTCGGTGCA CTTCTACACC GGGATCTTCA CCGGGGTGGG CTACGCCGCG
CTCTTCGGGC TCCTCGCGCA CCGGCTCTCC GCCCGGGGGG CCCAGCGGTC CCTGCCGGTC
CGGGCCCTGG TGTCCCTGGG GCGGCGCTCC CTGAGCGGCT ACCTGGCCCA GTCGGTGGCC
TTCGCCCCGT TCCTGGCCGC CTGGGGCCTG GGCCTGGGCG TGCACCTGTC CAGCTGGTCG
GCGGTGCTGG TGGCCGTGGG CACCTGGCTG CTGACCGTCG CGGCGGCGTT CCGGCTCGAC
CGCGCGGGCA GGCGCGGACC GGCGGAGATC CTGCTGCGGA GGCTGACCTA CCGCAAGCCG
TGA
 
Protein sequence
MTTDRAEGPP SVPGDPPAAA ADTRAPAGST PVAERALAPD LARGMMLLLI ALAHVPWFLY 
QAPTGLAMLH PVDGNLADRA AQFMTIVVVD ARTHTMFGFL FAYGIGQMYR RQRARGTGEK
EARGLLRRRH LWMLVFGAVH AALLWQGDIL GTYGLIGLIM VPLFLNRSDR TLKIWLSVLL
ALGALVTAVS AASVLLAPDA VSTAAATDMQ RASIAETSYL LSAVFRLPAW FFGLFSGLFT
LALPTVFLIG LLAARHRFLE DPARHLTLLR RVAVLGIAVG WAAGAVLGLQ HVGVLDATHI
SAVSSVHFYT GIFTGVGYAA LFGLLAHRLS ARGAQRSLPV RALVSLGRRS LSGYLAQSVA
FAPFLAAWGL GLGVHLSSWS AVLVAVGTWL LTVAAAFRLD RAGRRGPAEI LLRRLTYRKP