Gene Ndas_5522 details

Gene Information

Locus tag: Ndas_5522
Symbol:
ID: 9249425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 713759
End bp: 715309
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003683407
Protein GI: 297564434
COG category:
COG ID:
TIGRFAM ID:
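
The coordinates and lengths listed above are consistent with one another, which a little arithmetic confirms. The Python sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 713759
end_bp = 715309

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported Gene Length (1551 bp) and Protein Length (516 aa).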


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.349353
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.675999
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCA TCGACCGGCA GAGCCGCCGT GAGGAGCGCC GCCAGCAGCG GGCGGAGCGC 
CGACGCCTCA AGCAGAAGCG GGAGAACCAG GAGCAGGCGA GGCGGCCGGA CCGCTCGCGG
GAGGGGGACC GGGAGGTCGT CGCCCCCCTG CTCGGCTGGC GGCACCGGGG AGGCGGGGCC
TCGCCGAACA TCCCCAACGC GGTCGAGTAC CAGGCGACCA CCGCCCAGGC GTGCGGGCTG
TTCCCGTTCG TCACCAGCAG CAAGCCGCCG TCGGTCGGCA CGCCGATCGG CCGCGACCTC
CTCTCCGGGG AGGCGGTCTG CCTCGACCCG ATGGCGTGGC TGAGGGCCGG TCTGATCACC
AACCCCGGCT GCTTCGTGCT GGGGCAGCCG GGCACCGGCA AGTCCACGTT CGTCAAGCGG
CTGGTCACCG GGGCGGTGGC CTTCGGCTCC CAGGCCATCA TCCTGGGCGA CACCAAGCCC
GACTACACCG ACCTGATCCA GCACCTGGGC GGCCAGGTCA TCCGCATCGG GCGCGGCCTG
GACCGGATCA ACCCGCTCGA CGCCGGCCCT CTGGGGGCGG TCATGGACCG GCTGTCGCAG
GTGGAGCGGG AGAAGCTGCG CTGGGAGGTC CGCTCCCGGC GCATCTCGCT GCTGATGGCC
CTGTGCACCC TGGTCCGCGA GGGCAGGATC GGCAACGCCG AGGAGGTCGT CCTCGGCGCC
GCGGTGGACC TGCTCGACGA ACGGCTCGCC GGTCGCCAGC CCACGGTCGT GGACGTGCTC
AACGTGGTGG AGGAGGGCCC GGACTCGCTG CGCTCGTTCG CCCGCGCCGA CAGCAGGGAC
TCGTACGACA AGCGCGTGGA CGACCTGGTC TTCACGCTCC GCCTGCTGTG CACCGGCTCC
CTGGCGGGGG TCTTCGACGC CGAGACCACC AGGCCCCTCG ACATGGAGGC GCCGGGGATC
AGCGTCGACA TCTCCCGGGT GGGAGCCGCC GGGGACAAGC TGCTGACCGC GGCGATGCTG
TGCACGTGGA GCTACGGTTT CGGCATGGTG GACGCGGCGG CCGTGCTCTC CGACCTCGGC
CTCATCCGGC GCCGCTCCTA CATCGGCGTG ATGGACGAGC TGTGGCGCGC CCTGCGCGGC
GCTCCGGGCC TGGTGGAGTT CGCCGACTCC CTCACCCGGC TCAACAGGTC CAAGGGCATG
TCGTCGGTCA TGGTGACCCA CTCCCTCAAC GACCTGGAGG CCCTGGCCAC CGAGGAGGAC
CGGGCCAAGG CGCGGGGCTT CGTGGACCGC TCCGCCATCA CCGTCCTGGC CGGGCTGCCG
CCGCGCGAAC TCGCCTCGGT CAACCGGATC ACCCCGATGA CCGCCCCCGA GCAGGGGCTG
GTGGCGTCGT GGTCGTCCCC GGACTCCTGG CAGCCCGGAG CTCGCCACCC CGGGCGCGGG
AAGTACCTGA TCAAGACGGG CGCGGAGCGC CTGGGGATCC CGGTGCAGAT GACGCTCACC
CGCGACGAAC TCGTCCTGTA CGACACCGAC CAGGCGATCC GGGGGGAGTG A
 
Protein sequence
MAIIDRQSRR EERRQQRAER RRLKQKRENQ EQARRPDRSR EGDREVVAPL LGWRHRGGGA 
SPNIPNAVEY QATTAQACGL FPFVTSSKPP SVGTPIGRDL LSGEAVCLDP MAWLRAGLIT
NPGCFVLGQP GTGKSTFVKR LVTGAVAFGS QAIILGDTKP DYTDLIQHLG GQVIRIGRGL
DRINPLDAGP LGAVMDRLSQ VEREKLRWEV RSRRISLLMA LCTLVREGRI GNAEEVVLGA
AVDLLDERLA GRQPTVVDVL NVVEEGPDSL RSFARADSRD SYDKRVDDLV FTLRLLCTGS
LAGVFDAETT RPLDMEAPGI SVDISRVGAA GDKLLTAAML CTWSYGFGMV DAAAVLSDLG
LIRRRSYIGV MDELWRALRG APGLVEFADS LTRLNRSKGM SSVMVTHSLN DLEALATEED
RAKARGFVDR SAITVLAGLP PRELASVNRI TPMTAPEQGL VASWSSPDSW QPGARHPGRG
KYLIKTGAER LGIPVQMTLT RDELVLYDTD QAIRGE
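
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation such as the GC content reported in the Gene Information section. The following is a minimal Python sketch of that cleaning step and a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGGCGATCA TCGACCGGCA GAGCCGCCGT GAGGAGCGCC GCCAGCAGCG GGCGGAGCGC"
seq = clean_sequence(first_line)

# Prints the cleaned length and the GC fraction of this first line only.
print(len(seq), round(gc_fraction(seq), 2))
```

To check the whole gene, paste all lines of the gene sequence into one string and pass it through `clean_sequence` first; the result should be 1551 bases long if the record is internally consistent.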