Gene Ndas_4605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4605 
Symbol 
ID9248486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5462365 
End bp5464191 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682497 
Protein GI297563523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.269217 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.930021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGTGC TCGACCAGTA CGGCACCGTG GGCGGTGCCG TGCTCCTGTC CCCCGGCCTC 
CTGGTCACGT GCGCGCACGT CGTCAACGCC GCGCTCGGCC TCCACGGGGA GAAGGAGGAG
CACCCGGGCC CGCTCGCCAC CGTGCGCCTG CGGTCCTTCG ACGACCGGGT GTGGGAGGCC
TCCGTCGACA CCCGGCTGTG GTCGGCGGGC CCCGACAGCC GCGACCTGGC CGTGCTGAGG
CTGCGGGACG CGGGCCCCGA CACCTCCTTC CCCGTGCTGC GGGAGTGCGC GAGCCTGGAG
CCGCAGCAGC CCCTGTACAC GGCGGGCTAC CCGGAGGGCA TGCGTTCCCT CCAGGCCCCG
CTCGTCTGCC AGGGGCCCGG CGGGCCCACC GGGGTCACCC ACCAGGTCGA GACCCCCACC
TCACAGCCGG TGCGGATCAC CGGCGGGTTC AGCGGCTGCG CCGTGCGCAC CGGGACGGGC
GAGCTGGTGG GGATCATGCA GAAGACCCAC CACTACGTCT GGAACGACCC CGACCGGCCC
TCCGGCATCG CGTTCATCCT CCCGGTGGAG GAGCTCGTCG GCGAGCGCGA CGACGGCGAG
GTCGTCTCCG CGCAGCGCCT GGCCGACGAG TCCCTGTGCG GCAGGGAGGC CTACGACAGG
CTGCACGACC TGCTGGACTC GGTCCCCCTG GACGCTGTCC CGCCGGAGGA CCTGCTGAGC
CCCGCCGAGG CGCGCAAGGC CAGGCGGCAC GGCGCGGGCA CGACCGCCTG GCGGGTCCTC
ACCGCGCTGT GGGACCTGGT CCCGCCCGTC GGCGAGCCGC CGCCCCGGGT CGCCTGGGTG
CACCACGTCT ACCAGGAGAT CCGGCACCAG CGGCCGCTCC CGCCCGCGGT GTGGTCCTGG
ATCCGGCAGG AGGCCGGGCC GATGGGGCGC GACTGGGAGG AGGCCCTGAC CCGTGACCGG
GACCGGCGGC TGCTCCGGCG CAGGGAGCCG GACGCGCCCG CGGCGCCCTC CGGCCACCGC
GACGAGCGTC CGCCCGACAC GGTGGTGCTC TTCGAGCTGG AGCCGGTGAC CGGCGGCTAC
CGGCTCTCGC ACCGGATCGC CCACCGGGGC GAGAGGGACG ATCCGCTGCC CCAGGGCACG
AGGCTGGTCG GCGAACCGCA GATCTGCGAC GAGATCGCCG ACCTCATGGG CGAGGCCACC
ATGCAGAGGC TGGTCACGCC CAACGAGGAG TCGCTGCGGC TGCGCGTCCT GCTGCCCAGG
GACCTGCTGC ACCTGAACCC GGGCCAGGCC AGCCCCCACC GCGACCTGCT GGAGTACGCG
CCGCCGCTGT GCACCATGTA CGAGATCGTG TACCACGTCC GCGAGCGGGT GCGGTTACCG
CACTACCTGG GGGTTCCCCC CGACAGGTGG CGCCTGCGCT GCGAACGGCA GACGGCCAGC
CCCCTGGTGG AGGACCGCAA CGTGCTCGCC TCCTGGAAAC AGGAGGTGAG CGAGGTGGCG
ATCGCCCTCT CCGACCAGAA CGTGACCGTG TGCGTCACCG ACTCCGACAA CAGCGATGTG
GAGCACGTCT ACGACTCCGC CCTCTACTGG GGAATTCCCA CCATCATCAG GGGGCCCAGA
AAGGCGGTGA CCGCCTTCCT TGAGGAACTC CTGGACCGGG AACCCGATTC GCGGGTGCGC
ATCTCCGGAC TCGCCCGCCA CCTGCGTGAC AGCGCGAGAA GGAGTTCGCA GGCCAGGGAG
ATCGCGATCA TTCATGACAT CTTCGGCGAC GCCCTCCTCC AGGAGACGCC CGGGGAGCCC
GCGGGCCCTG AGAGACCGGG CGTCTGA
 
Protein sequence
MRVLDQYGTV GGAVLLSPGL LVTCAHVVNA ALGLHGEKEE HPGPLATVRL RSFDDRVWEA 
SVDTRLWSAG PDSRDLAVLR LRDAGPDTSF PVLRECASLE PQQPLYTAGY PEGMRSLQAP
LVCQGPGGPT GVTHQVETPT SQPVRITGGF SGCAVRTGTG ELVGIMQKTH HYVWNDPDRP
SGIAFILPVE ELVGERDDGE VVSAQRLADE SLCGREAYDR LHDLLDSVPL DAVPPEDLLS
PAEARKARRH GAGTTAWRVL TALWDLVPPV GEPPPRVAWV HHVYQEIRHQ RPLPPAVWSW
IRQEAGPMGR DWEEALTRDR DRRLLRRREP DAPAAPSGHR DERPPDTVVL FELEPVTGGY
RLSHRIAHRG ERDDPLPQGT RLVGEPQICD EIADLMGEAT MQRLVTPNEE SLRLRVLLPR
DLLHLNPGQA SPHRDLLEYA PPLCTMYEIV YHVRERVRLP HYLGVPPDRW RLRCERQTAS
PLVEDRNVLA SWKQEVSEVA IALSDQNVTV CVTDSDNSDV EHVYDSALYW GIPTIIRGPR
KAVTAFLEEL LDREPDSRVR ISGLARHLRD SARRSSQARE IAIIHDIFGD ALLQETPGEP
AGPERPGV