Gene Ndas_5027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5027 
Symbol 
ID9248916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp168350 
End bp169429 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content70% 
IMG OID 
ProductParB domain protein nuclease 
Protein accessionYP_003682914 
Protein GI297563941 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.556043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTCA CGAAGGGAAA TTTCACTGTG CCTGCCCAAA CCATGCCATC CGCCCCTGGA 
CTGGGGGAGG GTCAGAGAAA TGACGGGGAC CTGCTCCCCG AGGTGGCTGA CATAATCAGG
GATCTGCCCC AGGGAATTGT TCCCATCGCG GACCTCGCCC CTTCCCTTTC TCCGCGCGAG
AAGATGGAGG ACCCGGCCCA CGCCCGTGTG ATCGCGGAAA TGGGGAGCGC CGTCGAACCC
ATCCTCGTCC ACCGCCGCAC CATGCGCGTG ATCGACGGGG TGCACAGGAT CCGCGCCGCG
CGGATGAGGG GGAAGACCGA GATCAGGGCC CGTTTCTTCG ACGGCAGCGA CGAGGACGCC
TTCGTCCTCG CGGTGCAGCT CAACGTGGGG CACGGGCTTC CGCTGACCCT GAGCGAGCGC
AGGACCTCCG CCCAGCGCAT CATCCTGTCC CATCCGCACT GGTCCGACCG GGCCATCGCG
AGGAGGGCCG GGCTGAACGC CAAGACGGTG GCCAGGCTCC GTCGGGAGGC GGGAGAGGGG
GGCTCCGTCG CGGAGAGCAG GGTGGGCATG GACGGAAGGG CCAGACCGCT CAGCAGCGTC
GAGGGCCGGC GCAGCGCCGC GGCCATCCTC GCCGAGAACC CCGAGATCTC GCTGCGTGAG
GTGTCCAGGA AGAGCGGGCT GTCGGTGGGC ACGGTCCGGG ACGTCCGCAA GCGCGTCGAC
CAGGGCCAGG ACCCCATTCC GGAGCGCTTC CGCCGCAACG GCGGGGAGCG CGTCCCCGTG
TCGGTCGCGG TCCGGGAACC GGACCGCAGG CCCCGGGTCA CCGAGTCCAC GGCGGTGGGC
GAGACCGCGG CGGCGGTCCG CAAGCTCGCC CGGGACCCCT CCCTGCGCGC CACCGAAGCC
GGTCGGACCC TGCTGCGCAT GCTGACGATC ACCGAGGTCG GACGCGCGCA GTGGGAGGAG
ATCATCAGGA CGATCCCCGA GCACTGGCTG CCCCTCGTCC GCGCCATCGT CCTCCAGCGC
TCCCAGGAGC TGCGGGAACT CGCCTACATG GTGCCCGGCG AGGACTCCGA CCCGGACTGA
 
Protein sequence
MPFTKGNFTV PAQTMPSAPG LGEGQRNDGD LLPEVADIIR DLPQGIVPIA DLAPSLSPRE 
KMEDPAHARV IAEMGSAVEP ILVHRRTMRV IDGVHRIRAA RMRGKTEIRA RFFDGSDEDA
FVLAVQLNVG HGLPLTLSER RTSAQRIILS HPHWSDRAIA RRAGLNAKTV ARLRREAGEG
GSVAESRVGM DGRARPLSSV EGRRSAAAIL AENPEISLRE VSRKSGLSVG TVRDVRKRVD
QGQDPIPERF RRNGGERVPV SVAVREPDRR PRVTESTAVG ETAAAVRKLA RDPSLRATEA
GRTLLRMLTI TEVGRAQWEE IIRTIPEHWL PLVRAIVLQR SQELRELAYM VPGEDSDPD