Gene Ndas_5100 details


Gene Information

Locus tag: Ndas_5100
Symbol:
ID: 9248990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 247095
End bp: 249125
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: SMC domain protein
Protein accession: YP_003682987
Protein GI: 297564014
COG category:
COG ID:
TIGRFAM ID:
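
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon (which encodes no residue).
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
gene_length = feature_length(247095, 249125)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "2031 676"
```

Both results match the listed Gene Length (2031 bp) and Protein Length (676 aa).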


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.13432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.797675
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTCGGG TACCCGAGAC GTTCGAGGGT GGGACGGCGC TCCTGGACGC TCTGGCGCGC 
TCCGGACTGC CCCGGGAGGT CAGCTCCTGG GTCAGCGCGG CCCTGTGGGG AGAGGACGCC
CTGGAGGCCC GCCTGAGGGG TGAACGGGTC CCCGAGCCGG AACCCGCCGC CGAGCCGCCC
GCGGGCAGGG TCCGGCGGAC CTACCTCACG GGGATCAGGG TCCAGGGCTT CCGGGGCATC
GGCCGCCCCG CCGAGCTCAC CTTCGACGCC GGTCCCGGGC TCACCGTGAT CGTCGGGCGC
AACGGCTCGG GCAAGTCCAG CTTCGCCGAG GCCGCGGAGG CGGCCCTCAC CGGCCGCAAC
CCCCGGTGGG ACTCCATGCC CACGGGCTGG CGCGACGGCT GGCGCAACCT CCACTACGAC
GAGCGCACCG AGGCGACCGT GGACGTGCGC GTCGCCGGGG ACGAGGGCTC CACCCGCATC
AGCCGCAGGT GGACGGGCGA GAGCGTCCGC TCCGCGCGCG GCGAGGTCGT CCACCCCGAC
GGCGAGGTCT CCCCCCTCAG GACCATGGAC TGGGGCGACA ACCTCGTGCG CTACCGCCCG
TTCCTGTCCT ACGACGAACT CGGCCGCACG GTCACCGGGC GTTCGGCCGA GCTGTACGAC
ACGCTCACCG GCCTGCTCGG GCTCAGCGGC CTGGCGGAGG CCGAGCGGCG CCTGGCCAAG
GTGTGCGACG GGCTCGCCAA GCGCCGCGAC CGGCCCTCGC GGGAGCTGCG GTACCTGCTC
GACGCGCTCC GGGCCTCCGA CGACCCCAGG GCCGTGCAGG CCGTCCAGCT GCTCACCGCC
TCCTACTTCG ACATGGACGC CCTGCGCAGG CTCGCCTCCG ACACCGGGCC CAGCGACCCC
GAGCTGCACA CGGTCCTGCG CCGCCTGCGG CGGCTGGCCG TGCCCGAGCG GACGCTGATG
TCCGACGTGG TCAACGAGCT GCGGGGCGCG TCCATGGAGC TGGCCATGGC CGCGGGCAGC
AAGGGCGACC GCGCGCACGG GGTGGTCAGG CTCCTGGAAC AGGCCCTCGA ACACCACCAG
CGCCACCCCT CCGAGACCGA GTGCCCCACG TGCTCCGCGC CGCTGGGCGC CGACTGGGTG
CGCCGGGCCA ACGCCCAGCT GCGCGCCCTC AAGCCCCAGG CGGCCGTCGT CTCGGCCGCC
TACGAGCGCG CCGACGCGGC CAGGGACCAG GCCCGCTTCC TGATGTCCCC GGCGCCGGGC
TGGCTGCCGC CGGAGAGCGA GCTGGGCCAG GTGTGGTCGC TGTGGGAGTC CGGCGCGGAC
ATCGAGGACC TGGCCGAGCT GGCCGAGCAC ATCGAGGCGG TCGGCCGCAG GCTGAGGGCG
GCCGCGGTGA GCGCCCGCCG GGACGCGAGC GAGCGGCTGG AGGACCCGAC CGGCGGCTGG
TCGGAGCTGG CCGAGCAGCT CTCGGGGTGG CTGGACGACG CCCAGGACGC CCTCACCGCG
CGCGAGGTCC TCAACGGGGC CGAGGCCGCG CTGAACTGGC TGTCGGAGCA GGCGAGGATC
CTGCGCGAGG AGCGGCTGGG CCCGGTCGCG GCGCAGGCCG AGCAGGTCTG GTACCGGTTG
CGCCAGGAAC GCCACATCGA CCTCCAGGGC ATGCGGCTGA TCGGCCGGGG CGCCCGGCGC
CGGGTCGAGG TGGACGTGTC GGTCGACGGC GTCGGCGACC AGACCAGCGC GCCCGGGCTC
CTGAGCCAGG GCGAGTTCCA GGCGCTGGCG CTGTCCATCT GCCTGCCGCG CACCCTGGTG
GAGGGCAACC CCTTCGGCTT CCTGGTCCTG GACGACCCGG TCCAGGCGAT GGACACCGAG
ACCGTGGAGG GCCTGTCCGC GGTGCTGGCG GAGGTGGGCA GGCACCGGCA GCTGATCGTG
TTCACCCACG ACACCCGCCT GTCCGACGCC CTGCGCCGAC TGGGCCTGCC CGCCGACATC
CGCACGATCA ACCGGGACGC GATGTCCAAC GTGTGGGTCG AGGCGGTCTG A
 
Protein sequence
MTRVPETFEG GTALLDALAR SGLPREVSSW VSAALWGEDA LEARLRGERV PEPEPAAEPP 
AGRVRRTYLT GIRVQGFRGI GRPAELTFDA GPGLTVIVGR NGSGKSSFAE AAEAALTGRN
PRWDSMPTGW RDGWRNLHYD ERTEATVDVR VAGDEGSTRI SRRWTGESVR SARGEVVHPD
GEVSPLRTMD WGDNLVRYRP FLSYDELGRT VTGRSAELYD TLTGLLGLSG LAEAERRLAK
VCDGLAKRRD RPSRELRYLL DALRASDDPR AVQAVQLLTA SYFDMDALRR LASDTGPSDP
ELHTVLRRLR RLAVPERTLM SDVVNELRGA SMELAMAAGS KGDRAHGVVR LLEQALEHHQ
RHPSETECPT CSAPLGADWV RRANAQLRAL KPQAAVVSAA YERADAARDQ ARFLMSPAPG
WLPPESELGQ VWSLWESGAD IEDLAELAEH IEAVGRRLRA AAVSARRDAS ERLEDPTGGW
SELAEQLSGW LDDAQDALTA REVLNGAEAA LNWLSEQARI LREERLGPVA AQAEQVWYRL
RQERHIDLQG MRLIGRGARR RVEVDVSVDG VGDQTSAPGL LSQGEFQALA LSICLPRTLV
EGNPFGFLVL DDPVQAMDTE TVEGLSAVLA EVGRHRQLIV FTHDTRLSDA LRRLGLPADI
RTINRDAMSN VWVEAV
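
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that formatting, compute GC content, and translate the CDS. It is a minimal illustration, not the database's own pipeline: the function names are hypothetical, the codon table uses the standard amino acid assignments (which translation table 11 shares), and only ATG, GTG and TTG are treated as initiator codons, although table 11 allows additional ones.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG order; translation table 11 uses the same ones.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Assumption: only the most common bacterial initiator codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at the first stop."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("GTGACTCGGG TACCCGAGAC GTTCGAGGGT")
print(translate(first_blocks))  # prints "MTRVPETFEG"
```

The translated prefix matches the first ten residues of the protein sequence, including the GTG start codon read as methionine.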