Gene Ndas_2630 details


Gene Information

Locus tag: Ndas_2630
Symbol:
ID: 9246481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3140989
End bp: 3142239
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_003680553
Protein GI: 297561579
COG category:
COG ID:
TIGRFAM ID:
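
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3140989
END_BP = 3142239
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree with each other: 3142239 - 3140989 + 1 = 1251 bp, and 1251 / 3 - 1 = 416 aa.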


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0395405
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCCGGC CCTGGAAGCG TCCCGATCTG CCCCCGGGTG GGCTCGACGA GCTCAACCGC 
GCGCTGCATG AGTTGCACCA CCGGGCCGGC TGGCCCTCCT CCCGACAGAT CCAGCGCGCC
CTGGACGCCA AGGGGGTGCC CATGTCCCAC ACCAAGGTTC ACGACACCCT CACCAAGCCG
GACCTGCCGC CCAAGGGCGC GGTCGAGATG ATCACCGAGG TCCTGGCGGA GGCGGTCCGC
GGCGCCGACG CCGACACCGA GGTCGAGCGC CTCCTGGACC TGTGGCAGGA CGCCTCCGAC
GGCACCCTGT CCTCTTCTTC CGCGGCGCAC GGCGAACCCC GGGCCGAAGC ACCGCCCCCG
GCGCTCCGGC CTCCGACCGG CGGCGGACAG GGCGGGGACG AGCAGGCCTA CGAGTGGGGA
GACGTGCCGC GGGAGTGGAG GGACAGGGCC GAGGCCGGTG AACCCGACGC CATGATCAGC
ATCGGCCTCA GGTTCGGGGT CTGGGGCGAG AAGGACAAGG CGGAGACCTG GTACCGCCGT
GCCGTCGAGG CGGGCAGTAC CCGGGCCATG GACAACCTCG GGAGCCTGTT GGAGGACCGG
GGCGACCTGG GGGAGGCCGA GGAGTGGTTC CGCCGCGCCG CCGAGGACGG TCACACCGAT
GCCATGGACA ACCTCGGGAG CCTGCTGGAG GGCCGGGGCG AGCTGGACGA GGCCGAGGGG
TGGTTCCGCC GCGCGGTCGA GGACGGTCAC ACCGATGCCA TGAACAACCT CGGTGTCCTG
CTGCGGGGAC AGGGTGAGCT GGACGAGGCC GAGGGGTGGT TCCGCCGTGC CGCCGAGGAC
GGACACCTCC AGGCCATGAA CGACCTCGGC GTCCTGTTGC GGGGACGGGG GCGGCTCGAC
GAAGCCGAAT CCTGGTTCCG CAACGCCGCC GGCAAGAACG GCAACGCGCA CGCCATGTAC
AACCTCGGGT CCCTGTTGGA GGACCGGGGC GAGCTCGGCG GGGCCGATGT GTGGTACCGG
CGCGCCGCCA AGAACGGCAA CACCCAGGCC ATGTACAACC TCGCGTTTCT GCTTCACCGA
GAGGGGGACA AGGACGAGGC CGAGACCTGG TACCGCCGTG CCGCTGAGTT CGGCCACACC
GCCGCCATGT ACAACCTCGG CGTGCTGCTC CAGGGGCGGG GCAGGCCCGG GGAGGCCCAG
GGGTGGTGGC AGCGGGCGTT GGCCGCGACG GGTCGGCGCC GCGGGCGCTG A
 
Protein sequence
MPRPWKRPDL PPGGLDELNR ALHELHHRAG WPSSRQIQRA LDAKGVPMSH TKVHDTLTKP 
DLPPKGAVEM ITEVLAEAVR GADADTEVER LLDLWQDASD GTLSSSSAAH GEPRAEAPPP
ALRPPTGGGQ GGDEQAYEWG DVPREWRDRA EAGEPDAMIS IGLRFGVWGE KDKAETWYRR
AVEAGSTRAM DNLGSLLEDR GDLGEAEEWF RRAAEDGHTD AMDNLGSLLE GRGELDEAEG
WFRRAVEDGH TDAMNNLGVL LRGQGELDEA EGWFRRAAED GHLQAMNDLG VLLRGRGRLD
EAESWFRNAA GKNGNAHAMY NLGSLLEDRG ELGGADVWYR RAAKNGNTQA MYNLAFLLHR
EGDKDEAETW YRRAAEFGHT AAMYNLGVLL QGRGRPGEAQ GWWQRALAAT GRRRGR