Gene Ndas_5293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5293 
Symbol 
ID9249191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp455972 
End bp457171 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID 
Productintegrase family protein 
Protein accessionYP_003683179 
Protein GI297564206 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGCG CGTGGGTCTA CGACCGCACC AAGGACAGGG CGTACACGGA GGCGGTGAGC 
AAGGCCAAGG CGTCCAAGCG CACTCCGCCC GGCCGGTGGT GGGTCCGCTA CTACGACCCC
TCCGGGAAGA TCAAGAGCGG CGGCGTCTTC CAGAAGAAGC CGGACGCGGA GAAGAAGCGG
ACCGAGATCG AGAACAGCCT TCACGAGGGC TCCTACCGCA ACCCCCATGA CGCCAAGGTC
ACCGTGGCGG AGATGGCGGA GAAGTGGCTC ATCACCCGCA CGGACATCAA ACGATCCACG
TGGTGGCAGT ACCGGGCGGT CCTGGACAAC CACGTGCTGC CGCGCTGGGG AGACCTGCGC
CTCTCGGCAG TCCACGCCGA GGACGTGGCC GTGTGGGTGG CCCATCTCCA GAAGCCCCGG
GACGAGGGCG GTAGCAACCT GGGGGCCTCG CAGACCCGGC ACGCGCACGT CGTGCTGTCG
ATGGTCCTGG GCTGGTGCGT CCCCCGGCGC ATTCCCTTCA ACCCGGCCAA GGGCGTACCG
CTGCCCAAGC CCAGTGAGGC CGAACACGTC TACCTCGACC ACGCCCAGGT GGAGGCGCTG
GCCGACGCCT CCCTGACGCT GCGCACCAAG TACGGGCAGG AACTCGCCTC GGCCAGGGTG
AGCCGGGCGC TCGTCCTGCT CCTGGCCTAC ACCGGCATGC GATGGAGTGA GGCCGCCGCG
CTGCGCGTGG GAAGGGTGGA CCTGGACCGG CGGCGGGTGC GGGTCGTCGT GACCTTCGCC
GAGGTGGACG GCAGGCTCGT CGAACAGCCC CCGAAGAACG GCAGGTTCCG GACCGTGCCG
GTTCCCCGGT CCCTCGTCCC CGAACTGCGG CCCTTCGTGA AGGGGCGGCC GGATGACGCG
CTGGTCTTCA CCACCAGGCG GGGCGCTCCG CTGCGCATCC GGAACTGGCG CAACCGTGAG
TTCGCCCTGG CGGTGAAGGT GGTCGGGCTC GACGGCATGG GGCTGACCCC CCACAAGCTC
CGGCACACCG CCGCGTCCCT GGCGATCGCG GCCGGTGCGG ACGTCAAGGT CGTGCAGGCC
ATGCTCGGCC ACAAGACGGC GACCATGACC CTGGACCGGT ACGGGCACCT GTTCCCGGAC
CGCCTGGACG AGGTGGCCGA CGCGATGGAC GCCGCCCGTC TGCGGGTCCT CGCCGCCTGA
 
Protein sequence
MARAWVYDRT KDRAYTEAVS KAKASKRTPP GRWWVRYYDP SGKIKSGGVF QKKPDAEKKR 
TEIENSLHEG SYRNPHDAKV TVAEMAEKWL ITRTDIKRST WWQYRAVLDN HVLPRWGDLR
LSAVHAEDVA VWVAHLQKPR DEGGSNLGAS QTRHAHVVLS MVLGWCVPRR IPFNPAKGVP
LPKPSEAEHV YLDHAQVEAL ADASLTLRTK YGQELASARV SRALVLLLAY TGMRWSEAAA
LRVGRVDLDR RRVRVVVTFA EVDGRLVEQP PKNGRFRTVP VPRSLVPELR PFVKGRPDDA
LVFTTRRGAP LRIRNWRNRE FALAVKVVGL DGMGLTPHKL RHTAASLAIA AGADVKVVQA
MLGHKTATMT LDRYGHLFPD RLDEVADAMD AARLRVLAA