Gene Ndas_3743 details

Gene Information

Locus tag: Ndas_3743
Symbol:
ID: 9247612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4493880
End bp: 4495754
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Lipoprotein LpqB, beta-propeller domain protein
Protein accession: YP_003681647
Protein GI: 297562673
COG category:
COG ID:
TIGRFAM ID:
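
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
START_BP = 4493880
END_BP = 4495754


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

Under these assumptions the sketch reproduces the Gene Length and Protein Length values listed in the record.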


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGACAC CGCACCGGTT CGGCCGCGCC GCGCGCGCCT GCCTGGCCGC GGCCGTGGCG 
GTCGTGTCCA TGGCCGCCTG CGCGACCGTC CCCACCACCG GCCCGGTCGT GGCCAGCGAC
GGCAACGAGT CCGAGGGCGA CCCCTACGGC GGCTACGTGC GCCTGCTGCC CGCCGGACCC
CAGGAGGGAG TGGCGCCGGA GGGCCTGGTC AGCGACTTCC TCAAGGACAT GGGCAGCTTC
GAGCAGGACT ACGAGGCCGC CCGCAGCTAC ATGCTGTCGA GCACGGACCA GTCGTGGAAC
CCGGACGGGA CGGTCAAGGT CTTCCCCGGC CACGACACCG TCGACCTCGA CACCGAGATC
AGCGGAGACG GGCTCACCGC CACCGTGCGG ATGCGCAGCT CGCTGGTGGC CACGATCGAC
GAGGACGGCA GGTACGTCTC CGGCGACTCC GGTCGCCTGC TCGACGAGAC CTTCGTCCTG
GCCCGGGAGG AGGAGGACCC CGAGGGCGAG TGGCGCATCC AGAGCCTGCC CGACGAGCTC
ATCCTCAGCC AGCTCGACGT GGAGCGCACC CACCGCCCGT TCAACCTCTA CTACTTCAAC
CCCGACGAGA ACGCCCTGGT CCCTGACCCG GTCTACCTGC CCGTCAGCAA CGACGAGCTC
ACCAAGCGCC TGCTGCACAG GCTCGTCCGC GGCCCCAGCA CCTGGCTGGA CCCCTCCGTG
CACTCCTCCT TCGCCGAGGA CGCCGACCCC GAGGTGGAGG TGGAGGAGGA CCGGGTGACC
GTCAGCGTGA CCGCCCCCGG CCAGGCCGAC GAGTTCGGCA TGGGCGCCCA GATCGCCTGG
ACCCTGCGCC AGCTGCCGGA GATCCAGGAG TTCACGCTGC GGGTGAACGG CAACGAGGTG
GACTTCCCCG GAGCCGAGGG CGAGAGCGCC GACCGGCCGC GCCCGGGCAG CACCTACTGG
TCGGAGGTCA GCCCCGGCGC GACCTCGCCC GGCGTCCACG TCTACTACTC CCACGAGGGC
CAGCTGTGGT CGGCCTCGGA CTGGGACACC GACACGTTCG GCAGCGGCGA GCCGGTACCC
GGGCCCCTGG GATCGGGCGA GGTCCCCCTG GGCAGGTTCG CTGTGTCGCT GGACGAGCAG
ACCATCGCCG GGGTCACCAC GGGCGGCCGC GAGGTCGTCA CGAGCCAGGC CACCCCGGGC
GCCGACGTCC GGGAGGTCCT GGCCGACGGG GTCTTCACCG AGCTGTCCTG GGACGTCAAC
GGCGACCTGT GGGTGGTCGA GGAGGTCGAC GACGAGGACG GGGAGGACGG CGAGGAGGCC
CGGGAGGACT CCGACGCCGA GAGCCCCAAC CTCAACGGGC CGCCGCCCTC CCCGGGGCCG
ACGGACCTGT GGCTGCTGCG CGACGGCCAC GAGGTGGTCC GCGTGGACGT GTCCGCCCTG
CGCGACAGGC CGCTGGTGCA GTTCCAGATC TCCCGGGACG GCACCCGCGC CGCGGTGGTG
ACCGAGGTGG ACGGGCGCCG CTCGCTCCAG GTGGGCCGCG TGGTGCAAGG CGCCGACGGG
CAGGTCTCCG TGGAGTCGTT CGTGACCCTG GCCCGCGAGC TGGAGGACGT CACCGACATC
TCCTGGCGCT CGGGCGACCA GTTGGTCGTG CTCGGCACCC GCGAGGGCGG TACGAGCCAG
GCCCTCCTGG TGTCGCTGGA CGGCGGCACG CCGCCCGCCA GCGCCGGAAC CCCCGTCGCC
AGCATGGTGA CCGTCTCCGG AGCCCCGGGC CAGCCCCTGG TGGCCGGTTC GGACGACGGC
AACATCTGGG TGTCCAGCGA CCCGCTCAAC TGGCAGAGCG TCGTGGAGGG CGGCTCGCCC
GCCTTCCCGG GCTGA
 
Protein sequence
MRTPHRFGRA ARACLAAAVA VVSMAACATV PTTGPVVASD GNESEGDPYG GYVRLLPAGP 
QEGVAPEGLV SDFLKDMGSF EQDYEAARSY MLSSTDQSWN PDGTVKVFPG HDTVDLDTEI
SGDGLTATVR MRSSLVATID EDGRYVSGDS GRLLDETFVL AREEEDPEGE WRIQSLPDEL
ILSQLDVERT HRPFNLYYFN PDENALVPDP VYLPVSNDEL TKRLLHRLVR GPSTWLDPSV
HSSFAEDADP EVEVEEDRVT VSVTAPGQAD EFGMGAQIAW TLRQLPEIQE FTLRVNGNEV
DFPGAEGESA DRPRPGSTYW SEVSPGATSP GVHVYYSHEG QLWSASDWDT DTFGSGEPVP
GPLGSGEVPL GRFAVSLDEQ TIAGVTTGGR EVVTSQATPG ADVREVLADG VFTELSWDVN
GDLWVVEEVD DEDGEDGEEA REDSDAESPN LNGPPPSPGP TDLWLLRDGH EVVRVDVSAL
RDRPLVQFQI SRDGTRAAVV TEVDGRRSLQ VGRVVQGADG QVSVESFVTL ARELEDVTDI
SWRSGDQLVV LGTREGGTSQ ALLVSLDGGT PPASAGTPVA SMVTVSGAPG QPLVAGSDDG
NIWVSSDPLN WQSVVEGGSP AFPG
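
The gene sequence begins with GTG while the protein sequence begins with M, which is what translation table 11 allows for an initiation codon. The sketch below illustrates this with a minimal translator. It assumes the standard codon assignments (which table 11 shares for elongation) and treats only ATG, GTG and TTG as start codons, a simplification of the full table 11 start set. The `translate` function and the constant names are hypothetical helpers, not part of any library.

```python
# Standard codon assignments, ordered by base1, base2, base3 over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a simplified subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A recognized start codon in the first position is read as M.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        codon = cleaned[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First bases of the gene sequence above.
print(translate("GTGAGGACAC CGCACCGGTT"))
```

Applied to the opening bases of the gene sequence, the sketch yields the same leading residues as the protein sequence listed above.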