Gene Ndas_2791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2791 
Symbol 
ID9246642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3333363 
End bp3334673 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003680710 
Protein GI297561736 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.55279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.125183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACCAC TGAGCATCCT CTCCTCGACG CTGGCCGTCA CGGTTCTCCT CACCTCCTGC 
TCCAGCGGCA CGGACGGCTC CGACCGGGCC GTCACCCCCG GGGTGACGGC CGACGCCGTC
GTCATCGGCA CGCACCAACC GCTCACCGGA GCGGCCTCCC CGGGTTTCCG CCACGTCTCC
ACCGGCGCCC GCGCCGTGTT CGACTACATC AACGACAACG GCGGCATCCA CGGCCGCCGG
ATCGAGTACC AGGTCCAGGA CGACGCGTTC GACCCCGCGC AGACCCAGGA GGCCACCCGC
AGCCTCATCG ACGACCAGGA GATCTTCGCC ATGCTCGGCG GCCTGGGCAC CCCCACCCAC
GAGGCCGTGA TCGAGGAGCT CAACGAGGCG GGCGTCCCCG ACCTGTTCGT CTCCTCCGGC
GCCCTGGCCT GGGACCAGCC CGAGGTCTAC CCCCACAGCT ACGGCTTCCA GGTCGACTAC
ACCCGGGAGG CCAAGATCCA GGGCCAGTAC ATCGCCGAGA ACTTCCCCGG CGACAGGGTC
GGCCTGCTCT ACCAGAACGA CGACGTGGGC CCCTCCTCTC ACGCGGGGAT CGAGCAGTAC
CTCACCGAGG AGATCGTGGC CTGGGAGTCC TACGACCCCG GCGTCCCCGA GCTCGCCGGA
CAGGTCGAGG AGCTCAAGCG GTCGGGCGCC GAGGTCGCCG TCTGCCACTG CATCCCCGCC
TTCCTGGCCC TGGCCGTCCT GGAGGCCACC GCGATCGGCT ACACCCCGCA GTGGGTGGCG
CCCAGCTTCG GCGGCGACGT GGCGGTGGCC ACCGGCCTCA TCGAGGAGTA CGCGCAGGGC
ACGGCGGCCG AGAACGTCCC GCCCGAGGCC TTCCTGGACG GTCTGATCAT CACCGCGTTC
CTGCCGATGG CCGCCCAGCG CGAGGACCCG TGGACCGAGT TCTTCCTGGA GATCCACGAG
AGGTACAACG AGGGCACGCC CTTCACCGAC ACCACCGTCT ACGGCATGGT GCAGGCGGTC
CTGTTCGCCC AGGTGCTCAT GGAGGCCGGC CCCGACCTGA CCCGGGAGAG CCTGCTCGGC
ACCCTCAACT CCCACGAGTG GACGGGGCCC GGCCTGGTGC CGTTCAACGC CACAGAGGAC
GACCACAGCG GCTACGCCGG GGTGATGGTG GTGCAGCACC ACGCCGGCGA GGAGCCCGAG
ATCCTCCAGG AGCCCATGGT CACCGACAGC GACGGCGGGG AGGTCCTGCC CTTCGAGCTG
GACCGGCCCT CGCCCGACGA GGTGTCCCTC TTCGGGGGGG CCGGCGGCTA G
 
Protein sequence
MRPLSILSST LAVTVLLTSC SSGTDGSDRA VTPGVTADAV VIGTHQPLTG AASPGFRHVS 
TGARAVFDYI NDNGGIHGRR IEYQVQDDAF DPAQTQEATR SLIDDQEIFA MLGGLGTPTH
EAVIEELNEA GVPDLFVSSG ALAWDQPEVY PHSYGFQVDY TREAKIQGQY IAENFPGDRV
GLLYQNDDVG PSSHAGIEQY LTEEIVAWES YDPGVPELAG QVEELKRSGA EVAVCHCIPA
FLALAVLEAT AIGYTPQWVA PSFGGDVAVA TGLIEEYAQG TAAENVPPEA FLDGLIITAF
LPMAAQREDP WTEFFLEIHE RYNEGTPFTD TTVYGMVQAV LFAQVLMEAG PDLTRESLLG
TLNSHEWTGP GLVPFNATED DHSGYAGVMV VQHHAGEEPE ILQEPMVTDS DGGEVLPFEL
DRPSPDEVSL FGGAGG