Gene Ndas_4123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4123 
Symbol 
ID9247997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4924976 
End bp4926574 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content69% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682024 
Protein GI297563050 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.433169 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTCCT CACGCCCCGA GACCGGCGGC ACGCCCGAAC CGGTCCGCCA GTCCCCCACC 
GACGCACAGC CACCCTTCGT CCCCGCCCAG AGGCAGGCCG CTCCCCCCGA GGCCGAGGCC
GCCTCCGGTA CGGCCGCTCC CGGCGGCGCT CCGCCCGAGG AGCGCGTGTG GAAGGCGGGC
CGGCTGGAGC CGATGCCCAT CCGCCCCCTG CCCAAGGCGC CGCCCTCGAT CCACATCCTG
GGCCCCACCG TGTTCCTCGT CGCCCTCGGC GTGGGCATGG GCGAGTCCTA CATGTGGCCG
AGGCTGGTCC TCGTCTTCGG CCCCGAGATC CGCTGGCTGT TCCTGGTCGG CGTCACCCTC
CAGGCCGTGG TCATGCTGGA GATGTCGCGC TACGCGATGG CCACCGGGGA GAGCATCTTC
TACGGCGCCG CGCGCGTGTT CAAACCGCTC ATGTGGTTCT TCTTCATCAC GGCGATGCTG
GTCTACATCT GGCCGGGCCA CCTCTCGGCG GGCGCCTCCG CCTTCGAACG GGTGACCGGC
ATCCCCTGGC AGGCCACGGC GGTCGCCGGG ATGCTCCTCG TCGGTGTGGT GTTCACCCTG
GCGAAGGTCA TCTACAACCT GCTGGAGAAC GTGCTGTCGA TCTGCATCGG CATGCTGGTG
GTCGGCAGCG CCGTGATCGC GGCCATCGTG GGCGACCTGT CCGACCTGAC CTCCACCCTC
ACCGGGATGT TCGCGTTCGG CTACCTGCCG GAGGAGGCCA CCACGGCCCT GTGGTTCCCC
GTGATCGTGG GCTCCATCGC CTTCGCCGGG CCCTCGGGCA TGCAGCAGAT GTGGTACACC
CTGCACCTGC GCGACAAGGG CGCCGGGATG GGCTCGCACA TCCCCAAGAT CCGGGGCCTG
CGGCACGCGG GCGAGCAGGA GGCCATGCCC ACGCGCGGCT TCATGTTCGA CACCTCCGAC
GCCTCGGAGA TGGAGAAGTG GAAGGGCTGG CGGCGCTGGG TCACCTTCGA CGCGATGGTC
CTGTTCTGGG GCATCACGAT GCTGGTGACG ATCTCCTTCA CCGTGCTGGC CCAGGCCTCG
GCCCGCTTCG ACCCGAACGT GACCGACCTG CTGCGCGACG GCGACCGCGA CGCCGCCCTG
GACGCCATGG CGGCCTCCTT CTCGGCGGCC GGGAGCCCGG TCCTGGGCAC GGTGTTCTTC
TGCTTCATCG CGCTCATCGG CCTCAACGCC ACGCTGGGGC TGTTCGACTC CTTCTCGCGC
GGCCAGGCCG ACATGACCTT CAACTTCGTG CCGGGCGCCA AGAAGGTCGG CATGTCGAGG
CTGTACGCCC TCTTCCTGTG GGGCCTGATC GCCTTCGGCA TCGTCATCCT GCTCTTCGGC
CCCGCCGACG GCCCGGCGGC GATCCTGGAC GTGCTGGCCT TCCTGTCGGC GTTCGCGATG
GGCGCCTACT GCGTGGTGCT GCTGCTGGTC AACAACCTCA CCCTGCCCAA GCCGATCCGG
CCGGGCATCC TCTCCAACGC CGTCATCGCC TTCGCGGCGG TGTTCTACCT CGGCGCCCTG
TTCTACTCGC TGTTCGCCTT CGGGGTCGTG ATCGACTGA
 
Protein sequence
MDSSRPETGG TPEPVRQSPT DAQPPFVPAQ RQAAPPEAEA ASGTAAPGGA PPEERVWKAG 
RLEPMPIRPL PKAPPSIHIL GPTVFLVALG VGMGESYMWP RLVLVFGPEI RWLFLVGVTL
QAVVMLEMSR YAMATGESIF YGAARVFKPL MWFFFITAML VYIWPGHLSA GASAFERVTG
IPWQATAVAG MLLVGVVFTL AKVIYNLLEN VLSICIGMLV VGSAVIAAIV GDLSDLTSTL
TGMFAFGYLP EEATTALWFP VIVGSIAFAG PSGMQQMWYT LHLRDKGAGM GSHIPKIRGL
RHAGEQEAMP TRGFMFDTSD ASEMEKWKGW RRWVTFDAMV LFWGITMLVT ISFTVLAQAS
ARFDPNVTDL LRDGDRDAAL DAMAASFSAA GSPVLGTVFF CFIALIGLNA TLGLFDSFSR
GQADMTFNFV PGAKKVGMSR LYALFLWGLI AFGIVILLFG PADGPAAILD VLAFLSAFAM
GAYCVVLLLV NNLTLPKPIR PGILSNAVIA FAAVFYLGAL FYSLFAFGVV ID