Gene Ndas_3077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3077 
Symbol 
ID9246933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3679149 
End bp3680816 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content62% 
IMG OID 
Productprotein of unknown function DUF262 
Protein accessionYP_003680992 
Protein GI297562018 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.392398 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAGC TCGAAGCCCA CGAGGTCCCC CTGCACAAGG TCTTCTCCAG CGACTACGAC 
TTTCGCATCC CCGACTACCA GCGCCCCTAC GCCTGGGAGG CCGAGGACGC CCTCCAGCTT
CTCGACGACC TCAAGGAAGC TCTGGAACGC GACAGGGAAG AGCCCTACTT CCTCGGATCG
GCTGTGCTGG TCAAGAGCAA GGAATCGGCC ATTGCCGAGG TCATCGACGG CCAGCAGCGC
CTGACGACCC TCACCATCCT GTTCGCGATC CTGCGTGACC AGACCAAGGA CGCGGAGCTG
CGTACCGAGC TGGAGAAGAT GGTGGTGGAG CCGGGCAGTA AGATGCTCAA GCTGGACCCC
AAACCCCGGT TGGCGCTACG GCCGAAGGAC GTGGAGTTCT TCCGCGAGCA CGTGCAGACG
ACTGGTTCCG TTCCCGGTCT GCTTGGCCTT CCCCGTACGG CTCTGAAGAC CAGTGCTCAG
GAGGCGGTAC AGACCAACGC GAAGGTTCTG TCCCGTGCGC TTGAGGGGTG GTCCGACGAG
CGTCGGTTGG AACTCGCCGG AATGCTCAGC GCGCGAACCT ACCTCGTCGT GGTCAGCACT
CCAGACCTGA ACAGCGCACA CCGTATCTTC AGCGTCATGA ACGCCCGAGG ACTCGACCTG
TCCCCGGCCG ACATCTTCAA AGCGAGGATC ATCGGTGACC TGGATCCGAA GCTCAGCAGC
ATGTACGCGG CCAAGTGGGA GGACGCCGAG GAGTCGCTGG GACGCGACGA CTTCGCCGAT
ACCTTCCTCC ATTTGAGGCT GATCTTCTCG GGTGAACGTG CTCGGCGGGA GCTGTTGCTG
GAGTTTCCCA AACAGGTCCT CTCGCGCTAT CTGCCGGGCA ACGGCGCGGA GTTCATCGAT
GACGTCCTGA TTCCCTACAC CGACGCCTAC GCTCAGATCC GCGATCAGAG CCACTCCTTC
CCAGCCGGGG CGGACAAGGT CAGTGCCTGG TTCAAGCGCT TGGAGCAGCT CGACAACAGC
GACTGGCGGC CGGCCGCGCT CTGGGCGGTG CGTCACCACC GCCACGACCC CGACTGGCTC
GACCAGTTCT TCCGCCGCCT GGAGCGGCTG GCTGCCAGTA TGTTCATCCG CCGGGTCTAT
CGGACACCCC GGATACAGCG CTACGTCGAA CTCGTACGTG AGCTCAACTC TGGTAAGGGC
TTGGACGCGC GTTCCTTCGA ACTCAGTGAA GAGGAGAAGC GCGCGACCCG GGCCGAACTC
GACGGTGAAC TCTATCTGTC CACCAAAGTC CGTCGCTGCG TCCTGCTCCG CCTCGATGAG
ATCCTCGCGG ACGAGTCCGG GGTCGTCTAC GAGTACGAGA CCATCACGGT TGAGCACGTC
CTTCCACAGA ATCCGGCCCC GGGATGGACG TCCTCCTTCA ACCAGGAACA GCGCGACTAC
TGGACTCACC GCGTCGGTAA TCTTGTTCTA CTCAACCGGA GGAAGAACTC ACAGGCACAG
AACTACGGCT TCCTCAGGAA GAAGGAGAAG TACTTCATGG GGAAGGGCGG AGTGGTGACT
TTCGCACTCA CCAGCCAGGT GCTCACCCAC TCCGAGTGGA CCCCTGAGGT GATCCAAGAG
CGTCAGGAAC GGCTGGTCGA GACGCTGGCT CGGGAATGGG ATCTGTGA
 
Protein sequence
MQQLEAHEVP LHKVFSSDYD FRIPDYQRPY AWEAEDALQL LDDLKEALER DREEPYFLGS 
AVLVKSKESA IAEVIDGQQR LTTLTILFAI LRDQTKDAEL RTELEKMVVE PGSKMLKLDP
KPRLALRPKD VEFFREHVQT TGSVPGLLGL PRTALKTSAQ EAVQTNAKVL SRALEGWSDE
RRLELAGMLS ARTYLVVVST PDLNSAHRIF SVMNARGLDL SPADIFKARI IGDLDPKLSS
MYAAKWEDAE ESLGRDDFAD TFLHLRLIFS GERARRELLL EFPKQVLSRY LPGNGAEFID
DVLIPYTDAY AQIRDQSHSF PAGADKVSAW FKRLEQLDNS DWRPAALWAV RHHRHDPDWL
DQFFRRLERL AASMFIRRVY RTPRIQRYVE LVRELNSGKG LDARSFELSE EEKRATRAEL
DGELYLSTKV RRCVLLRLDE ILADESGVVY EYETITVEHV LPQNPAPGWT SSFNQEQRDY
WTHRVGNLVL LNRRKNSQAQ NYGFLRKKEK YFMGKGGVVT FALTSQVLTH SEWTPEVIQE
RQERLVETLA REWDL