Gene Ndas_4028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4028 
Symbol 
ID9247900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4820534 
End bp4822771 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681931 
Protein GI297562957 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.304691 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACAGA CCAATCACAA CTTCCTCAAC GGAGGAGGTC AGGCGGGGAT CTCCGACCGC 
ATGCGTGACC TGCTCTCCCA GGCCGCCCAG GAGCACGTCT CCGAGCAGAA GTCGCAGGGC
GCCGTCAGCG AGGAGATGCG CCAGCGCCTG GAGGGCATGG AGTGGCTGCT CCGCGAGCTG
CGCGAGCGCG AGCTCACCGC CCTCACCGAG TCGGTCGCCA CGGTCAACGG CCGCGTCGAC
GAGTTCCTCG CCCGGCCGCC GGAGTGGGCG GAGACCCTCG CCGAGCACAT CGAGGTCGTC
GCCCAGCAGG TCAAGCCCCT CTCGGACCTG CCCTCCCTGC GCGCGGACAC CCACCGCATC
GCCGGTCACC TGGACACCGC GCTCACCCGC CTCCAGCGGA TGGCGGAGAC GGGCAACCGC
ACCTCCGAGC AGGTCAACGA GCTGGGCGAG CGCCTGACGG GGCTGGACGA GAGCCTCGAC
ACCCGTCTGA ACGGACTCGG AGAGACCCTC GGCGCCCGCC TGGACGGGCT GAGCGAGACC
TTCGGCGCCC GCCTGGACGC GCTCGACACC GCGCTGGCCT CCCTGACCGC CAGGACCGAG
GCCGTGCAGG CGTCGCTGTC CGCGCTCTCG GAGTCGGCGG AGAGCCGCCA CGAGGCCCTC
ACCGCCGCGG TCGGCGAGAG CCGCACCGAA CTGGCCGAGG CGCTCGCCGG CGGGCGGACC
GAGATCTCCG AGGCCCTGGC CGAGGGCCGC ACCGAGCTGG CAGAGGCCGT GGCCGCGAGC
CGCGAGAGCC TCACCGCCGT CGTCAACGAG AGCCGCGAGA GCCTCACCGC CGTCGTGAGC
GAGAGCCGCG AGGCCCTGGC CGCGGCCGAC GCCCGGCTGG GCGAGACCGT CGAGGCCAAG
GCCGAGGAGG CCGCCGCCAG GGCGGAGCAG GCCAACGGCG AGCTGCGCAC GCTCGTGGAG
GAGCGCACCG AGCGTCTGCG CACCTCGATG GAGGAGCGCA CCGCCGAGCA GCACGAGACG
CTGCTGTCGC GCCTTCAGGA GAACACCGGC GAGCTGGGCG AGCGCACCGC CGCCCTGACC
GAGGAGCTGG GCGCCCTGTC CACCTCCTCC GCCGAGCGCC ACGAATCCCT CACCGCCAAG
CTGGCCGAGA GCGTCGGCTA CCTCAGCACC CAGGCCGAGG AGAACAACGC CGAGACGACC
GCCGCGGTCG CCGACGTCAC CGGCGCGCTC GCCAAGCTCC GCGAGGACCA CGAGTCGTCG
CTCACCGAGC TGCGCACCCA CCTGCGCCAG CGCCTGGCCG AGCTCAACGA GCAGATGGAG
CTGGGCCGCA CCGAGACCCG GGAGCAGGCC GAGGCCGCCT CCGAGCGCCT GCGCGAGGCG
GTGACCGAGC GCACCGACGC CCTGGGCGCC CGGATCGCCG AGGACGCCGA GCGCGTCACC
TCCGACTTCG CCGAGCTGCG GACGTTCGTC CAGGAGAACG GCGACCGGCT CGCCACCGAG
TCCGAGGAGC GTTCGCGGAC GCTGACCGAG TCGCTCACCG AGCGGGTGGA GGCGCACCGG
ACGGCGCTGG ACGAGCGTCT GGAGCGCCAG CGCGAGGCCC TGACCGGCAA GGTCGACACC
CACCTTGCGC AGATCACCGG CAAGGTGGAC CACGAGCTGG GCCGCCTCAC CGACCGCTTC
GACACCTTCG AGGGGCACTT CGAGGGAAGC TTCGAGGGGG TCGAGGGCAA GCTCGACCGC
ATCGACGGCC GCATGGACGG CGTCAACGGC CGCCTGGACG GCCTGGACGG CCGGGTCAAC
GGTGTCGAGG GCCAGTTCGA GGGCGTCAGC GGGCACTTCG AGGGCGTGGA CGGCCGTATG
GAGGCCCTCG ACGACCGGCT GGAGGCGCTC AACCAGCGGC TCAACCAGCT GCCGCAGACC
ATGGAGGTCA GCGAGCTGCA CCGCCGCCTG ACCGAGCTGG TGGAGCGGCC GCAGCTGGAC
CACACCGGCA AGCTGGACGA GATCGACGAG CACGTCACCT CGGCCGTGGC GCCGGTGCTG
CGCGAGCTGA AGCAGCGCCC GGACCGGCAC GAGCTGGAGG AGACCGTCAC CGAGGCGGTC
GAGAACTCGC ACGACGACAT CACCAAGCGG TTCGCCTCCC TGGAGGAGAC GGTGCTCGCC
CTGGCCGAGG CGCTGCTGCG CCCCGGCCGG GACGGCAAGA AGAAGCGCCG CCGCGACGAG
GACGAGGACG ACGAGTAG
 
Protein sequence
MEQTNHNFLN GGGQAGISDR MRDLLSQAAQ EHVSEQKSQG AVSEEMRQRL EGMEWLLREL 
RERELTALTE SVATVNGRVD EFLARPPEWA ETLAEHIEVV AQQVKPLSDL PSLRADTHRI
AGHLDTALTR LQRMAETGNR TSEQVNELGE RLTGLDESLD TRLNGLGETL GARLDGLSET
FGARLDALDT ALASLTARTE AVQASLSALS ESAESRHEAL TAAVGESRTE LAEALAGGRT
EISEALAEGR TELAEAVAAS RESLTAVVNE SRESLTAVVS ESREALAAAD ARLGETVEAK
AEEAAARAEQ ANGELRTLVE ERTERLRTSM EERTAEQHET LLSRLQENTG ELGERTAALT
EELGALSTSS AERHESLTAK LAESVGYLST QAEENNAETT AAVADVTGAL AKLREDHESS
LTELRTHLRQ RLAELNEQME LGRTETREQA EAASERLREA VTERTDALGA RIAEDAERVT
SDFAELRTFV QENGDRLATE SEERSRTLTE SLTERVEAHR TALDERLERQ REALTGKVDT
HLAQITGKVD HELGRLTDRF DTFEGHFEGS FEGVEGKLDR IDGRMDGVNG RLDGLDGRVN
GVEGQFEGVS GHFEGVDGRM EALDDRLEAL NQRLNQLPQT MEVSELHRRL TELVERPQLD
HTGKLDEIDE HVTSAVAPVL RELKQRPDRH ELEETVTEAV ENSHDDITKR FASLEETVLA
LAEALLRPGR DGKKKRRRDE DEDDE