Gene Ndas_5291 details


Gene Information

Locus tag: Ndas_5291
Symbol:
ID: 9249189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 454037
End bp: 455743
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003683177
Protein GI: 297564204
COG category:
COG ID:
TIGRFAM ID:
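
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are inclusive and that the coding sequence ends with a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 454037
END_BP = 455743


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))  # 1707 568
```

Under these assumptions the result agrees with the listed gene length of 1707 bp and protein length of 568 aa.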


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.872454
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCACTC CCACCGGTAA GACAACCCGG GCCGAACGGC TCGCGCAACC GCTCGCCCGT 
GAGGTGGCCG AACAGGTCGC CGCCGACCAC GGCGTGTGCA TCCGCCCCGT GTCCCTTCGA
CGCACCAACA TCGCCACAGG TGCCGCCGAG GTGATCGACG TTCCGTGCGG GTCCACGCTT
GAATCGCGGT GTCCGGCCTG CGCCAAGAGG AAGCGCAGCC TTCGCCGCTC TCAGTGTGAA
GAGGGCTGGC ACCTGGTTAC CGAACCTGTG GTGGAGCCTG ATCCGCCGTC CGAGGAACAA
CGCGGTTGGG TGGAACAGCG GGCGTTGGTC ACCGCCGAAC GGGATCGCCT CGCTGCCACC
GGGGTCGCTG ACCCGGAACA GCTCTCGGCG CTGGATGCGG CGATCGCGGA CCTGGACGAG
GAGATCACCG CCTCCGGTCT TCGAGGTTCG GTCACCCGCT CCGGCGACTC CTCGGGTTCG
TCGGGGCCGC GTCGGGTGCG CTCGACCAAA CGCCGTCAGG ACGTGCCCGA CCTTCCCAAG
CGACCTATGA CGAGGAAGAC GGTCGGGCGG GCCTTCACTG ACCCGGCCTC GGGCAAGGTG
TTTCGGCCCT CGCTGTTCAT CACCCTGACG TTGGACTCCT ACGGGCGGGT GCGCTCGGAC
GGCACCCCGG TCGATCCCTC CACGTACGAC TACCGGCGGG CCGCCCGGGA CACCCTGCAC
TTCTCCAAGC TGGTGGATCG GTTCGTGCAG AACCTGCGCC GCGTGGCCGG GTTCGATGTC
CAGTACTTCG CCGCCGTGGA ACCCCAACGA CGGTTGGCCC CGCACCTGCA CATGGCCACA
CGCGGCACCA TCCCCCGGGC AGAGCTGCGC CAGATCGCCG CCGCGACCTA CCACCAGGTG
TGGTGGCCTA CCGCCGACCG CGTCGTCTTC GACGGTGACC ACCTGCCGGT CTGGGACGAG
GACGCGGGTA CCTATCTCGA CCCGACCACC GGGGAAGTCC TGCCCACGTG GGACGAGGCA
CTGGACGCCC TCGACGACGA CCCGGACGCG GAGCCTCATC ACGTGGTGCG CTTCGGCCGA
CAGGTGGACG CCAAGGGTGT GGTCGCGGGC TCGGAGGACG CTTCGCGGTG TGTGCGCTAC
CTGGCCAAGT ACCTGACCAA GGACATCGCC GACTGCCACC AGGTGGAGAC CACCCGACAG
GAACAGCACG TGGACCGGCT CCTCGACGCC CTGCGCTTCG AACCGTGCTC GCCTCGATGC
GCGAACTGGC TGCGCTACGG CATCCAACCC GACGACGCCA AACCGGGACA GCGCCCCGGG
TTCTGCCGGT CCAAGGCCCA TCGCCGCGAA CACCTGGGCT ACGCGGGCCG CCGCGTCCTG
GTCTCGCGCA AGTGGTCCGG CAAGACTCTG GCCGACCACA AGGCGGACCG GCTCGCCTGG
GTCCTCAACG CCCTCGGAAT TAACGGCGTT AATAGCGACC CCGCCAACGA GGACCAAGGC
GAGGAGGACA CTCCGCGTCC CCCCGTGCTG TCCTCGGTCG CCTCGGGCTC CTTCGAGTGG
GAGTTGGCCC GGCCCACCGA CCCCGACGTG GCTCCTCGGG AGCAACGCCT CCTGCGCGCG
GTGGGCGAAG CCCTCAAACG CCGTGCCCAA CTCGACGCGG CACGGCGGAA CGATCTTTCG
GCAATCCACG GAAGCAGGAC GGCATGA
 
Protein sequence
MPTPTGKTTR AERLAQPLAR EVAEQVAADH GVCIRPVSLR RTNIATGAAE VIDVPCGSTL 
ESRCPACAKR KRSLRRSQCE EGWHLVTEPV VEPDPPSEEQ RGWVEQRALV TAERDRLAAT
GVADPEQLSA LDAAIADLDE EITASGLRGS VTRSGDSSGS SGPRRVRSTK RRQDVPDLPK
RPMTRKTVGR AFTDPASGKV FRPSLFITLT LDSYGRVRSD GTPVDPSTYD YRRAARDTLH
FSKLVDRFVQ NLRRVAGFDV QYFAAVEPQR RLAPHLHMAT RGTIPRAELR QIAAATYHQV
WWPTADRVVF DGDHLPVWDE DAGTYLDPTT GEVLPTWDEA LDALDDDPDA EPHHVVRFGR
QVDAKGVVAG SEDASRCVRY LAKYLTKDIA DCHQVETTRQ EQHVDRLLDA LRFEPCSPRC
ANWLRYGIQP DDAKPGQRPG FCRSKAHRRE HLGYAGRRVL VSRKWSGKTL ADHKADRLAW
VLNALGINGV NSDPANEDQG EEDTPRPPVL SSVASGSFEW ELARPTDPDV APREQRLLRA
VGEALKRRAQ LDAARRNDLS AIHGSRTA
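
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, so a standard codon table is enough here because this gene starts with ATG. The helper name `translate` is hypothetical, and the two test strings are the first 30 and last 27 bases of the gene sequence above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored. Alternative start codons are not handled specially.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCCCACTC CCACCGGTAA GACAACCCGG"))  # MPTPTGKTTR
print(translate("GCAATCCACG GAAGCAGGAC GGCATGA"))     # AIHGSRTA
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.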