Gene Ndas_4161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4161 
Symbol 
ID9248035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4968044 
End bp4969276 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content74% 
IMG OID 
Productprotein of unknown function DUF1006 
Protein accessionYP_003682062 
Protein GI297563088 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAAC CGCACCCCCG CCGCGACGCG GAACTCTCCC TCTCCCAGGC GCGCAGGATC 
GCGCTCGCCG CCCAGGGGTT CAGCGACCCC CGGCCCGCCG GGAAGCCGAC CCTCGCCCAC
CTGGCGCGGG TGGTGCGCAG GGTGGGCATC CTCCAGATCG ACAGCGTGAA CGTGCTGGCC
CGCAGCCAGT ACCTGCCGGT CTTCGCGCGC ATGGGCGCCT ACGACACCGC CCTGCTGGAC
CGGGCGACCA CCTCCGCGAC CGGGTCGGGG GCCGCGCGGC TGGTGGAGTG CTGGGCGCAC
GAGGCCAGCC TGGTGCCGCC CTCCACCCGC CAGCTCATGC GCCACCGGAT GGAGCGCAAC
CGCGCCCAGG AGCGCGTGGG GTGGATGGAC CGCATCCGGG AGGAGAAGCC CGAGCTGGTC
AAGGCGGTGC TGGAGGAGGT GGTGCGGATC GGCCCGGCCA GCGCCCGGCA GGTGGAGGCG
GCGCTGGCGC ACGACGTGCC GCGCGCCAAG GACCACTGGG GGTGGAACTG GTCCGAGGTC
AAGAGGTGCC TGGAGTACCT GTTCTGGGTG GGCGACCTGA CCTCCAACGG GCGCAACACG
CAGTTCGAGC GGCTCTACGA CCTGCCCGAG CGGGTGCTGC CGCCGGAGGT GTACGCGGCG
CCGGAGACCA CCCCGGAGGA GGCGCACCGG GAGCTGGTGT CGATCGCGGC CCGCGCCCAC
GGGGTGGCCA CCGAGGCGTG CCTGCGCGAC TACTTCCGCC TCAAGTCCAC CGAGTCCCGG
GCGGCCGTCT CCGACCTGGT GGAGGCGGGT GAGCTGGTGC CGGTGCGGGT GCGGGGCTGG
GACCGCACGG CCTATCTGCA CCGGGACGCC CGGGTGCCGC GCAGCGTGCG GGCGCGGGCG
CTGCTGAGCC CGTTCGACTC GCTGGTGTGG ACGCGCGACC GGGCGGAGGA GCTGTTCGGG
TTCCGGTACC GGCTGGAGAT CTACGTGCCC GCCGCCAAAC GGGTGCACGG CTACTACGTG
CTGCCGTTCC TGCTGGGCGA GGACCTGGTG GCGCGGGTGG ACCTCAAGGC GGACCGCGCC
TCGGGGACCC TGCTGGCGCA GCGGATCACG CTGGAGGACG GCGCGCCCGC CGAGACGCTG
CCCGAACTGG TCGAGCAGCT GACCGAGATG GCGGCCTGGC TGGGCCTGTC CAACGGCGTC
GGCGGCCCGG CGCTGGCCTC CGGCGGACGC TAG
 
Protein sequence
MAEPHPRRDA ELSLSQARRI ALAAQGFSDP RPAGKPTLAH LARVVRRVGI LQIDSVNVLA 
RSQYLPVFAR MGAYDTALLD RATTSATGSG AARLVECWAH EASLVPPSTR QLMRHRMERN
RAQERVGWMD RIREEKPELV KAVLEEVVRI GPASARQVEA ALAHDVPRAK DHWGWNWSEV
KRCLEYLFWV GDLTSNGRNT QFERLYDLPE RVLPPEVYAA PETTPEEAHR ELVSIAARAH
GVATEACLRD YFRLKSTESR AAVSDLVEAG ELVPVRVRGW DRTAYLHRDA RVPRSVRARA
LLSPFDSLVW TRDRAEELFG FRYRLEIYVP AAKRVHGYYV LPFLLGEDLV ARVDLKADRA
SGTLLAQRIT LEDGAPAETL PELVEQLTEM AAWLGLSNGV GGPALASGGR