Gene Ndas_4361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4361 
Symbol 
ID9248236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5195210 
End bp5196607 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content67% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682256 
Protein GI297563282 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.77847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTGG GTTTCCGCGT CGAGAACCAC AAGTCCATCC GGGAGGAGCA ACAGCTCCTG 
CTCACTCCCG TCTACGACGA CGCCCGCCCG CAGGAGGCCG GCTGGGAGGC CACCACGGTC
GCGGGGGTCT TCGGGGCCAA CGCCTCGGGC AAGTCGAACC TGCTGGACGC GCTCTCCTTC
ATGCGGGACA CCGTCCGGTG GTCGATGAGC CACAACGAGC CGGGGAGCGG AATCCAGCGC
CACCCCTTCA AGTTGAGCGC GGACGCCCGG GAGGAGCCCT CCACGTTCGT GGTCGACCTC
GTGATCGACG GTGTCCGCCA CACCTACGGA TTCGGGGTCG ATGACGAGCG GGTCGTGGAG
GAGTGGTTGT ACAGCTACCC CAGACAGCGC AGACGCGTCG TGTTCGAGCG TGAGGGGGAG
GAGTTCTCCT TCGGTGACCA GACCTCGGGG AAACTGCGCC AGGTCAAGGA GATCACCGGT
CGGAACGTCC TCTTCCTCAC CGTCGCCGCC CGCGCCTCGA ACGCGGAGGT GGAACCGGTC
TACCGCTGGT TCTCCGAGGG CCTGGTGTCC GCCACTGAAC GCAGCCCGGA CCACCCGGCC
TGGTTGCGTG GAGGAGCGGC CTCCGAGGAG CGCATGACCG CTCTCGGTCG CCTGCTGAAG
TCCGCTGACA CCGGCATCGA GGCCGTGGAA CTGCATGAGC AGGGTTCCGG TTCCGGGCCC
GGGACGGCAA CCTCCAAAGC AGTGCTGGCC GCAGGGGAAT GGCCCCGTCT GTGGGCTAGG
AAGCAGTCCT CACAGGAGCG GCGGGACGAC CGCGGATCCG GTGTCGTCCA CCTCGCGACG
GGCAGCGGTA AGTCGTCCTA TGTGGCGCTC CTCGCAGACC TGATCCGCGA ACGCACCACG
CTCCTCTTCC ACCACCGAGG AGACGAGTCC GCGACCCCGC TCCTGTGGGA GGAGGAGTCA
CTGGGCACCC GAGCGTTCAC CACGATCGGC TTCGACGCCC AACGCGCCCT GGAAGCAGGC
GGTGTCCTCG TGGTCGACGA GATCGACGCC AGCCTCCACC CCTACCTCTC CGCCAAGGTC
ATCTCCCTCT TCCAGGATGA AGAACACAAC CCCAAGGGCG CCCAACTGAT CTTCACCAGC
CACGACGCGG CCCTGCTCGG ACGCGTACGC GGTGAGGAGG TCCTCAAACG CGACCACATC
TGGTTCGTGG ACAAGGACGA CCGTGGACGG ACCTCGCTCT ATCCGCTCAG CGACTTCAAG
CCCCGGGGGG ACGACAACCG CGCCCGGCGC TACCTCACGG GCCGCTACGG CGCGGTTCCG
GACGTGGACG ACGAACTGTT CCGGGACGCC CTGCACCGGC GCGAGCAGTC ACGGGAGTCG
GAGGAAGCCG CCCCGTGA
 
Protein sequence
MLLGFRVENH KSIREEQQLL LTPVYDDARP QEAGWEATTV AGVFGANASG KSNLLDALSF 
MRDTVRWSMS HNEPGSGIQR HPFKLSADAR EEPSTFVVDL VIDGVRHTYG FGVDDERVVE
EWLYSYPRQR RRVVFEREGE EFSFGDQTSG KLRQVKEITG RNVLFLTVAA RASNAEVEPV
YRWFSEGLVS ATERSPDHPA WLRGGAASEE RMTALGRLLK SADTGIEAVE LHEQGSGSGP
GTATSKAVLA AGEWPRLWAR KQSSQERRDD RGSGVVHLAT GSGKSSYVAL LADLIRERTT
LLFHHRGDES ATPLLWEEES LGTRAFTTIG FDAQRALEAG GVLVVDEIDA SLHPYLSAKV
ISLFQDEEHN PKGAQLIFTS HDAALLGRVR GEEVLKRDHI WFVDKDDRGR TSLYPLSDFK
PRGDDNRARR YLTGRYGAVP DVDDELFRDA LHRREQSRES EEAAP