Gene Ndas_4324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4324 
Symbol 
ID9248199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5157203 
End bp5159044 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003682219 
Protein GI297563245 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGTGGG TTGGCTCCTG GCGCCCGGCC GTCGCGGTCG CGACGGCCGT CTTCGTGTTG 
TCCTCACTGG TCACGGCGCT TGTCGTCGCG TGGCCGGGTG GGGAGGCCCA CGCCGCGACG
CCAGCGGCAC CGGCTCCCAC AGTACCGGGG GGCTCCGTGG CGGCGGCCAG CGACCGGTCG
GCCCCGGTCA TCACCAACGA CGTCCGCCTC GAACTCGACC ACGCGGGCGT CCTCCACGGC
GAGGAGACCA TCTCCTTCTC CGGCGGCGCC CCGGCCACCT TCACCCGCTC CCTGACCGAG
ACGATGCTCT ACGACACCGA GTACGACCGG AAGTTCGAGG TCACCGGTGT GCGGGCCACC
GACCTGGACG GCGACGCCCT GGACGCCCGG GTGAGCAGGG ACGACGGCTC CCTGTCCGTG
GAGATCCCCA CCGAGGCCAC CTCCGGGGTC GTGCTCTCCT ACCGGGTGAG CGGAACCGTG
AGCGAGGTCT CCCAGGGCGT CCAGATGGAG TGGCGGGCCG TGGGCGCCTA CAGCCACACC
GTGGAGACCA CCGACGTGAC CGTCACCGCG CCCCTGCCGC CCGGGGCGCT GTCCTGCCTG
GCCGGGGAGC CGCGCAGCGC CATGTACTGC ACGGCCTCCG ACATGGGCGC CGACGCGGGC
GTGGCCCACT TCCTCCAGGC CGACATGGAG CCCAGCGACC GGCTCGACAT CGTCGTCAAC
TACCCGCCCG GCACCGCCGA GGGCGAACCC ATCCTCACCC GCCGCTGGTC CCTGGCCTCC
GCGTTCGCGG TCACCCCGGC CACCGCCAGC GTGTTCGGAC TGCTGCTGGC GGTGCTCCTG
GGCGGCCTGG TGGTGCTGAT CCGCGTCCGG GGCCGCGACG AGCGCGTCCT GCGCGACGAG
GCCTCGGCCG GAGACCACGC CCCGGTGGCC GAGGGCGAGC ACGGACGCCT GCGGTTCGCC
CCGCCCGACG ACGTGCACCC CGGACAGATC GGCACGCTCG TCAACGAGAC CGTGGACATC
ACCGACCTCA CCGGTGCCGT GGTCGACCTC GCGGTGCGCG GCTACGTGCG CCTGGAGGAA
CTGCCGCACG AGCACTTCAC GTCCGTGGAC TGGCGGCTGG TGCGCCTGGA CGGCCCCCCG
GAGGACACCC TCCGCTCCTA CGAGTGGCTT CTGCTGGACG CGCTCTTCGG CGGGCGCCCG
ACCGTGCGCC TGTCCCAGGT CGGCTCGCCC CGCACCTCCC CGGACTTCCC GGCGCGCATC
GACCGCGTGC GCGAGGAGCT GTACCGGGAC ATGGTGCGGC TGAAGTGGTT CGCCCGCTCG
CCCAGCCAGG TGCGCGGCCG CTGGAGCGCC GTCGGCATGG CGGTGACGGC CGCGGGCGTG
CTCCTGACCG GCGTCCTGGC GGTCTTCACC AGCGCGGCGT TCACCGGGCT GGCCGTCATC
ATCGCGGGCG CCGCCGTCAC GGCGGGCGCG CAGTACATGC CCGCCAAGAC CGCCCTGGGC
AGCTCGGTGT ACGCGCACAC GGTGGGCTTC CGCGACTACC TGCTGAGCCC GCGCTCGGCG
ACCGCTCCCC CGGGGCAGCG GGTGGAGCTG TACTCGCGCT ACCTGCCCTA CGCGGTGATC
TTCGACGACG TCGACCACTG GGCCAACATC CTGGCCTCGG CCGCGCTCAC CGACCTGACC
CCCGAGGAGC TGACCGGTGA CGGCCTGGCC TGGTACACGG GGCCGAGGGA GTGGCGGATC
GAGGACTTCG CGGACTCCAT CACCACGTTC GTGGTGACGC TGACCGGCAT CATCACCAAC
GCCCGGCGGC TGCGGGCGCT CAACCAGTTC CAGCGGCGGT AG
 
Protein sequence
MWWVGSWRPA VAVATAVFVL SSLVTALVVA WPGGEAHAAT PAAPAPTVPG GSVAAASDRS 
APVITNDVRL ELDHAGVLHG EETISFSGGA PATFTRSLTE TMLYDTEYDR KFEVTGVRAT
DLDGDALDAR VSRDDGSLSV EIPTEATSGV VLSYRVSGTV SEVSQGVQME WRAVGAYSHT
VETTDVTVTA PLPPGALSCL AGEPRSAMYC TASDMGADAG VAHFLQADME PSDRLDIVVN
YPPGTAEGEP ILTRRWSLAS AFAVTPATAS VFGLLLAVLL GGLVVLIRVR GRDERVLRDE
ASAGDHAPVA EGEHGRLRFA PPDDVHPGQI GTLVNETVDI TDLTGAVVDL AVRGYVRLEE
LPHEHFTSVD WRLVRLDGPP EDTLRSYEWL LLDALFGGRP TVRLSQVGSP RTSPDFPARI
DRVREELYRD MVRLKWFARS PSQVRGRWSA VGMAVTAAGV LLTGVLAVFT SAAFTGLAVI
IAGAAVTAGA QYMPAKTALG SSVYAHTVGF RDYLLSPRSA TAPPGQRVEL YSRYLPYAVI
FDDVDHWANI LASAALTDLT PEELTGDGLA WYTGPREWRI EDFADSITTF VVTLTGIITN
ARRLRALNQF QRR