Gene Ndas_3329 details

Gene Information

Locus tag: Ndas_3329
Symbol:
ID: 9247191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3979795
End bp: 3980883
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003681241
Protein GI: 297562267
COG category:
COG ID:
TIGRFAM ID:
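
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the gene record above.
START_BP = 3979795
END_BP = 3980883
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1089 bp, which is 363 codons, one of them the stop.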


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCACAC GTGCCTCCTC CCCCGCCGCC CGCTGGCGCA CCGCACTGCT CGGTGCGGCG 
AGCGCCGTCC TGTTGACCGC CTCCTGCGCC CCCGCCGACG ACGGGGCCGC CGAGGCGCCG
GGGGTCGGCG GCGTCGAGCG CGAGGTGGCC TTCGAGGGCG GCGGGCACGA GGTGTCCGGC
ACCTTCCGGA TGCCCCAGGA CGTCTCCGGT ACCGTTCCGG GCGCGCTCAT CATCTCCGGC
AGCGGTCCCA CCGACCGCGA CGGCAACAGC TCCGTGCGCC CCGACGCCAA CACCAACCAG
AACCTGTCCC GGGTGCTCGC CGAGGTCGGG GTGGCCTCGC TGCGCTACGA CAAGTACGGC
AGCGGTGCGG ACTTCGAGAC GCCGGACCCC GGCGGCGAGG TCGCGCCGGT GGACCCGGTG
ATCTTCGACG AGCAGGTCGC GGCCGCCTAC GAGGAGCTCG TCTCCCAGCC CGAGGTGGAC
CCCGAGCGGG TGGTCGTCGT CGGGCACAGC GAGGGCGCCC TGTACGCGCT GCGCGCCCAC
GATCTCATCG GCGCGGAGCA CCCCTCCCCC GCGCTCGTGC TGGCCGCGCC TCCCGGTACG
CGCTACCTCG ACCTGATCGA CCGCCAGCTC ACCGAGCAGG CGCGTGCCGT CGAGGCCTCG
GGTCAGGTGG AGGAGGGACA GGTGGTCCAG CTCCTCTCGG ACTCCCGCGC CGGACGGGCC
GCGATCAGGT CCGGGCGTGC GCTCAGCGGG GTGGAGATGC CCAGTGGGGG CCTGGGCGTC
TACACCCCGG TCAACGAGGA CTTCCTGGCC TACATGGACG CCTTCGACCC CGTCGACCTG
GCGGAGGACC TGCCCGGGGG CACCCCGGTG CTGGTGCTGT GGGGGGAGCG GGACGCGCAG
GTCTCCCGGC AGGACGTGGA CCGGCTGATG ACCGGTCTGG ACGGCGCGCG ACGGGTGGAC
GTCCCCGACA CCGACCACAT CTTCCGCCGC TACCGGGACG AGCCCGGGGC CACGGTCCTG
GACGAACAGC GCCCCTTCGC CGAGGAGGTC GCGCCCGCGC TGGAGGAGTT CCTGGGCACG
GCCTGGTAA
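
The gene sequence is printed in blocks of 10 bases separated by spaces, so it has to be joined before any analysis. As a minimal sketch, the helpers below (hypothetical names, standard library only) strip the whitespace and compute the GC fraction that the "GC content" field summarizes. Only the first printed row is used as sample input here; paste the full sequence to compare against the reported value.

```python
def normalize(seq_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence shown above.
    first_row = (
        "GTGTTCACAC GTGCCTCCTC CCCCGCCGCC "
        "CGCTGGCGCA CCGCACTGCT CGGTGCGGCG"
    )
    print(len(normalize(first_row)))
    print(round(100 * gc_fraction(first_row)), "% GC in the first row")
```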
 
Protein sequence
MFTRASSPAA RWRTALLGAA SAVLLTASCA PADDGAAEAP GVGGVEREVA FEGGGHEVSG 
TFRMPQDVSG TVPGALIISG SGPTDRDGNS SVRPDANTNQ NLSRVLAEVG VASLRYDKYG
SGADFETPDP GGEVAPVDPV IFDEQVAAAY EELVSQPEVD PERVVVVGHS EGALYALRAH
DLIGAEHPSP ALVLAAPPGT RYLDLIDRQL TEQARAVEAS GQVEEGQVVQ LLSDSRAGRA
AIRSGRALSG VEMPSGGLGV YTPVNEDFLA YMDAFDPVDL AEDLPGGTPV LVLWGERDAQ
VSRQDVDRLM TGLDGARRVD VPDTDHIFRR YRDEPGATVL DEQRPFAEEV APALEEFLGT
AW
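
The protein sequence follows from the gene sequence under translation table 11. That table assigns the same amino acids as the standard code but allows alternative start codons, which is why the leading GTG appears as M rather than V. The sketch below is a simplified, dependency-free illustration: the start-codon set is limited to ATG, GTG and TTG (an assumption, not the full table), and the function names are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments in TCAG order (shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a common subset of bacterial start codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, bacterial_start: bool = True) -> str:
    """Translate a coding sequence up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and bacterial_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 21 bases and last 9 bases of the gene sequence above.
    print(translate("GTGTTCACAC GTGCCTCCTC C"))  # MFTRASS
    print(translate("GCCTGGTAA"))                # AW
```

Both fragments agree with the start (MFTRASS...) and end (...AW) of the protein sequence listed above. For real work, a maintained library such as Biopython is a safer choice than a hand-built table.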