Gene Ndas_1398 details


Gene Information

Locus tag: Ndas_1398
Symbol:
ID: 9245248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1712806
End bp: 1713912
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 78%
IMG OID:
Product: protein of unknown function DUF6 transmembrane
Protein accession: YP_003679336
Protein GI: 297560362
COG category:
COG ID:
TIGRFAM ID:
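
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1712806, 1713912)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```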


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.509319
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000513734
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGAACACA CCGAGACACG TCCCCGCCGC TCCGTACCGC CCCTCCTGCT CGCCGCCGGA 
CTGGTGGTGA TGTGGAGCTC CGGCTTCGTC GGCGCGGATC TGGGCACCCG GTACGCGCCC
GCGACCACGT TGCTGGCCTG GCGTTTCCTG GTCGTGGCCG CCCTGCTGGC GGGGTGGTGG
CTGTGGCGGG GCCCGCGGAT GTCCCGGCGG GACCTGGCGG CGCACGCCGT GCTGGGCCTG
CTGGCCCAGT CCGGGTACCT GTACGGGGTG TTCGCCGCCG CGCAGGCGGG CGTGGCCGCC
GGGACGAGCG CGCTGGTGGC CGCCCTCCAG CCCCTGGTGG CGACCGCCCT GGCCGTCCCG
CTGCTGGGCG AACGGGTGCG GCCGCGCCAG CTGGCGGGCC TCGCGCTGGG GCTGGGCGGG
GTCGGCCTGG TGGTGGGCGC GGACCTGTTC CGGCCGGGCG CGGCGCCGTG GTGGGGCTAC
CTGCTGCCGT TCGGGGCGAT GCTGTCCCTG GTGGCCGCGA CCCTGCTGGA GCGCCGCGCG
CGGCCCGGGG GCTCGGTGGT GCAGGCCCTG GCGGTGCAGT GCGCGGTGAG CGCGGTGCTG
TTCACGGGGC TGGCCGCGGT CACCGGGACG CTGGCGCCGC CCGCCGACCC CGGGTTCTGG
GCGGCGGTGG TGTGGGTGGT GGTGCTGTCC ACCCTGGGCG GCTACGGCCT GTACTGGGCC
GTCCTGGCCC GCTCGGGCGT GGCCCGGGTG TCGGCCCTGC TGTACCTGAC CCCGCCCACC
ACGCTGGTGT GGTCGTGGCT GATGTTCGGC GATCCCGTGG GGCCAGCCGC CCTGGCGGGG
ATGGCGGTGT GCGCGGTGGC CGTGGTGCTG GTGAGCACCG GGGGAACCGG GAGCCGGGCC
GCCCGGACGG ACGCGAAGGC TACCGGCGCT CCCGACCGGG AACCGAAACG CACGGCCGGT
TCCACCACGA GCACCCCCGA CCGGCTACCG GGGCGACGGG CCGAGGCCGC CGCCGCTGCT
CCCGCCCGGG CCCCGGAACG CGGCACCGGG GGCGCTACGG ACACTCCCGG GCCGGAAGCG
GACCGACCCC GTCCGAGGAG ACGGTGA
 
Protein sequence
MEHTETRPRR SVPPLLLAAG LVVMWSSGFV GADLGTRYAP ATTLLAWRFL VVAALLAGWW 
LWRGPRMSRR DLAAHAVLGL LAQSGYLYGV FAAAQAGVAA GTSALVAALQ PLVATALAVP
LLGERVRPRQ LAGLALGLGG VGLVVGADLF RPGAAPWWGY LLPFGAMLSL VAATLLERRA
RPGGSVVQAL AVQCAVSAVL FTGLAAVTGT LAPPADPGFW AAVVWVVVLS TLGGYGLYWA
VLARSGVARV SALLYLTPPT TLVWSWLMFG DPVGPAALAG MAVCAVAVVL VSTGGTGSRA
ARTDAKATGA PDREPKRTAG STTSTPDRLP GRRAEAAAAA PARAPERGTG GATDTPGPEA
DRPRPRRR
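
The protein sequence above follows from the gene sequence under translation table 11, whose amino acid assignments are the same as the standard code. As a hedged illustration, the Python sketch below builds that codon table and translates the first and last lines of the gene sequence. The names `CODON_TABLE` and `translate` are illustrative only. The sketch ignores alternative start codons, which is safe here only because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11,
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence (first three blocks of its first line).
print(translate("ATGGAACACA CCGAGACACG TCCCCGCCGC"))  # MEHTETRPRR
# Final 27 nt of the gene sequence, ending in the TGA stop codon.
print(translate("GACCGACCCC GTCCGAGGAG ACGGTGA"))     # DRPRPRRR
```

Passing the full 1107 bp gene sequence to `translate` in the same way should give the full protein for comparison with the listing above.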