Gene Ndas_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1987 
Symbol 
ID9245837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2407986 
End bp2409644 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content78% 
IMG OID 
Productputative integral membrane protein 
Protein accessionYP_003679919 
Protein GI297560945 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.285044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000714658 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATTACT TTTATCGCCT GTGGGGTGCG CTACGCCGCA CCCTCACCAC CCGCGGGGCC 
TTCGCGCCCA GGCCCGTGCG CGGCAACGTC GCCGTCACCG CGCTGCGCGC CGGAGTGTGC
GTGGCCCTGC CCCTGCTGGC GCTGCACGCC GCGGGGCGCA TCGACCTGGC CCCCTACGCC
GCCATGGGCT GCTTCACCGC CCTGTACGCG CGCGACGACA CCTACGCCCG CCGCGCCCGG
CTCCTGGCCG TGGTCGGCGC GACCCTGACC CTGGCCGTGG CCCTGGGCGC GCTGGTCTCG
GCCGCGCTCC CCCACCCCCT GGCCGCCATC ACGGTCGTGG CCCTGGTCGC CGGAGGGGCC
AAGTACCTCT CCGACGCCCT GGAGTTCGGC GCCCCGGCCG GGCTGATGTT CGTCTTCGCC
GCCGGGGTGG CCGCCTACAA CCCCCAGCCC CTGACCGCGC TGCCGGTGGT GGCGGCCACC
ACCGCGGCCG CGGCCGCCCT GTGCTGGGCC CTGGCCCTGG TCGGCGCCCT GGTCCATCCC
ACCGCCCCCG AACGCCTGGC GGTCGCCCGC GCCCTGCACG CCCTCGCCCG CCACCTGCGC
CACCCCGCAC CCCCCGCCCG CACCGGTGCC GAGACCGCCC TCCACCAGGC CTGGCACGTC
CTGCTGTCCT CCCCCGGGCA CACCCCGACC CGCCAGGCCC TGGAGGTCCT CACCGCCCGC
GCCGAAACCC TGCTCACCGG AACCGGAGAC CGCGTCACCG ACGCCCGCGC CGCCGCCGAG
ATGAGCGAAC TGGCCCGCCG CCTGCGCACC GAACGCGCCG TCGAACCCCT GGTCGGCGCC
GACGAGCACG CCGCCCTGTC CCAGGCCGCC GCACACCTGC GCCGCCACCA CGCACCCCCG
CCCGGGGCGG TCGAGCGCCT GCGCGCCGCA CTGCGCGCGC CCTCGCCCAC CCCGGTGTCG
GTGGTGCGCA TCGTCCTGGC CTGCCTGCTG GCCGGTGCCC TGGCCTGGGC GCTGGGCATG
GGACACGGCT ACTGGGCCGC CGTCTCGGCC GGATCGGTCC TGCAGGCCGT CAACGTCACC
ACCACCTGGC ACCGCACCCT GCAACGCGGC GCGGGCACGG TCGTGGGCGT GGCGCTGGCC
GCCGTGCTCT TCAGCCTGGA CTACTCCCCG CTGGGCGTCA TCGCGCTGGT GGTGGCGTGC
CAGATGGCGG CCGAACTGGT GGTGACCACC AACTACTCCT ACGCCATCGT GTTCGTCACC
CCCCTCACCC TGGCCCTGTC GGGCCTGGCC GACGCCGACC CGGGCGCGGA CGGGCTGGCG
GCCGAGCGGC TGTGGGCCAC CGTGCTGGGA GCCGCGGTCG GGCTCCTGGT GTGCGCGGCG
GTCGCCAACC GCCGCGCCGG GGACCACCTG AGCCAGGCCC TGGCCGCGTG CGAGCGCGAA
CTCGCCCGGG CGCGGGAGTG CGCGGGCGCC GACGCGGCCG CGCACCGACG CCTGGCGCGC
AGCCTGGTGG CGCTGCGCGC CTCCCACGCC CTGGCCGAGG GGGAGCCGTG GCTGGCCGGG
GCGTGCGCGC GGGAGGTCCA GGACGTGGAA CGCCGCGCCC GCGCCCTGCT GGACGGGGCC
GCCCTGCCCG GCCGAGCGGT CGGCGCCCTT CCGGAGTGA
 
Protein sequence
MNYFYRLWGA LRRTLTTRGA FAPRPVRGNV AVTALRAGVC VALPLLALHA AGRIDLAPYA 
AMGCFTALYA RDDTYARRAR LLAVVGATLT LAVALGALVS AALPHPLAAI TVVALVAGGA
KYLSDALEFG APAGLMFVFA AGVAAYNPQP LTALPVVAAT TAAAAALCWA LALVGALVHP
TAPERLAVAR ALHALARHLR HPAPPARTGA ETALHQAWHV LLSSPGHTPT RQALEVLTAR
AETLLTGTGD RVTDARAAAE MSELARRLRT ERAVEPLVGA DEHAALSQAA AHLRRHHAPP
PGAVERLRAA LRAPSPTPVS VVRIVLACLL AGALAWALGM GHGYWAAVSA GSVLQAVNVT
TTWHRTLQRG AGTVVGVALA AVLFSLDYSP LGVIALVVAC QMAAELVVTT NYSYAIVFVT
PLTLALSGLA DADPGADGLA AERLWATVLG AAVGLLVCAA VANRRAGDHL SQALAACERE
LARARECAGA DAAAHRRLAR SLVALRASHA LAEGEPWLAG ACAREVQDVE RRARALLDGA
ALPGRAVGAL PE