Gene Ndas_2146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2146 
Symbol 
ID9245996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2567521 
End bp2569074 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content72% 
IMG OID 
Productprotein of unknown function DUF112 transmembrane 
Protein accessionYP_003680074 
Protein GI297561100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTTC TCACCAACCT CATGGCGGGG TTCGCCAACG CGCTGAGCCC CGAGTACCTC 
TTCCTGGCGC TGCTGGGCGT CATGCTCGGC ACGGCCGTGG GCGTGGTGCC CGGCCTGGGC
TCCTCCATGG CCGTCGCGCT GCTGCTGCCG GTCACCTTCC ACCTCGACCC CACCGGCGCG
TTCATCATGT TCGCCGGTGT GTACTACGGC GGGCTGTTCG GCGACTCCAC CGCCGCCATC
CTGATGAACA CGCCCGGACA GGCCTCGGCC ATCGCCACCG CCATCGAGGG CCACAAGATG
GCCAAGCGCG GGCGGGCTCC GCAGGCTCTG GCCACGGCGG CGATCGGCGC GTTCATCGGC
GCGATCGTGT CCACGGCGCT GGTGGCGTTC TTCTCACCGG TCATCGTGCA GCTGGCGCTG
CAGTTCGGTC CCGCCGAGTA CTTCGCGCTC ACCGTCTTCG CGTTCGTGGC CACCTCCGCG
GTGGTCTCGG ACTCGGCCGT GCGCGGCCTG ATCGCCCTGG GCATCGGCCT GGCCATCTCG
ATGGTGGGCA TCGACGGCCT CAGCGGCGCC GAGCGCTTCA CCCTGGGCGT TCCCCAGCTC
TTCGAGGGCT TCAGCATCAT CACGGTGACC GTGGCGCTGC TGGCGATCGG CGAGGTGCTG
CACGTGGCCG CCACCGCCCA CGCGGGCGAC GCGGCCAGCG GTGGCCTGCG GCGCACCGGC
ACTCCCTGGC TGAGCCGACG CGACCTGCGG CGCACGCTTC CGGCCTGGAT GCGCGGAACC
CTGTTCGGCC TGCCGTTCGG GTCCATCCCG GCGGGCGGCT CGGAGATCCC GACCTTCCTG
GCCTACGGCA CCGAGCGCAA GCTCGCCGCG CGGCGCGCCC GGCGCGGCAA GGGCGAGGAC
GAGTTCGGCG ACGGCGCGAT CGAGGGCGTG GCCGCGCCGG AGGCGGCCGC CAGCGCCACC
GCCGGAACCG CGATGGGCAC CCTGCTGGGT CTGGGCCTGC CCACGTCGGC GACCGCCGCG
ATCATGCTGG CCGCCTTCCA GCAGTACGGG ATGCAGCCCG GTCCGCTGCT GTTCGAGCGC
GACGGCGACC TGGTCTGGGC GCTGCTGGCC AGCCTGTTCA TCGGCAGCGT CATGCTGCTC
ATCCTCAACC TGCCCTTCGC GCCGCTGTGG GCCAAGCTGC TGCTGGTCCC CAAGCCGTAC
ATGTACGCCG GGATCGCGGT GTTCTCGTCC CTGGGCGTGT ACGCGGCCGC GTCGTCCATG
GTGGACCTGT GGCTGATGCT GATCCTGGGC CTGGTGGCTC TGATGATGCG CCGCTTCGAC
ATTCCGCTGG CTCCGGTGCT GATCGCGGTC GTCCTGGGCC CGGTGGCCGA GACGGAGCTG
CGCCGCGCCC TGGCCGTGGG GCAGGGCGAC GTGTCGGTGC TCGTGGACAG CGGGATCACC
CTGGGCATCT ACGGGCTGCT GCTGACGATC GGCCTGGTCG TGGCCGTCGG CCGACTGCGC
GCCCGGCGGG CCGCGCGTAA GGACCACACC CCGGTGGAGG CGGGCCGCGG CTGA
 
Protein sequence
MDVLTNLMAG FANALSPEYL FLALLGVMLG TAVGVVPGLG SSMAVALLLP VTFHLDPTGA 
FIMFAGVYYG GLFGDSTAAI LMNTPGQASA IATAIEGHKM AKRGRAPQAL ATAAIGAFIG
AIVSTALVAF FSPVIVQLAL QFGPAEYFAL TVFAFVATSA VVSDSAVRGL IALGIGLAIS
MVGIDGLSGA ERFTLGVPQL FEGFSIITVT VALLAIGEVL HVAATAHAGD AASGGLRRTG
TPWLSRRDLR RTLPAWMRGT LFGLPFGSIP AGGSEIPTFL AYGTERKLAA RRARRGKGED
EFGDGAIEGV AAPEAAASAT AGTAMGTLLG LGLPTSATAA IMLAAFQQYG MQPGPLLFER
DGDLVWALLA SLFIGSVMLL ILNLPFAPLW AKLLLVPKPY MYAGIAVFSS LGVYAAASSM
VDLWLMLILG LVALMMRRFD IPLAPVLIAV VLGPVAETEL RRALAVGQGD VSVLVDSGIT
LGIYGLLLTI GLVVAVGRLR ARRAARKDHT PVEAGRG