Gene Ndas_1934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1934 
Symbol 
ID9245784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2356884 
End bp2357978 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content78% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003679867 
Protein GI297560893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.367322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACCACC ACACCACTCC CGAAGAGGAC CGTGCCGGTC TCCCCGAGCC CCGCCTGGTC 
CTGACCCGCT CCGGTTTCGG CGCGGCCGAG CCCCTGTTCA GCGCCGATCT GCGGGCGAGC
GCCGACCTCG CCCAGGCACG GGAGGCGGTC GCCGCGCACG CCTCCCGGCT GTGGACGGAG
GCCGCCCGCG ACGGCGCCGA CGACCGCCCC CTGTACTGGG CGCGGCTGGT GCTCTCGGCC
CGGCTGCGCT CCTGGCGGCC GGACTTCGCC GTGTCCGACG CCGACCGCGC CGACCTGCTG
CACCTGCTGG AGACGGGCTC GCGCGGCATC GCCGACCTCG ACTTCCCGCC GGGCGACCGG
TGGGTGCGGG TGGTCGTGAC CGGCTTCGAC CCCTTCCGCC TGGACGAGGA CCCGGCGTGC
TCCAACCCCT CCGGCGCGGC GGCGCTGGAC CTGAACGGGT GGACGTTCCC CGTGGGCGGG
CGCACGGCCG TGGTGCGCAC CGCGGTGTTC CCGGTGCGCT GGGCGGACTT CGACGCCGGG
CTGGTGGAGG AGGCGCTGGC CGAGCAGCAC CGGCGCGCGG ACGCGGTGGT CACGCTCAGC
CGCGGGCGGC CGGGCCGCTT CGACCTGGAG GTGTGGAACG GCGTCTGGCG CGGCGGGGGC
GAGGACAACC TGCGGGTGTC GCGCACCGGC CGGGCACCGG TGCCGGGCCC CGGCGTCCCG
GAGTGGACGC TCTCCTCGCT GCCCCACGAA CGGATCCTGG CGGCGGCGCG GGAGGACCCC
TACCCGGTGG TGGTCAACAC CGAGGTGGCC GAGGCCGCGC CGGACGGCCG CGGGGTGGTG
GTGCGCCCGG ACGGCCCCAC GGAGGGGTCG GCGGCGCGCC GCGGGGGCGG CGGGGACTAC
CTGTCCAACG AGATCGCCTA CCGCAACACG CTGCTGCGGG AACGGGCGGG GCGGCGGATC
GCGGCCGGGC ACGTGCACCT GCCCAAGGTG GCCTCCCCTC GGGAGAACGC CGAGGTCCTG
GCGCAGGTCC GGCGGATCGT GGCGGCCGTG GCCGCCGACG CGGCCGAGCC CCGGGAGGAG
GCGGGGCGGG GCTGA
 
Protein sequence
MHHHTTPEED RAGLPEPRLV LTRSGFGAAE PLFSADLRAS ADLAQAREAV AAHASRLWTE 
AARDGADDRP LYWARLVLSA RLRSWRPDFA VSDADRADLL HLLETGSRGI ADLDFPPGDR
WVRVVVTGFD PFRLDEDPAC SNPSGAAALD LNGWTFPVGG RTAVVRTAVF PVRWADFDAG
LVEEALAEQH RRADAVVTLS RGRPGRFDLE VWNGVWRGGG EDNLRVSRTG RAPVPGPGVP
EWTLSSLPHE RILAAAREDP YPVVVNTEVA EAAPDGRGVV VRPDGPTEGS AARRGGGGDY
LSNEIAYRNT LLRERAGRRI AAGHVHLPKV ASPRENAEVL AQVRRIVAAV AADAAEPREE
AGRG