Gene Ndas_2594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2594 
Symbol 
ID9246445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3089597 
End bp3091024 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content73% 
IMG OID 
Productprotein of unknown function DUF162 
Protein accessionYP_003680518 
Protein GI297561544 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.487179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0287829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGA CGTTCCTGGG CATCCCGCCC TTTCCCGAGG CCGCCGCGGG CGCGGTCAAC 
GACGCCCGCC TGCGGCACAA CCTGCGCAGG GCCACGCACA CCATCCGCGA CAAGCGCGCC
TCCGTCGTCG ACGAGCTGCA CGAGGACTGG CAGCGCCTGC GGGCGGAGGG CGCCGCGGTC
AAGGAGCACA CCCTGCGCCA CCTGGACCAC TACCTGGAAC AGCTGGAGGA GTCCGTCCAC
CGGGCGGGCG GACGGGTGCA CTGGGCCTCC GACGCGGCCG AGGCCAACGA GATCGTCACC
CGGCTGGTCC GGGACACCGG CGAGACCGAC GTGGTCAAGG TCAAGTCGAT GGCCACCCAG
GAGATCGAAC TCAACGACGC CCTCGCCGCC GCCGGGATCA CCGCCTACGA GACCGACCTG
GCCGAGCTGA TCGTGCAGCT GGGCGAGGAC CTGCCCTCGC ACATCCTGGT GCCCGCCATC
CACCTGGGCC GCGCCCAGAT CCGCGAGATC TTCCTGGAGC AGATGGCCGA GTGGGGCGTG
CCCGCGCCCC GGGGCCTGAC CGACGACCCC CGGGCGCTGG CCGAGGCGGC CCGCGTGCAC
CTGCGCGAAC GCTTCCTGCG CACCCGCACC GCGATCTCGG GGGCCAACTT CGCGGTGGCC
GACAGCGGCA CGCTCGTGGT GCTGGAGTCG GAGGGCAACG GGCGCATGTG CCTGACCCTG
CCCCGGACGC TGATCTCCGT GGTCGGCATC GAGAAGATCG TGCCGACCTG GTCGGACCTG
GAGGTGTTCC TCCAGTTGCT GCCGCGTTCC TCCACCGGCG AGCGGATGAA CCCCTACACC
TCCACCTGGA CCGGGGTGAC GCCGGGCGAC GGCCCCCAGG AGTTCCACCT GGTGCTGCTG
GACAACGGCC GCACCGACGT GCTGGCCGAC ACCGTGGGCC GCCAGGCGCT GCGCTGCATC
CGCTGCTCGG CGTGCCTGAA CACCTGCCCG GTCTACGAGC GCACCGGCGG GCACTCCTAC
GGTTCGGTCT ACCCGGGCCC GATCGGCGCG ATCCTCACCC CGCAGCTGCG GGGGATGTCC
TCGCCGGTGG ACGAGGCGCT GCCCTACGCG TCCTCGCTGT GCGGGGCCTG CTACGAGGTG
TGCCCGGTGG CCATCGACAT CCCCGAGGTG CTGGTGCACC TGCGCGAGGA GGTCGTGGAG
CGCTCCGGGC ACGCGGGGGA GAAGGCGCTC ATGGCGGGCG CCGAGGCGGT GCTGTCCTCC
CCGCGGACGC TGGGCGCGGT CCAGCGGGCG GCGGGGCTGG GGCGCCGCGC GGTCCCGCGC
CACCTGCCGG GCCTGGCGGG GGCCTGGACC GACACCAGGG ACGTGCCCGA CGTCCCGGCC
GAGTCCTTCC GCCAGTGGTG GGACAGGCGC GAGGGGGAGG GCCGATGA
 
Protein sequence
MSATFLGIPP FPEAAAGAVN DARLRHNLRR ATHTIRDKRA SVVDELHEDW QRLRAEGAAV 
KEHTLRHLDH YLEQLEESVH RAGGRVHWAS DAAEANEIVT RLVRDTGETD VVKVKSMATQ
EIELNDALAA AGITAYETDL AELIVQLGED LPSHILVPAI HLGRAQIREI FLEQMAEWGV
PAPRGLTDDP RALAEAARVH LRERFLRTRT AISGANFAVA DSGTLVVLES EGNGRMCLTL
PRTLISVVGI EKIVPTWSDL EVFLQLLPRS STGERMNPYT STWTGVTPGD GPQEFHLVLL
DNGRTDVLAD TVGRQALRCI RCSACLNTCP VYERTGGHSY GSVYPGPIGA ILTPQLRGMS
SPVDEALPYA SSLCGACYEV CPVAIDIPEV LVHLREEVVE RSGHAGEKAL MAGAEAVLSS
PRTLGAVQRA AGLGRRAVPR HLPGLAGAWT DTRDVPDVPA ESFRQWWDRR EGEGR