Gene Ndas_3196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3196 
Symbol 
ID9247053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3820194 
End bp3822062 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content71% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003681110 
Protein GI297562136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAACA CTTCATCCCC CTTTTCCTCC GTCCCCGGTA TGTTCCTCTC CCGCGTCGCC 
GAGTCCCCCG ACTCCGAGGC GTTCAGCTAC CCCCTTCCCG GCGCGTCCGG CGCCCCCGAG
AAGTGGGAGA CCCTCACCTG GTCCCAGACC CGCGACCGGG TGCGCGACAT CGCGCTGGGC
CTGCACGACC TGGGCGTGAC CCCCCAGGCC CGGTGCGCGA TCGTCTCCAG CACACGCGTG
GAGTGGATCC TCGCCGACAT GGCCGTGCTG TGCGCGGGCG GAGCCTCCAC CACCGTCTAC
CCCTCCTCCA CCCCGCCCGA CTGCGCCTTC ATCGTCTCCG ACTCGGGCAG CATGGTGGCC
TTCGCCGAGA ACGACGAGCA GGTCGCCAAG CTGGTCGACC AGCGCGCGCG GATGCCGGGG
CTGTCGCGGG TGATCGCCTT CGAGGGCGCG GCCGGGGGCG ACGACGACTG GGTGATGTCC
CTGGAGGAGC TGGCCGCCCG GGGCGCCGAA CTCCACGCCC AGGACCCCGA CCTGTTCGAG
GCGCTGGTCG CGGCGGTCGC ACCCGACCAC CTGGCCACGC TGATCTACAC CTCCGGCACC
ACGGGCAGAC CCAAGGGCGT GCGGCTGGAC CACGCGAACT GGCTCTACGA GTCCCAGGCC
ATGGCGGACC TGGACACCGA GCTGCGCGCG CAGGGCTGGG ACCTGATGGG CCCCGACGAC
GTCCAGTACC TGTGGCTGCC GCTGTCGCAC GTGTTCGGAA AGCTCATGCA GGTCAGCCAG
CTGCGGATCG GCTTCCGGAC CGCCGTGGAC GGCCGCCCCG ACCGGATCGT GGACAACCTG
GCCGTCGTGC GGCCCACCTT CATGGCCGCC GCCCCGCGCA TCTTCGAGAA GGTGTACAAC
CGCGTGGTCA TGCAGGCCCG GGAGGGCGGC GCGGCCAGGT ACCGGATCTT CCGGTGGGCC
GCGGGGGTGG GCGACCGGGT CGCCCGGCTC AGGGAGGAGG GCCGCGAACC CACGGGCCTG
CTCGCCGCGC AGCACCGGGC CGCCGACAGG CTCGTGTTCG CCAGGCTGCG GGCGCGCTTC
GGGGAGCGGC TGAAGTTCTT CATCTCCGGC AGCGCGCCGC TGTCCCCCGA GATCGGCCGG
TTCTTCTACG GCGCCGGGAT CGTCATCCTG GAGGGTTACG GGCTCACCGA GACCAGCGCC
GGGACGTTCG TCAACCGGCC CGGCGACGTG CGGTTCGGCA CGGTCGGACT GCCCATGCCC
GGTACCGAGG TGCGCATCGC CGAGGACGGC GAGATCCTCA TCCGGGGCGG CGGCGTGATG
CGCGGCTACC ACAACCTCTC CGGCGCCACG GAGGAGGTGC TCACCGCGGA GGGCTGGTTC
GCCACCGGGG ACATCGGCGC CCTGGAGAAC GGCCGCCTGC GCATCACCGA CCGCAAGAAG
GAGCTGATCA AGACCTCCGG CGGCAAGTAC GTGGCGCCCC AGAGCATCGA GAGCCGCTTC
AAGGCGCTGT GCCCCTACGT CAGCAACCTG GTCGTGCACG GCGACCGGCG CCCCTACTGC
GTGGCGCTGG TGGCCCTGGA CCCCGAGGCG ATCGACGCGT GGGCGGCGGA GCACGGACTG
GGCCACCTCG ACTACACGGG CCTGACGCGG GAGCCCAGGG TCCGGGAGAT GGTGCAGGAG
GCCGTCGACG AGCTCAACCG TGGTCTGCCC CGGCACGAGA CCGTGAAGAG GTTCGCGATC
CTGCCGAGCG ACCTGACGGT CGAGGAGGGC GAGATGACGC CCAGCCTGAA GATGCGGCGG
CGGGCGGTCG AGGACAAGTA CCGCGACCTG CTCGACGGCA TGTACGAGGA GACCGTCCAC
CGGTTCTGA
 
Protein sequence
MGNTSSPFSS VPGMFLSRVA ESPDSEAFSY PLPGASGAPE KWETLTWSQT RDRVRDIALG 
LHDLGVTPQA RCAIVSSTRV EWILADMAVL CAGGASTTVY PSSTPPDCAF IVSDSGSMVA
FAENDEQVAK LVDQRARMPG LSRVIAFEGA AGGDDDWVMS LEELAARGAE LHAQDPDLFE
ALVAAVAPDH LATLIYTSGT TGRPKGVRLD HANWLYESQA MADLDTELRA QGWDLMGPDD
VQYLWLPLSH VFGKLMQVSQ LRIGFRTAVD GRPDRIVDNL AVVRPTFMAA APRIFEKVYN
RVVMQAREGG AARYRIFRWA AGVGDRVARL REEGREPTGL LAAQHRAADR LVFARLRARF
GERLKFFISG SAPLSPEIGR FFYGAGIVIL EGYGLTETSA GTFVNRPGDV RFGTVGLPMP
GTEVRIAEDG EILIRGGGVM RGYHNLSGAT EEVLTAEGWF ATGDIGALEN GRLRITDRKK
ELIKTSGGKY VAPQSIESRF KALCPYVSNL VVHGDRRPYC VALVALDPEA IDAWAAEHGL
GHLDYTGLTR EPRVREMVQE AVDELNRGLP RHETVKRFAI LPSDLTVEEG EMTPSLKMRR
RAVEDKYRDL LDGMYEETVH RF