Gene Ndas_0429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0429 
Symbol 
ID9244268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp519424 
End bp521142 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content72% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003678382 
Protein GI297559408 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.312463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAGAG AGACGGCAGC CGCGGGCGCG GCCGAGTTCC GCGCCGCCCG GGACCTGCTC 
CTGGACCTGC GCGAGGACCA GGGGAGGGCG CACCGGGAGT TCACCTGGCC CCGGTCGGAC
CGGTTCAACT GGGCGCTGGA CTACTTCGAC GAGGTGGCCG GGCGCCGTCC GGACCGGACG
GCGCTGTGGA TCGTGGAGGA GGACGGCGGC CAGGCCAGGT ACACCTACCG GCGGATGTCG
GAGCGCTCCG CCCAGGTGGC CAACTGGCTG AACAACCAGG GCGTGCACCC GGGCGACCGG
ATCCTGCTGA TGCTGGGCAA CCAGGTGGAG CTGTGGGAGA CCCTGCTCGC CGCGACCAAG
CTCGGCGCCG TGGTCTGCCC CACCCCCACC TCCCTGTCCG AGGGCGACCT GCTGGACCGG
CTGGAGCGCG GCGAGATCGC GCACGTGGTG TGCTCGGCCG CCGAGACCGA GAAGTTCGCC
CCGCTGCGGG GGCACTGGAC GCGGATCTGC ACGGGCTACA TGGAGGGGTG GCTGAACTAC
GCCGACTCCG AGCACGCCGG TCTGGACTTC CACCCCCCGC ACACCACCCA CCCCGACGAC
CCCCTGCTGC TGTACTTCAC CTCCGGCACC GCCACCCTGC CCAAGCTGGT CGTGCACACC
CAGCGCTCGT ACCCGGTGGG GCACCTGTCG ACCATGTACT GGCTGGGTGT GCGGCCGGGC
GACGTCCACC TCAACGTGTC CGAGCCGGGG TGGGCCAAGC ACGCCTACGG CAGCGTGTTC
GCGCCGTGGA ACGCCGAGGC GACGGTGCTG GTCGTCAACC AGGAGCGCTT CGACGCGGCG
GGGCTGCTGG ACGCGATCGT GCGCTGCGGG GTGGACACGC TGTGCGCGCC GCCGACGGTG
TGGCGGACTC TGGCGCAGGC CGACCCCGCC GCGTGGGACG TGGGGCTGCG CGAGGCGGTG
GCCGCCGGGG AGCCGCTCAA CCCCGAGGTG GTGGACCGCG TGCGCGAGGC ATGGGGCGTG
ACCGTGCGCG ACGGGTTCGG GCAGACCGAG ACGACCGTAC TGCTGGGCAA CGGCCCCGGT
CAGCGCGTGG TGCGCGGGTC GATGGGGCGG GAGATGCCGG GCTACGACGT GGTGCTGACG
GACCCGGCCA CCGACGAGCC CGCCGACACC GGGCAGATCT GCGTGGACCT GGCCCGGGAG
CCGGTGGGGG TGATGAAAGG CTACGCCGAC AACCCCGGCC TCAGCCGCGA GGTGGTCCGC
GGGGACCGCT ACCGCACCGG TGACATCGCC AGCCGGGATT CGAACGGTTA CATTACCTAT
ATCGGTCGAT CCGACGATGT GTTCAAGGCC TCAGATTACC GCATCTCACC ATTCGAACTC
GAAAGCGTTC TGGTCGAGCA CGAATACGTG GTCGAGGCGG CCGTCGTTCC CTCCCCCGAC
CCGCTGCGGC TGGCGGTGGC CAAGGCGTAC GTGGCCCTGG CCGAGGGGGT GGCCCCCGAC
GCCGAGACCG CGCGGTCGAT CCTGGCCCAC GCGCGCGAGC GCCTGTCACC GCACCAGAGG
GTTCGCCGCC TGGAGTTCGG TGAGCTGCCC AAGACCGTCT CCGGTAAGAT CCGTCGCGTG
CAGCTGCGTC GGGCCGAGGC CGAGCGCGGC ACGGTCGCCG ACGGCGCGCG CAACCCGCGC
GAGTACTGGG AGGAGGACCT CCCGGGGTTG GAGCGGTGA
 
Protein sequence
MSRETAAAGA AEFRAARDLL LDLREDQGRA HREFTWPRSD RFNWALDYFD EVAGRRPDRT 
ALWIVEEDGG QARYTYRRMS ERSAQVANWL NNQGVHPGDR ILLMLGNQVE LWETLLAATK
LGAVVCPTPT SLSEGDLLDR LERGEIAHVV CSAAETEKFA PLRGHWTRIC TGYMEGWLNY
ADSEHAGLDF HPPHTTHPDD PLLLYFTSGT ATLPKLVVHT QRSYPVGHLS TMYWLGVRPG
DVHLNVSEPG WAKHAYGSVF APWNAEATVL VVNQERFDAA GLLDAIVRCG VDTLCAPPTV
WRTLAQADPA AWDVGLREAV AAGEPLNPEV VDRVREAWGV TVRDGFGQTE TTVLLGNGPG
QRVVRGSMGR EMPGYDVVLT DPATDEPADT GQICVDLARE PVGVMKGYAD NPGLSREVVR
GDRYRTGDIA SRDSNGYITY IGRSDDVFKA SDYRISPFEL ESVLVEHEYV VEAAVVPSPD
PLRLAVAKAY VALAEGVAPD AETARSILAH ARERLSPHQR VRRLEFGELP KTVSGKIRRV
QLRRAEAERG TVADGARNPR EYWEEDLPGL ER