Gene Ndas_2704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2704 
Symbol 
ID9246555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3225463 
End bp3226863 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content77% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003680625 
Protein GI297561651 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.431608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCTGG GTTCGGGCGC GGACGAGGGG ACCGCGGTGA GCGTCGGCGG GGCGGGGCTG 
GACCGCGAGG GGCTGTGGTC GGCCGCGGCC GCCGTGGCCG AGCGGGTCGC GGGGGCGGAC
GCCGTCGCCG TGCACGGCGA GGCCTCGCTG TCCACGGTCG TCGCGGTGGT CGGCGGCCTG
CTGGCCGGGG TGCCCGTGGT CCCGGTCCCG GCGGACTCGG GGACCGCCGA GCGCCGCCAC
ATCGTGCGCG ACTCCGGCGC CGCGCTGTGG CTGGGCGCCC CGAGGGAGGA CGTGGACCTC
CCCGTCGTCC CGGTGGACCC GGCCGAGCGC TCCTCGTTCG CGCTCCCCGA ACCGCCGCCC
GAGTCCACCG CGCTGGTCAT GTACACCTCC GGGACCACCG GACCGCCCAA GGGCGCCCTC
ATCCCGCGCC GGGCCGTGGC CGCCGACCTG GACGCGCTCG CCGACGCCTG GGACTGGACG
CCCGACGACG TGCTGGTGCA CGGTCTGCCG CTGTTCCACG TGCACGGCCT GATCCTGGGC
GTGCTGGGGG CCCTGCGCGT GGGCAGTCCG CTGCTGCACA CCGTCCGCCC CACCCCCGCC
GCCTACGCGG CGGCGGCGCA GGGGACTCGG CGCGGAACCC TGTTCTTCGG CGTGCCGACG
GTGTGGTCGC GGATCGCCCG CGACCCCGAC AGCGCACGCG CCCTGTCCGG GGCGCGGCTG
CTGGTCTCGG GCAGCGCCCC GCTGCCCGAC ACGGTGGCCG ACGGCCTGCG GGGGGCGTGC
GGCCACAGCC CCGTGGAGCG GTACGGGATG ACCGAGACGC TGATCACCGT GGCGGCGCGC
GCCGACGCGC CCCGGCGCAC CGGCTGGGTC GGGACGGCGC TGCCGGGTCT GGAGACGCGG
CTGCGCGGCG AGCACGGGGA GCCCGTCGCC TCCGACGGCG AGAGCGTCGG CGAGCTCCAG
GTCCGCGGGG CCACCCTGTT CGGGGGCTAC CTCGGGCTGC CGGAGGCCAC GGCCGCGGCG
TGGACCGGGG ACGGCTGGTT CCGCACCGGC GACGCGGCGG TCCGCGACGG GGACGGCTGG
CACCGGATCG TGGGCCGGAT GTCGGTGGAC ATGATCAAGA CCGGCGGCTA CCGGGTCGGC
GCGGGCGAGG TCGAGGCGGT GCTGCTCGGC CATCCCGGGG TGATGGAGGC CGCCGTGGTG
GGCGAGGCCG ACGACGACCT CGGCCAGCGG ATCGTGGCCT ACCTGGTGGG CGAGGGCATC
TCCCCCGAGG CGGTCATCGA CTTCGTGGCC GAGCGCCTGT CGGTGCACAA GCGCCCGCGC
GAGGTGCGTG TGGTGGACAC GCTGCCGCGC AACGCGATGG GCAAGATCCA GAAGAAGCTG
CTGGGCAACG CGTCCGCCTG A
 
Protein sequence
MLLGSGADEG TAVSVGGAGL DREGLWSAAA AVAERVAGAD AVAVHGEASL STVVAVVGGL 
LAGVPVVPVP ADSGTAERRH IVRDSGAALW LGAPREDVDL PVVPVDPAER SSFALPEPPP
ESTALVMYTS GTTGPPKGAL IPRRAVAADL DALADAWDWT PDDVLVHGLP LFHVHGLILG
VLGALRVGSP LLHTVRPTPA AYAAAAQGTR RGTLFFGVPT VWSRIARDPD SARALSGARL
LVSGSAPLPD TVADGLRGAC GHSPVERYGM TETLITVAAR ADAPRRTGWV GTALPGLETR
LRGEHGEPVA SDGESVGELQ VRGATLFGGY LGLPEATAAA WTGDGWFRTG DAAVRDGDGW
HRIVGRMSVD MIKTGGYRVG AGEVEAVLLG HPGVMEAAVV GEADDDLGQR IVAYLVGEGI
SPEAVIDFVA ERLSVHKRPR EVRVVDTLPR NAMGKIQKKL LGNASA