Gene Ndas_3365 details

Gene Information

Locus tag: Ndas_3365
Symbol:
ID: 9247230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4021148
End bp: 4022356
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003681276
Protein GI: 297562302
COG category:
COG ID:
TIGRFAM ID:
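
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record itself):
    - coordinates are 1-based and inclusive
    - the CDS ends with one stop codon that is not translated
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Gene Information table
gene_len, protein_len = cds_lengths(4021148, 4022356)
print(gene_len, protein_len)  # 1209 402
```

Both results agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).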


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGCGGG AGAAGTACCT CCGCCACGTC TCCGGGGCCT CCGCGCCGGA GCTGTCCGAC 
CTCCCCACGC TCGACAGGGA CGAACTGGGC CGGGCGATCG ACACCCTGGT GCGCACCGAC
CCGTCCGCCC TGACCCGCGC CTCCCTGAAC GTCATGGGGG GCACCCGGTC GACCATGCGC
CTGGGGGCGG TCCCGGCGGA CCTGTACCTG GACGAGATCG CGCCGCACGT GCGCCCCTTC
GAACAGGGCG ACCTGTTCAC CACCCTCGGC ACCCCGTTCC ACATGCGGGC CTGCCAGGAG
CTGCACAACG GACTCGCCGC CAGGGCCGGA GTGCCCACAC TCTCCATGGA CGCGCCGACC
GACCAGATGA TCGACGCCTA CCTCGACCTG TTCGAGCGCC ACGGGGTCAA CGCCCTGGGC
ACCACCCTCG ACACCTTCCG GAGCCTGCTC CGTTACTGCG CCGCGTCCGG CCGGGACCTC
GGGTTCCTGC GCAAGGTGCT GTGGAGCGGT CCGGCCATGG ACGCCGCCAC CCGAGCGCTG
ATCCGGACAC ACTTTCCCCA CCTGCGCACA TGGGCGCTCT TCGGCTCGGC GGAGACCTGG
ATCATCGGGC ACAGCGGCCC GGACTGCGCC AACGACACCC TCCACCCGCT CCCCCACCAG
TACACGGAGA TCGTCGACGG GCGCATGCTG GTGACCGTCA CACACGAGAA GGCGGTCGTC
CCGCTGCTGC GCTACGAGAC CGGGGTCGCG GCCGAATGGA CGGCCTGCCC CTGCGGCCTG
CCCGGTCCCG CGGTGCGCAC CCACAGCCGC ATAGACGCCC CGATGGGGCC GCTCAGCCGC
GTGGTCTCCC CCCTCGACCT CGTGCCGCTG GCCCTGCGGC TCGACTCGGT GGAGGCGGCC
CAGGTCGTCC TGGTCGATCC CCACACCGAG GACGAACGGC TCCACCTGCG GGTCCGGCTG
CGCCCGGAGA CCAGGTCCGA GCTCTACACC GGCGAGTGGA TCCGGCAGCA CGTGCTGTCC
GAGTCGCTGG GGCTGTCCGA GGTGACGGAG GAGGCTCCGG AGTCCTTCGA GGTCATCGTC
TCCCGGCACA TGCTGCGGGA ACTCCCGGAC GGGTCGGCCC CCGAGTTCCT GGTGCGCGAG
GGGGGACGCC TCCGAATCCA ATCGATATCG AGTCAGGGTC AGGGTTCTTA TGGTACCTTC
TCGGCATAG
 
Protein sequence
MLREKYLRHV SGASAPELSD LPTLDRDELG RAIDTLVRTD PSALTRASLN VMGGTRSTMR 
LGAVPADLYL DEIAPHVRPF EQGDLFTTLG TPFHMRACQE LHNGLAARAG VPTLSMDAPT
DQMIDAYLDL FERHGVNALG TTLDTFRSLL RYCAASGRDL GFLRKVLWSG PAMDAATRAL
IRTHFPHLRT WALFGSAETW IIGHSGPDCA NDTLHPLPHQ YTEIVDGRML VTVTHEKAVV
PLLRYETGVA AEWTACPCGL PGPAVRTHSR IDAPMGPLSR VVSPLDLVPL ALRLDSVEAA
QVVLVDPHTE DERLHLRVRL RPETRSELYT GEWIRQHVLS ESLGLSEVTE EAPESFEVIV
SRHMLRELPD GSAPEFLVRE GGRLRIQSIS SQGQGSYGTF SA
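
The record lists translation table 11 (the bacterial, archaeal and plant plastid code), and the gene sequence begins with GTG while the protein begins with M. The sketch below illustrates how that can happen: table 11 assigns the same amino acids as the standard code, but alternative start codons such as GTG are read as methionine at the initiation position. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and this is a simplified illustration rather than the pipeline the database used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# The amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons of table 11; at position 0 they are translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codon read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence shown above
print(translate_cds("GTGCTGCGGG AGAAGTACCT CCGCCACGTC"))  # MLREKYLRHV
```

Running `translate_cds` and `gc_fraction` on the full gene sequence gives values that can be compared with the listed protein sequence and GC content.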